Prevent ChatGPT from Scraping Your Site

August 11th, 2023 • filed under Programming

A short but sweet post. Ready?

OpenAI quietly published the crawler name/user agent for ChatGPT, creatively named GPTBot.

Since we know the user agent, now, we can effectively prevent it from crawling a site using robots.txt like so:

User-agent: GPTBot
Disallow: /

IP Ranges

OpenAI was also generous enough to provide a list of IP ranges their crawler will connect from, so go ahead and add these to your firewall rules, too: