~/home/post/no-gpt-scrapey-scrapey

Prevent ChatGPT from Scraping Your Site

August 11th, 2023 • filed under Programming

A short but sweet post. Ready?

OpenAI quietly published the crawler name/user agent for ChatGPT, creatively named GPTBot.

Since we know the user agent, now, we can effectively prevent it from crawling a site using robots.txt like so:

User-agent: GPTBot
Disallow: /

ChatGPT/OpenAI IP Ranges

OpenAI was also generous enough to provide a list of IP ranges their crawler will connect from, so go ahead and add these to your firewall rules, too:

20.15.240.64/28
20.15.240.80/28
20.15.240.96/28
20.15.240.176/28
20.15.241.0/28
20.15.242.128/28
20.15.242.144/28
20.15.242.192/28
40.83.2.64/28