The New York Times blocks OpenAI’s web crawler::The New York Times has officially blocked GPTBot, OpenAI’s web crawler. The outlet’s robots.txt file specifically disallows GPTBot, preventing OpenAI from scraping content from its website to train AI models.
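For anyone curious what a rule like that looks like in practice, here is a rough sketch using Python's standard-library `urllib.robotparser`. The `ROBOTS_TXT` contents, the `is_allowed` helper, and the example URL path are assumptions made up for illustration, not copied from the Times' actual file. Keep in mind that robots.txt is only a request: it works when the crawler chooses to check it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents for illustration only;
# not copied from the real nytimes.com file.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if the given rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    # Parse the rules from a string instead of fetching them over the network.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Example URL is an assumption; any path on the site is covered by "Disallow: /".
EXAMPLE_URL = "https://www.nytimes.com/section/technology"

gptbot_allowed = is_allowed("GPTBot", EXAMPLE_URL)
other_allowed = is_allowed("SomeOtherBot", EXAMPLE_URL)

print("GPTBot allowed:", gptbot_allowed)
print("SomeOtherBot allowed:", other_allowed)
```

A well-behaved crawler would run a check like this before each request and skip any URL that comes back disallowed. Crawlers with no matching rule are allowed by default.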
19 points
The question is: does that crawler adhere to robots.txt policies?
3 points