GnuLinuxDude@lemmy.ml to lemmy.ml meta@lemmy.ml · 1 year ago

Should lemmy.ml block chatgpt scraping in robots.txt?

39

Should lemmy.ml block chatgpt scraping in robots.txt?

GnuLinuxDude@lemmy.ml to lemmy.ml meta@lemmy.ml · 1 year ago

Some context about this here: https://arstechnica.com/information-technology/2023/08/openai-details-how-to-keep-chatgpt-from-gobbling-up-website-data/

the robots.txt would be updated with this entry

User-agent: GPTBot
Disallow: /

Obviously this is meaningless against non-openai scrapers or anyone who just doesn’t give a shit.

Chat

7heo@lemmy.ml
link
fedilink
arrow-up
3
arrow-down
3·
1 year ago
That won’t stop OpenAI. We need actual blocking, on the server side. Problem is, with federation and all, it will be really, really difficult to do. And expensive.