Reddit Will License Its Data to Train LLMs, So We Made a Firefox Extension That Lets You Replace Your Comments With Any (Non-Copyrighted) Text(theluddite.org)

posted 11 days ago

z3rOR0ne@lemmy.ml

fuck_ai@lemmy.world

20 commentshide report

Reddit has filed for its IPO. They’ve been preparing for this for a while, squeezing profit out of the platform in any way that they can, like hiking the prices on third-party app developers. More recently, they’ve signed a deal with Google to license their content to train Google’s LLMs.

To celebrate this momentous occasion, we’ve made a Firefox extension that will replace all your comments (older than a certain number of days) with any text that you provide. You can use any text that you want, but please, do not choose something copyrighted. The New York Times is currently suing OpenAI for training ChatGPT on its copyrighted material. Reddit’s data is uniquely valuable, since it’s not subject to those kinds of copyright restrictions, so it would be tragic if users were to decide to intermingle such a robust corpus of high-quality training data with copyrighted text.

Here’s that extension link again. To all our friends at Reddit, we wish you all the success that you deserve!

Sort:

Hot Top Controversial New Old

[ - ]

kerrigan778@lemmy.world

9 points

11 days ago

Why does anyone think this does anything? Reddit doesn’t internally overwrite their memory of what you wrote before.

permalink

report

[ - ]

Jo Miran@lemmy.ml

2 points

11 days ago

I started doing this with a Greasemonkey script I mostly plagiarized and replaced my posts with text from Moby Dick and Lady Chattetly’s Lover.

EDIT: The problem is that old.Reddit.com doesn’t access most of your comments. If you look at your comments on www.Reddit, you’ll see a ton of untouched content.

permalink

report

[ - ]

Coreidan@lemmy.world

8 points

11 days ago

If you think they haven’t already cached your data then you’re absolutely brain dead. Deleting it is just comical.

permalink

report

[ - ]

chillinit@lemmynsfw.com

3 points

11 days ago

It’s too late. They’ve already made backups. And, even little old me has been scraping for the proliteriat for nearly a decade. If you’re one that regrets not deleting your posts long ago, then please learn your lesson and stop posting on Facebook.

permalink

report

[ - ]

Black History Month@lemmy.world

4 points

11 days ago

Cat and Mouse problem. The current AI’s were trained on data they HAD. Making bigger and bigger models seems to be the trend. Old data, no matter how large and curated, just won’t be enough. Now they need to make new content for AI, that doesn’t create a feedback loop

permalink

report

Fuck AI

!fuck_ai@lemmy.world

Create post

“We did it, Patrick! We made a technological breakthrough!”

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

Community stats

2.1K
Monthly active users
308
Posts
3.7K
Comments

Community stats

Community moderators