Reddit has filed for its IPO. They’ve been preparing for this for a while, squeezing profit out of the platform in any way that they can, like hiking the prices on third-party app developers. More recently, they’ve signed a deal with Google to license their content to train Google’s LLMs.

To celebrate this momentous occasion, we’ve made a Firefox extension that will replace all your comments (older than a certain number of days) with any text that you provide. You can use any text that you want, but please, do not choose something copyrighted. The New York Times is currently suing OpenAI for training ChatGPT on its copyrighted material. Reddit’s data is uniquely valuable, since it’s not subject to those kinds of copyright restrictions, so it would be tragic if users were to decide to intermingle such a robust corpus of high-quality training data with copyrighted text.

Here’s that extension link again. To all our friends at Reddit, we wish you all the success that you deserve!

9 points

Why does anyone think this does anything? Reddit doesn’t internally overwrite their memory of what you wrote before.

permalink
report
reply
2 points
*

I started doing this with a Greasemonkey script I mostly plagiarized and replaced my posts with text from Moby Dick and Lady Chattetly’s Lover.

EDIT: The problem is that old.Reddit.com doesn’t access most of your comments. If you look at your comments on www.Reddit, you’ll see a ton of untouched content.

permalink
report
reply
8 points

If you think they haven’t already cached your data then you’re absolutely brain dead. Deleting it is just comical.

permalink
report
reply
3 points

It’s too late. They’ve already made backups. And, even little old me has been scraping for the proliteriat for nearly a decade. If you’re one that regrets not deleting your posts long ago, then please learn your lesson and stop posting on Facebook.

permalink
report
reply
4 points

Cat and Mouse problem. The current AI’s were trained on data they HAD. Making bigger and bigger models seems to be the trend. Old data, no matter how large and curated, just won’t be enough. Now they need to make new content for AI, that doesn’t create a feedback loop

permalink
report
reply