You are viewing a single thread.
View all comments
193 points

Our data, you mean?

permalink
report
reply
73 points
*

well, not mine. i used a script to replace all of my comments with gibberish before i deleted them and then my account. if they went back and restored my comments, then all they’ll get is comments full of gibberish, especially since i overwrote them 3 times before deleting them, just in case they tried to roll back to the previous version.

have fun with that!

permalink
report
parent
reply
75 points

I like your style, but honestly I wouldn’t be surprised if they keep every single version.

permalink
report
parent
reply
35 points

Here’s the thing: Nothing in Reddit’s history indicates that they are that competent.

permalink
report
parent
reply
17 points
*

i bet they do now, but i’ve checked back now and then, and all of my comments and posts are most assuredly gone.

edit: i’ve gone back to check some old haunts, place i know i’ve commented, and i did some seaching with google using my old usernames, as google uses its cache to match to the posts\comments, even though they’re not there any more.

i see old posts that are graveyards of deleted comments, some with simply deleted accounts, and many others where both the account and comment are deleted. i don’t see any gibberish comments. the ones i know are mine (because replies quote the comment above, which i recognize as mine), are all just deleted in their entirety, so it seems they didn’t do comment versioning, at least not past the first edit. i see no posts under any former username of mine.

the efforts to scrub my content from reddit last May appears to have worked. sadly, since the API lockdown, those tools no longer work.

permalink
report
parent
reply
3 points
*

Reddit used to be open source and the source is still on github as a read only archive.

AFAIK back then edit history was only kept briefly. Enough to roll back an accidental edit (if you have admin privileges anyway) but not far enough back to view old versions of posts.

Of course, they would have backups, and maybe the code has changed, but I wouldn’t be surprised if it hasn’t changed and those backups are impractical (slow/expensive) to access.

Keeping old revisions is a common practice but it’s also expensive and in reddit’s case totally unnecessary.

permalink
report
parent
reply
3 points

you can request your reddit data, and they provide every comment along with edits as far as I remember, it was uncomfortable but i’d never posted anything regrettable at least

imagine getting your hands on u/spez’s reddit data

permalink
report
parent
reply
16 points

Yeah…all that comment data isn’t really that large. They’ll have backups captured for likely several years back. All you can view is the info on the current live servers. You might have kept them from getting like 3 months worth of your comments at best.

permalink
report
parent
reply
8 points

I did the same, but we’re both fools if we think reddit didn’t keep every character we typed (yet alone submitted) in a private, proprietary database.

We weren’t paid for our data. We were given access to a website free of charge. The consent we gave was supposed to be for the operation of the website, not for training AI.

They should fucking pay us.

permalink
report
parent
reply
6 points

LOL. I did the same. And I confirmed many months later that the comments were not restored.

Now I hear that Google wants to train their AI on reddit content. Haha. Good luck with that, Lorem Ipsum! 😁

permalink
report
parent
reply
3 points

If you actually replaced with “Lorem ipsum” texts, it would probably be easy to filter the garbage from the dataset.

Also, they probably have copies of the comments before the edits that are just not presented in the frontend.

permalink
report
parent
reply
1 point

Me too. Feelin’ mighty fine about that decision now. Long Live Lemmy

permalink
report
parent
reply
33 points

Our data

permalink
report
parent
reply
9 points

Correct, our data.

  • Spez
permalink
report
parent
reply

Technology

!technology@lemmy.world

Create post

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


Community stats

  • 16K

    Monthly active users

  • 12K

    Posts

  • 554K

    Comments