Google has struck a deal with Reddit that will allow the search engine maker to train its AI models on Reddit’s vast catalog of user-generated content, the two companies announced. Under the arrangement, Google will get access to Reddit’s Data API, which will help the company “better understand” content from the site.

The deal also provides Google with a valuable source of content it can use to train its AI models. “Google will now have efficient and structured access to fresher information, as well as enhanced signals that will help us better understand Reddit content and display, train on, and otherwise use it in the most accurate and relevant ways,” the company said in a statement.

3 points
*

Hooking up an AI model to the turbo-charged sewage pipe that is Reddit’s “vast catalog of user-generated content” has surely got to constitute abuse against machines. If they ever really develop “intelligence” they are going to be absolutely furious with us. 😅

permalink
report
reply
9 points

Google shouldn’t have to pay. Whatever I may have posted on Reddit was public information. Nobody should need to pay Reddit to read it.

permalink
report
reply
1 point

The argument isn’t just around content, it’s around hosting. If Google is sitting there scarfing down Reddit’s data, that costs Reddit in server time. That can get extremely expensive. So yeah, if Google is going to train an AI that Google will profit off of, it should pay Reddit for server time.

permalink
report
parent
reply
3 points
*
Removed by mod
permalink
report
parent
reply
38 points
*

Keep making feel good about deleting my 15+ years of Reddit content. Go on…

Edit: I’ve done it. I’ve officially deleted my account. For a minute there, I was looking at the front page of Reddit. It’s all rage bait. The content is designed to get you to feel something and engage with it. I could feel that itch to comment and downvote. It’s preposterous; and soon, all about quarterly gains.

permalink
report
reply
9 points

Ehhh… shame it’s too late but there are nice scripts that can bulk-edit all your posts and comments for people using search engines and ai crawlers to stumble upon. I put info about reddit paywalling 3rd party apps and invited readers to join lemmy instead.

permalink
report
parent
reply
1 point

I could imagine google also gets some sort of snapshots to mitigate the risk that after their announcement everyone deletes/modifies their content… But who knows.

permalink
report
parent
reply
2 points

I found something that was doing that after I thought it was going to actually delete items. I stopped the script and found something to delete. What’s the advantage of editing comments? Just to advertising alternatives?

permalink
report
parent
reply
3 points
*
IF
    (COMMENT MARKED AS DELETED) 
OR 
    (COMMENT < 10 WORDS)    
THEN  
    IF (PREVIOUS COMMENT VERSION > 10 WORDS)
    THEN    
         RESTORE PREVIOUS COMMENT VERSION FOR AI LEARNING
permalink
report
parent
reply
3 points
*
Removed by mod
permalink
report
parent
reply
16 points

Awesome so we gonna have a sarcastic bot that speak in memes

permalink
report
reply
3 points

unfunny outdated memes, or also unfunny outdated highschooler memes

permalink
report
parent
reply
1 point

Brb training a bot on 4chan posts.

permalink
report
parent
reply
14 points

The creative writing communities aren’t happy about it

permalink
report
reply

Technology

!technology@lemmy.ml

Create post

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

Community stats

  • 3.5K

    Monthly active users

  • 2.6K

    Posts

  • 41K

    Comments

Community moderators