Maven, a new social network backed by OpenAI’s Sam Altman, found itself in a controversy today when it imported a huge number of posts and profiles from the Fediverse, and then ran AI analysis to alter the content.

126 points

Pretty wild

145 points

The wildest part is that he’s surprised that Mastodon peeps would react negatively to their posts being scraped without consent or even notification and fed into an AI model. Like, are you for real dude? Have you spent more than 4 seconds on Mastodon and noticed their (our?) general attitude towards AI? Come the hell on…

32 points

People can complain, but the Fediverse is built to make consuming users’ data easy. If you don’t want AI using your data, don’t put it on such an easily “scrapable” network.

47 points

Yeah, and girls dress for rape. They are just aaasking for it!

I will go off on a tangent.

Just because something is online it does not mean I give a full green light on anything.

Fuck this noise of parasitic social networks hammering “free service, therefore pay with data” into everyone’s skull. And everyone posts crap.

It is a billion dollar business. LLMs are extracting millions and will generate more.

You know why? Because worthless shit you post online is not worthless after all.

Yes, you are reading it right. Pay me. Pay us.

Before anyone ridicules this: y’all be defending billion dollar corporations, staffed with millionaires below C-level.

People should start demanding money from these greedy assholes.

15 points

Alternatively, use a closed ecosystem susceptible to data rot and loss.

Want to contribute to our open source project? Join our discord

Would you want art to be unfindable because scraping for AI image generation happens? It’s a solution looking for problems.

8 points

This is what I’ve been saying the entire time. It sucks, and it’s wrong, but the fediverse is built from the ground up as an open sharing platform, where our data is shared with anyone. It shouldn’t be, and it’s wrong, but there is nothing to stop anyone from doing it. To change that would alter federation at a core level.

1 point

People can complain, but the Fediverse is built to make consuming users’ data easy

Correction: it is built to make consuming users’ data not easy, but more human.

What you are thinking of is AP, not “Fediverse”, and even then that’s a stretch.

1 point

Just because our data is accessible doesn’t mean it’s legally licensed to be used by a for-profit company. Free doesn’t mean you can do what you want with it, it just means no cost.


It sounds like they weren’t “being fed into an AI model” as in being used as training material, they were just being evaluated by an AI model. However…

Have you spent more than 4 seconds on Mastodon and noticed their (our?) general attitude towards AI?

Yeah, the general attitude of wild witch-hunts and instant zero-to-11 rage at the slightest mention of it. Doesn’t matter what you’re actually doing with AI, the moment the mob thinks they scent blood the avalanche is rolling.

It sounds like Maven wants to play nice, but if the “general attitude” means that playing nice is impossible why should they even bother to try?

6 points

The anti-AI knee-jerk reactions can be extreme, I agree, but at the same time one of the important features of Mastodon is that your feed is not controlled by an algorithm in any way.

So when a company comes, takes those posts and screws with them to create an algorithm to show them, I understand people getting angry - at least some of them joined to be free of that exact thing…

2 points

Yeah, the general attitude of wild witch-hunts and instant zero-to-11 rage at the slightest mention of it. Doesn’t matter what you’re actually doing with AI, the moment the mob thinks they scent blood the avalanche is rolling.

This wasn’t always the case. A lot of NLP research in the 2010s used scraped social media posts. People never had a problem with that (at least the outrage wasn’t visible back then). The problem now is that our content is being used to create an AI product with zero consent from the end user.

Source: My research colleagues used to work on NLP

6 points

He’s not surprised. He’s acting surprised because he got caught. It’s pretty standard for these jerkass tech bros. “Move fast break things” is code for “break laws, be unethical”. As I think we’ve all seen, if you do it often and fast enough you can keep way ahead of any kind of accountability, because everybody else is trying to play catch-up while the last thing has already filtered out of the news cycle.

-4 points

I’m surprised as well. We put our posts up for anyone to replicate and republish, yet we still get mad when somebody replicates and republishes them. It does not make sense. ActivityPub is an open network with zero privacy expectations.

6 points

And yet we don’t want our posts to be fed into AI slop, nor do we want independent hosts to pay for the massive amount of traffic generated by a massive corporate entity trying to consume data en masse.

2 points

What has our copyright got to do with privacy expectation?

17 points
6 points

Look at that shit-eating grin, he knows. There’s no way someone can be that out of touch, right? Right?!?

2 points

How does someone with a last name that close to secretion choose to go by Jimmy?

88 points

I was confused why a package manager would need to import posts from a social network.

Why name a new product the same as a very popular existing product?

17 points

Obviously it’s named after Maven Black-Briar

1 point

I mean maven is super bloated so it wouldn’t surprise me

40 points

I was confused on what they were trying to accomplish, and even after reading the article I am still somewhat confused.

Instead, when a user posts something, the algorithm automatically reads the content and tags it with relevant interests so it shows up on those pages. Users can turn up the serendipity slider to branch out beyond their stated interests, and the algorithm running the platform connects users with related interests.

Perhaps I’m a minority, but I don’t see myself getting much utility out of this. I already know what my interests are, and don’t have much interest in growing them algorithmically. If a topic is really interesting, I’ll eventually find out about it via an actual human.

28 points

Yeah, we’re trying to get the fuck away from algorithms. That’s what makes the fediverse such a big draw currently, for me.

13 points

Only algorithm I need is posts I subscribe to, in descending order. That’s about it


You’re on slrpnk.net, I assume it’s not implementing any of this stuff. As long as you don’t sign up for Maven I don’t see how this is going to affect you.

7 points

I mean yeah, maybe it won’t affect me directly, I like the instance I’m on and it’s a pretty respectable one. However, indirectly, this is very relevant to any Fediverse user, regardless of the instance or platform they’re using. Allowing abuses like this to happen without any pushback is a surefire way of turning this place into a shithole just like the rest of the internet. I appreciate the fact that, at least for now, it’s different here.

Also, maybe this isn’t my only homebase? Just saying.

11 points

TikTok is really popular and operates on essentially the same principle. I, for one, want nothing to do with that.

1 point

Instead, when a user posts something, the algorithm automatically reads the content and tags it with relevant interests so it shows up on those pages.

Motherfucker this is what hashtags are for.

0 points

So you don’t ever want to learn about new things? And even if you did, you wouldn’t want those new things to be efficiently suggested to you, and would rather have them bundled with a bunch of other boring crap?

Also, what you’re asking for is what the tool seems to do. You would put the slider all the way to one side to avoid having new stuff suggested. Existing social media platforms often just shove stuff at you endlessly.

29 points

That’s why I keep saying it’s pointless to defederate corpos. They’ll just scrape everything before you notice.

28 points

The fact they even got DMs from at least one instance is crazy.

27 points

And it’s also damning for private messaging on Mastodon.

I once read vague complaints about it being a rushed implementation. While I won’t trust those without evidence, I for sure wouldn’t trust Mastodon with my PMs. At least, not until it’s figured out how this was allowed to happen, and fixed if necessary.

P.S. I’m still not sure I believe in PMs in the fediverse. If I need to share something and care about keeping it private, I’d rather move the conversation elsewhere.

17 points

I was under the impression that DMs on Mastodon (and Lemmy too) were never stated to be secure, and I think that both were pretty transparent about this particular aspect.

1 point

Well the problem is user perception/understanding.

The reality is they were literally direct messages, not private messages.

9 points

Defederation is more about not being flooded with 1000x more users than the Fediverse currently has.

1 point

So far we only have a corpo fedi-Twitter in the form of Threads. In that case a user on a non-corpo instance has to specifically follow someone before their content is federated, so that sounds like a bit of an overblown issue.

1 point

Seems pretty easy for any corporation to set up something like https://lemmy-federate.com/ but for Mastodon/IceShrimp/Misskey accounts, to federate the important corporate accounts to the targeted non-corpo instances.

1 point

Unfortunately a lot of people think it’s to do with scraping as well. The amount of “defederate Threads so that they can’t scrape my data” posts I saw was about 50-50 with the sensible takes.

3 points

Plus even if you defederate them, oops, it’s all public anyway!


Classic Scam Altman!


Fediverse

!fediverse@lemmy.world


A community to talk about the Fediverse and all its related services using ActivityPub (Mastodon, Lemmy, KBin, etc).

If you want help with moderating your own community, head over to !moderators@lemmy.world!

Rules

  • Posts must be on topic.
  • Be respectful of others.
  • Cite the sources used for graphs and other statistics.
  • Follow the general Lemmy.world rules.

Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration)
