TikTok’s parent company, ByteDance, has been secretly using OpenAI’s technology to develop its own competing large language model (LLM). “This practice is generally considered a faux pas in the AI world,” writes The Verge’s Alex Heath. “It’s also in direct violation of OpenAI’s terms of service, which state that its model output can’t be used ‘to develop any artificial intelligence models that compete with our products and services.’”

22 points

I hope this harms OpenAI in their lawsuits somehow. Their argument of “we can train on the output of others, but nobody can train on our output” has no moral foundation. Pick a lane.

permalink
report
reply
20 points

Should change its name to closed ai.

permalink
report
reply
6 points

I wonder if that clause is legal. It could be argued that it legitimately protects the capital investment needed to make the model. I’m not sure if that’s true, though.

permalink
report
reply
2 points

I can’t speak for every jurisdiction, but I’d be hard pressed to see why it wouldn’t be legal in the US, especially in these circumstances. ByteDance is a massive legally sophisticated corporation, so they should’ve been expected to fully read and understand the terms and conditions before accepting them. They probably won’t bring a legal challenge, because they know they don’t have a particularly strong legal argument or a sympathetic angle to use.

permalink
report
parent
reply
3 points

The issue is that what they are doing there is blatantly anticompetitive.

https://www.ftc.gov/advice-guidance/competition-guidance/guide-antitrust-laws/single-firm-conduct/refusal-deal

permalink
report
parent
reply
1 point

Sorry for the late reply, but this doesn’t really seem like it’d come close to invoking any of the US’s neutered antitrust enforcement. Open AI doesn’t have a monopoly position to abuse, since there are other large firms offering LLMs that see reasonable amounts of usage. This clause amounts more to an effort to stop reverse engineering than stifle anyone trying to build an LLM.

permalink
report
parent
reply
17 points

A lot of open source models are trained using data from gpt outputs actually. It’s a cheap way to generate huge training data. The difference is those models are made by independent researchers not backed by a huge company for commercial purpose so OpenAI probably left them alone.

permalink
report
reply
9 points

Probably an honest mistake. Who hasn’t bent down to tie their shoe, lost their balance and accidentally coded up a LLM to steal from an existing product? I’d still trust them to plant listening devices, cameras and keyloggers in my pocket since they’ve displayed such a commitment to honesty, integrity and transparency.

permalink
report
reply
-3 points

Yes, honestly you have also been a subject of a lot of propaganda. The US and the US media are villifying a lot of Chinese companies while American companies are not much better if not worse.

permalink
report
parent
reply
2 points

Who hasnt been subject to a lot of propaganda?

This is the mis-information age.

permalink
report
parent
reply

Technology

!technology@lemmy.world

Create post

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


Community stats

  • 17K

    Monthly active users

  • 12K

    Posts

  • 543K

    Comments