The lawsuit alleges OpenAI crawled the web to amass huge amounts of data without people’s permission.

16 points

I doubt it’s only about some Reddit posts. The scraping was done on the whole web, capturing everything it could. So besides stealing data and presenting it as its own, it seems to have collected some even more problematic data which wasn’t properly protected.

11 points

But that really isn’t OpenAI’s fault. Whoever was in charge of securing the patients’ data really fucked up.

18 points

Leaving your front door open isn’t prudent but doesn’t grant permission to others to enter and take/copy your belongings or data.

The security teams may have royally screwed up, but OpenAI has a legal obligation to respect copyright and laws regarding data ownership.

Likewise, they could have scraped pages that included terms of use, copyright, disclaimers, etc., and failed to honor them.

All parties can be in the wrong for different reasons.

13 points

That’s like saying you didn’t lock your front door so whoever robs you is innocent.

6 points

I think it’s a little closer to being mad that the Google street car drove by and snapped a picture of the front of your house, tbh.

2 points

It’s more like leaving an important letter in the open for everyone to read. It’s certainly your fault for leaving it that open.

2 points

But does leaving your front door open allow one to legally take a picture of the inside from across the street? I’d say scraping is more akin to that than it is to theft. Nothing is removed in scraping, just copied.

0 points

Yeah, but what were all these people whose data was scraped wearing?

6 points

It’s certainly their fault that they used it, though.

If they cared, they could have ensured they weren’t using sensitive or otherwise highly problematic information, but they chose not to. That’s on them.

-3 points

It’s called “disrupting” the established norms. You wouldn’t get it because you’re not on the bleeding edge of a revolutionary platform that’s seeing scalable vertical growth due to its paradigm shift.

1 point

They certainly fucked up, but it might well be OpenAI’s fault too.

-1 points

If it was unsecured, it’s basically public. Whoever put that data on a publicly accessible server is at fault.

6 points

That’s not necessarily true. Even if a company makes the mistake of not securing data correctly, those who make use of this data can still be at fault.

If a company leaves a server wide open, you still can’t legally steal information from it.


Technology

!technology@lemmy.world
