The lawsuit alleges OpenAI crawled the web to amass huge amounts of data without people’s permission.
I doubt it’s only about some Reddit posts. The scrapping was done on the whole web, capturing everything it could. So besides stealing data and presenting it as its own, it seems to have collected some even more problematic data which wasn’t properly protected.
But that really isn’t OpenAI’s fault. Whoever was in charge of securing the patients data really fucked up.
Leaving your front door open isn’t prudent but doesn’t grant permission to others to enter and take/copy your belongings or data.
The security teams may have royally screwed up, but OpenAI has a legal obligation to respect copyright and laws regarding data ownership.
Likewise, they could have scraped pages that included terms of use, copyright, disclaimers, etc., and failed to honor them.
All parties can be in the wrong for different reasons.
That’s like saying you didn’t lock your front door so whoever robs you is innocent.
I think it’s a little closer to being mad that the Google street car drove by and snapped a picture of the front of your house, tbh.
But does leaving your front door open allow one to legally take a picture of the inside from across the street? I’d say scraping is more akin to that than it is theft. Nothing is removed in scraping, just copied
It’s certainly their fault that they used it, though.
If they cared, they could have ensured they weren’t using sensitive or otherwise highly problematic information, but they chose not to. That’s on them.
It’s called “disrupting” the established norms. You wouldn’t get it because you’re not on the bleeding edge of a revolutionary platform that’s seeing scalable vertical growth due to its paradigm shift.
if it was unsecured it’s basically public. whomever put that data on a publicly accessible server is at fault