-11 points
*

Amazing how every new generation of technology has a generation of users of the previous technology who do whatever they can to stop its advancement. This technology takes human creativity and output to a whole new level. It will advance medicine and science in ways that are difficult to even imagine, and it will provide personalized educational tutoring to every student regardless of income. Yet these people are worried about the technicality of what the AI is trained on, and often don't even understand enough about AI to make an argument about it. If people like this win, whatever country's legal system they win in will not see the benefits that AI can bring. That society is shooting itself in the foot.

Your favorite musician listened to music that inspired them when they made their songs. Listening to other people’s music taught them how to make music. They paid for the music (or somebody did via licensing fees or it was freely available for some other reason) when they listened to it in the first place. When they sold records, they didn’t have to pay the artist of every song they ever listened to. That would be ludicrous. An AI shouldn’t have to pay you because it read your book and millions like it to learn how to read and write.

31 points

You’re humanizing the software too much. Comparing software to human behavior is just plain wrong. GPT can’t even reason properly yet. I can’t see this as anything other than a more advanced collage process.

OpenAI used intellectual property without the consent of the owners. Majorly fucked.

If ‘anybody’ does anything similar to tracing, copy&pasting or even sampling a fraction of another person’s imagery or written work, that anybody is violating copyright.

7 points
*

If ‘anybody’ does anything similar to tracing, copy&pasting or even sampling a fraction of another person’s imagery or written work, that anybody is violating copyright.

Ok, but tracing is literally a part of the human learning process. If you trace a work and sell it as your own that’s bad. If you trace a work to learn about the style and let that influence your future works that is what every artist already does.

The artistic process isn’t copyrighted, only the final result. The exact same standards can apply to AI generated work as already do to anything human generated.

2 points

I don't know the specifics of the lawsuit, but I imagine this would parallel piracy.

In a way you could say that OpenAI has pirated content directly from multiple intellectual properties. OpenAI has distributed software which emulates skills and knowledge. Remember, this is a tool, not an individual.

-8 points

You’re mystifying and mythologising humans too much. The learning process is very equivalent.

-6 points

Well, there's still a shit ton we don't understand about humans.

We do, however, understand everything about machine learning.

3 points

amazing

2 points

sampling a fraction of another person’s imagery or written work.

So citing is a copyright violation? A scientific discussion on a specific text is a copyright violation? This makes no sense. It would mean your work couldn’t build on anything else, and that’s plain stupid.

Also, to your first point about reasoning and the advanced collage process: you are right and wrong. Yes, an LLM doesn't have the ability to use all the information a human has or be as precise, and therefore it can't reason the same way a human can. BUT, and that is a huge caveat, the inherent goal of AI, and in its simplest form neural networks, was to replicate human thinking. If you look at the brain and then at AIs, you will see how close the process is. It's usually giving the AI an input, the AI tries to give the desired output, then the AI gets told what the output should have looked like, and then it backpropagates to reinforce its process. This is already pretty advanced and human-like (even look at how the brain is made up and then how AI models are made up; it's basically the same concept).
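The input, output, correction, backpropagation loop described above can be sketched as a toy example. This is a deliberately tiny, hypothetical one-parameter model; real networks have many layers and millions of parameters, but the training loop has the same shape:

```python
# Toy version of the loop described above: give the model an input,
# compare its output to the desired answer, and nudge the weights
# in the direction that shrinks the error (the essence of backpropagation).
weight, bias = 0.0, 0.0
examples = [(1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]  # target relation: y = 2x + 1

for _ in range(2000):                  # training loop
    for x, target in examples:
        predicted = weight * x + bias  # forward pass
        error = predicted - target     # how wrong were we?
        weight -= 0.01 * error * x     # backpropagate: adjust parameters
        bias -= 0.01 * error           # to reduce the error next time

print(round(weight, 2), round(bias, 2))  # approaches 2.0 and 1.0
```

After enough passes the parameters converge toward the underlying pattern in the examples, without the examples being stored anywhere in the model.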

Now you would be right to say "well, in its simplest form an LLM like GPT is just predicting which character or word comes next", and you would be partially right. But in that process it incorporates all of the "knowledge" it got from its training sessions, plus a few valuable tricks to improve. The truth is, the differences between a human brain and an AI are marginal, and it mostly boils down to efficiency and training time.

And to say that LLMs are just “an advanced collage process” is like saying “a car is just an advanced horse”. You’re not technically wrong but the description is really misleading if you look into the details.

And for detail's sake, this is the paper for Llama 2, the latest big LLM from Facebook, which is said to be the current standard for LLM development:

https://arxiv.org/pdf/2307.09288.pdf

5 points

I don’t think that Sarah Silverman and the others are saying that the tech shouldn’t exist. They’re saying that the input to train them needs to be negotiated as a society. And the businesses also care about the input to train them because it affects the performance of the LLMs. If we do allow licensing, watermarking, data cleanup, synthetic data, etc. in a way that is transparent, I think it’s good for the industry and it’s good for the people.

3 points

I don't need to negotiate with Sarah Silverman if I'm handed her book by a friend, and neither should an AI.

1 point

Except the AI owner does. It’s like sampling music for a remix or integrating that sample into a new work. Yes, you do not need to negotiate with Sarah Silverman if you are handed a book by a friend. However if you use material from that book in a work it needs to be cited. If you create an IP based off that work, Sarah Silverman deserves compensation because you used material from her work.

No different with AI. If the AI used intellectual property from an author in its learning algorithm, then if that intellectual property shows up in the AI's output, the original author is due compensation under certain circumstances.

3 points

But you do need to negotiate with Sarah Silverman if you take that book, rearrange the chapters, and then try to sell it for profit. Obviously that's an extreme example, but it's the argument they're making.

1 point

An LLM isn’t human and shouldn’t be treated the same as a human. It’s as foolish as corporate personhood.

23 points
*
Deleted by creator
31 points
*

No that’s not how it works. It stores learned information like “word x is more likely to follow word y than word a” or “people from country x are more likely to consume food a than b”. That is what is distributed when the AI model is shared. To learn that, it just reads books zillions of times and updates its table of likelihoods. Just like an artist might listen to a Lil Wayne album hundreds of times and each time they learn a little bit more about his rhyme style or how beats work or whatever. It’s more complicated than that, but that’s a layperson’s explanation of how it works. The book isn’t stored in there somewhere. The book’s contents aren’t transferred to other parties.
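A minimal sketch of that "table of likelihoods" idea (illustrative only; a real model encodes this in neural network weights, not a literal lookup table): count which word follows which, then throw the source text away. What gets shared is the statistics, not the book:

```python
from collections import Counter

# Crude sketch of "word x is more likely to follow word y":
# tally (previous word, next word) pairs from some text.
text = "the cat sat on the mat and the cat slept"
words = text.split()
pairs = Counter(zip(words, words[1:]))  # counts of each word pair

# The "model" retains only likelihood counts, not the original text:
print(pairs[("the", "cat")])  # → 2
print(pairs[("the", "mat")])  # → 1
```

Distributing `pairs` tells you "cat" follows "the" more often than "mat" does, but you cannot reconstruct the sentence from it once the counts come from enough text.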

7 points
*

The learning model is artificial, vs a human that is sentient. If a human learns from a piece of work, that’s fine if they emulate styles in their own work. However, sample that work, and the original artist is due compensation. This was a huge deal in the late 80s with electronic music sampling earlier musical works, and there are several cases of copyright that back original owners’ claim of royalties due to them.

The lawsuits allege that the models used copyrighted work to learn. If that is so, writers are due compensation for their copyrighted work.

This isn’t litigation against the technology. It’s litigation around what a machine can freely use in its learning model. Had ChatGPT, Meta, etc., used works in the public domain this wouldn’t be an issue. Yet it looks as if they did not.

EDIT

And before someone mentions that the books may have been bought and then used in the model, it may not matter. The Birthday Song is a perfect example of copyright that caused several restaurant chains to use other tunes, up until the copyright was overturned in 2016. Every time the AI uses the copied work in its output, it may be subject to copyright.

2 points

When you download Vicuna or Stable Diffusion XL, they’re a handful of gigabytes. But when you go download LAION-5B, it’s 240TB. So where did that data go if it’s being copy/pasted and regurgitated in its entirety?
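A rough back-of-envelope check of that point. The ~5 GB model size and the ~5 billion image count are approximations I'm assuming for illustration, not exact figures; only the 240 TB dataset size comes from the comment above:

```python
# If a downloaded model "contained" its training set, each training
# item would have to fit in a handful of bytes.
model_bytes = 5 * 10**9        # assumed: a ~5 GB downloaded model
dataset_bytes = 240 * 10**12   # LAION-5B, roughly 240 TB
num_images = 5 * 10**9         # assumed: ~5 billion image-text pairs

print(model_bytes / num_images)     # → 1.0 byte per image
print(dataset_bytes / model_bytes)  # → 48000.0 (apparent "compression" ratio)
```

One byte per image obviously cannot hold a copy of the image, which is the point being made: the data was not copy/pasted into the model.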

2 points

Exactly! If it were just outputting exact data, they wouldn't care about making new works and would just pivot to being the world's greatest source of compression.

Though there is some work researchers have done to heavily modify these models to overfit and do exactly this.

5 points

It's less about copying the work; it's more about looking at patterns that appear in a work.

To give a very rudimentary example: if I wanted a word and the first letter was Q, what would the second letter be?

Of course, statistically, the next letter is u, and it's not common for words starting with Q to have a different letter after that. ML/AI is like taking these small situations but having a ridiculous number of parameters to come up with something based on several internal models. These parameters generally have some context, of course.

It's like if you were told to read a book thoroughly and then afterwards were told to reproduce the same book. You probably couldn't make it 1:1, but you could probably get the general gist of the story. The difference between you and the machine is that the machine has read a lot of books and contextually knows the patterns, so it can generate something similar faster and more accurately, but not an exact copy of the original.
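The Q-followed-by-u pattern above can be checked directly by counting. The word list here is a tiny, hypothetical sample chosen for illustration; a real model would derive the same statistic from a huge corpus:

```python
from collections import Counter

# Which letter follows "q" in a small sample vocabulary?
words = ["queen", "quick", "quote", "quilt", "qatari", "iraqi"]
after_q = Counter(w[w.index("q") + 1] for w in words if "q" in w)

print(after_q.most_common(1))  # → [('u', 4)]
```

The counts capture the pattern ("u usually follows q") without storing the words themselves, which is the distinction the comment is drawing.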

-2 points

This technology takes human creativity and output to a whole new level,

No, it doesn’t. There’s nothing “human” or “creative” about the output of AI.

-1 points

Amazing how every generation of technology has an asshole billionaire or two stealing shit to be the first in line to try and monopolize society’s progress.

3 points

It's a bit more than that if the AI is told to make something "in the style of" someone.

3 points

I mean, people have been doing new works in the style of other artists for a while as well.

4 points

Yeah, but again, they can't crank out a new one every 5 minutes, and it would actually overwhelm the courts, as it's very easy for those works to be too similar. Take the guy who tried to sue Disney by writing a book based on Finding Nemo when he found out they were making a story like that. He was shady and tried to play timeline games, but he did not need to make a story just like it.

1 point

Yeah, and a person could make something in the style of someone else. And it would only be copyright infringement if the work does not meaningfully change the original and give credit to the original artist.

How is this any different?

1 point

Mainly because it's just too easy. We should limit time periods for IP, but while it's in force it should not be able to be used by AI, in my view. Keep IP to 20 years and let AI have it at that point.

5 points

I take it we don’t use the phrase “good writers borrow, great writers steal” in this day and age…

10 points

Wait till they find out photographers spend their whole careers trying to emulate the style of previous generations. Or that Adobe has been implementing AI-driven content creation into Photoshop and Lightroom for years now, and we’ve been pretending we don’t notice because it makes our jobs easier.

26 points

At the crux of the authors' lawsuit is the argument that OpenAI is ruthlessly mining their material to create "derivative works" that will "replace the very writings it copied."

The authors shoot down OpenAI’s excuse that “substantial similarity is a mandatory feature of all copyright-infringement claims,” calling it “flat wrong.”

Goodbye Star Wars, Avatar, Tarantino’s entire filmography, every slasher film since 1974…

10 points

OpenAI is trying to argue that the whole work has to be similar to infringe, but that's never been true. You can write a novel and infringe on page 302, and that's a copyright infringement. OpenAI is trying to change the meaning of copyright, because otherwise the output of their model is oozing with various infringements.

0 points
*

I can quote work that's already been published; that's allowable, and I don't have to get the author's consent to do it. I don't have to get consent because I'm not passing the work off as my own, I am quoting it with reference.

So if I ask the AI to produce something in the style of Stephen King no copyright is violated because it’s all original work.

If I ask the AI to quote Stephen King (and it actually does it) then it’s a quote and it’s not claiming the work is its own.

Under the current interpretation of copyright law (and current law is broken beyond belief, but that’s a completely different issue) a copyright breach has not occurred in either scenario.

The only argument I can see working is that if the AI can actually quote Stephen King, that would prove the works of Stephen King are in its data set. But that doesn't really prove anything other than that his works are in the data set; it doesn't definitively prove OpenAI didn't pay for them.

5 points
*

You can quote a work under fair use, and whether it's legal depends on your intent. You have to be quoting it for such uses as "commentary, criticism, news reporting, and scholarly reports."

There is no cheat code here. There is no loophole that LLMs can slide on through. The output of LLMs is illegal. The training of LLMs without consent is probably illegal.

The industry knows that its activity is illegal, and its strategy is not to win but rather to make litigation expensive, complex, and slow through such tactics as:

  1. Diffusion of responsibility: (note the companies compiling the list of training works, gathering those works, training on those works and prompting the generation of output are all intentionally different entities). The strategy is that each entity can claim “I was only doing X, the actual infringement is when that guy over there did Y”.
  2. Diffusion of infringement: so many works are being infringed that it becomes difficult, especially on the output side, to say who has been infringed and who has standing. What’s more, even in clear cut cases like, for instance, when I give an LLM a prompt and it regurgitates some nontrivial recognizable copyrighted work, the LLM trainer will say you caused the infringement with your prompt! (see point 1)
  3. Pretending to be academic in nature so they could wrap themselves in the thick blanket of affirmative defense that fair use doctrine affords the academy, and then after the training portion of the infringement has occurred (insisting that was fair use because it was being used in an academic context) “whoopseeing” it into a commercial product.
  4. Just being super cagey about the details of the training sets that were actually used and how they were used. This kind of stuff is discoverable but you have to get to discovery first.
  5. And finally, magic brain box arguments. These are typically some variation of "all artists have influences." It is a rhetorical argument that would be blown right past in court, but it muddies the public discussion and is useful to them in that way.

Their purpose is not to win. It’s to slow everything down, and limit the number of people who are being infringed who have the resources to pursue them. The goal is that if they can get LLMs to “take over” quickly then they can become, you know, too big and too powerful to be shut down even after the inevitable adverse rulings. It’s classic “ask for forgiveness, not permission” silicon valley strategy.

Sam Altman’s goal in creeping around Washington is to try to get laws changed to carve out exceptions for exactly the types of stuff he is already doing. And it is just the same thing SBF was doing when he was creeping around Washington trying to get a law that would declare his securitized ponzi tokens to be commodities.

2 points

It doesn’t definitively prove openAI didn’t pay for the works.

But since they are a business/org that has all of those works and is using them for profit, it kind of would be provable whether OpenAI did or didn't pay the correct licenses, as they and/or the publisher/Stephen King (if he were to handle those agreements directly) would have a receipt/license document of some kind to show it. I don't agree with how copyrights are done and agree that things should enter the public domain much sooner. But a for-profit thing like OpenAI shouldn't just be allowed all these exceptions that avoid needing any level of permission, or paying the owners who ask for it. At least not while us regular people, who aren't using these sources for profit/business, also aren't allowed to just use whatever we want.

The only way that I, at least, see such open use of everything, at the level of all this data/information, being fine is in a socialist/communist system of some kind. The main reason for keeping stuff like entertainment/information/art/etc at a creator level is to have money to live in modern society, where basic and crucial needs (food/housing/healthcare/etc) cost money. So for the average author/writer/artist/inventor, a for-profit company just being able to take their shit directly impacts their ability to live.

It is a highly predatory level of capitalism and should not have exceptions. It is just setting up a different version of the shit that also needs to be stopped in the entertainment/technology industries, where the actual creators/performers/etc are fucked by the studios/labels/corps by not being paid anywhere near the value being brought in, and may not even have control over their work. So all of the companies and the capitalist system are why a private entity/business/org shouldn't just be allowed to pull this shit.

17 points

Uh, yeah, a massive corporation sucking up all intellectual property to milk it is not the own you think it is.

12 points

But this is literally people trying to strengthen copyright and its scope. The corporation is, out of pure convenience, using copyright as it exists currently with the current freedoms applied to artists.

-5 points
*

Listen, it's pretty simple. Copyright was made to protect creators on initial introduction to market. In modern times it's good if an artist has one lifetime of royalties, i.e. their own lifetime, so that they can at least make a little something, because for the small artist that little something means food on their plate.

But a company, sitting on a Smaug’s hill worth of intellectual property, “forever less a day”? Now that’s bonkers.

But you, scraping my artwork to resell for pennies on the dollar via some stock material portal? Can I maybe crawl up your colon with sharp objects and kindling to set up a fire? Pretty please? Oh pretty please!

Also, if your AI copies my writing style, I will personally find you, rip open your skull AND EAT YOUR BRAINS WITH A SPOON!!! Got it, devboy?

You won't be Mr Hotshot with pointy objects and a fire up your ass, as well as less than half a brain… even though I just took a couple of bites.

Chew on that one.

EDIT: the creative writer is doomed, I tells ya! DOOOOOOMED!

9 points
*

AI training isn't only for mega-corporations. We can already train our own open-source models, so we shouldn't applaud someone trying to erode our rights, or let people put up barriers that will keep out all but the ultra-wealthy. We need to be careful not to weaken fair use and hand corporations a monopoly on a public technology by making it prohibitively expensive for regular people to keep developing our own models. Mega-corporations already have their own datasets, and the money to buy more. They can also make users sign predatory ToS allowing them exclusive access to user data, effectively selling our own data back to us. Regular people, who could have had access to a corporate-independent tool for creativity, education, entertainment, and social mobility, would instead be left worse off, with fewer rights than where they started.

1 point
*

Speaking of slasher films, does anybody know of any movies that have terrible everything except a really good plot?

2 points

The Godfather Part III

16 points
*

This actually reminds me of a sci-fi series I read where, in the future, they use an AI to scan any new work to see what intellectual property owned by the big corporations may have been used as an influence, in order to halt the production of any new media not tied to a pre-existing IP, including 100% of independent and fan-made works.

Which is one of the contributing factors towards the apocalypse. So 500 years later, after the apocalypse has been reversed and human colonies are enjoying post-scarcity, one of the biggest fads is rediscovering the 20th century: now that all the copyrights have expired, people can datamine the ruins of Earth to find all the media that couldn't be properly preserved heading into Armageddon thanks to copyright trolling.

It's referred to in-universe as "Twencen".

The series is called FreeRIDErs, if anyone is curious. Unfortunately the series may never have a conclusion (untimely death of a co-creator), but most of its story arcs were finished, so there's still a good chunk of meat to chew through, and I highly recommend it.

19 points

seethe

Very concerning word use from you.

The issue art faces isn’t that there’s not enough throughput, but rather there’s not enough time, both to make them and enjoy them.

-23 points

Headline is stupid.

Millennial journalism has fucking got to stop with these clown word choices…

29 points

Honestly it’s refreshing to not see the word “slammed” for once…

-2 points

Haha… This person gets it.

1 point

Let the boys be boys

15 points

That’s always been the case, though, imo. People had to make time for art. They had to go to galleries, see plays and listen to music. To me it’s about the fair promotion of art, and the ability for the art enjoyer to find art that they themselves enjoy rather than what some business model requires of them, and the ability for art creators to find a niche and to be able to work on their art as much as they would want to.

23 points

I don’t care what works a neural network gets trained on. How else are we supposed to make one?

Should I care more about modern eternal copyright bullshit? I'd feel more nuance about it if everything a few decades old were public domain, like it's fucking supposed to be. Then there'd be plenty of slightly-outdated content to shovel into these statistical analysis engines. But there's not. So fuck it: show the model absolutely everything, and the impact of each work becomes vanishingly small.

Models don't get bigger as you add more stuff. Training only twiddles the numbers in each layer. There are two-gigabyte networks that have been trained on hundreds of millions of images. If those images were stored verbatim in the network, each would get barely a dozen bytes. And the network gets better as that number goes down.
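The "barely a dozen bytes" figure checks out with simple arithmetic (taking "hundreds of millions" as roughly 150 million, an assumed round number):

```python
# Bytes available per training image if the network "stored" them all.
network_bytes = 2 * 10**9        # a two-gigabyte network
training_images = 150 * 10**6    # assumed: ~150 million images

bytes_per_image = network_bytes / training_images
print(round(bytes_per_image, 1))  # → 13.3
```

For comparison, even a heavily compressed thumbnail needs thousands of bytes, so verbatim storage at this budget is impossible.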

The entire point is to force the distillation of high-level concepts from raw data. We’ve tried doing it the smart way and we suck at it. “AI winter” and “good old-fashioned AI” were half a century of fumbling toward the acceptance that we don’t understand how intelligence works. This brute-force approach isn’t chosen for cost or ease or simplicity. This is the only approach that works.

-5 points
*
Deleted by creator
13 points

Right, copyright law.

1 point
*
Deleted by creator
3 points

Models don’t get bigger as you add more stuff.

They will get less coherent and/or “forget” the earlier data if you don’t increase the parameters with the training set.

There are two-gigabyte networks that have been trained on hundreds of millions of images

You can take a huge TIFF image, put it through JPEG with the quality cranked all the way down, and get a tiny file out the other side which is still a recognizable derivative of the original. LLMs are extremely lossy compression of their training set.
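A crude sketch of that lossy-compression idea, using a toy signal and naive quantization rather than actual JPEG: the stored version is far smaller, and the reconstruction is recognizably derived from the original but not identical:

```python
# Lossy "compression" by quantizing a signal down to a few levels.
original = [0.0, 0.1, 0.5, 0.9, 1.0, 0.9, 0.5, 0.1, 0.0]

levels = 2                                          # keep only 2 levels
compressed = [round(x * levels) for x in original]  # small integers
restored = [q / levels for q in compressed]         # lossy reconstruction

print(compressed)  # → [0, 0, 1, 2, 2, 2, 1, 0, 0]
print(restored)    # → [0.0, 0.0, 0.5, 1.0, 1.0, 1.0, 0.5, 0.0, 0.0]
```

The fine detail (0.1 vs 0.0, 0.9 vs 1.0) is gone for good; only the overall shape survives, which is the sense in which the compression is "extremely lossy."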

4 points

which is still a recognizable derivative of the original

Not in twelve bytes.

Deep models are a statistical distillation of a metric shitload of data. Smaller models with more training on more data don’t get worse, they get more abstract - and in adversarial uses they often kick big networks’ asses.


Technology

!technology@lemmy.ml
