I know a lot of people want to interpret copyright law so that allowing a machine to learn concepts from a copyrighted work is copyright infringement, but what people need to consider is that all this will do is keep AI out of the hands of regular people and concentrate it in the hands of the people and organizations wealthy and powerful enough to train it for their own use.
If this isn’t actually what you want, then what’s your game plan for placing copyright restrictions on AI training that will actually work? Have you considered how it’s likely to play out? Are you going to be able to stop Elon Musk, Mark Zuckerberg, and the NSA from training an AI on whatever they want and using it to push propaganda on the public? As far as I can tell, all that copyright restrictions will accomplish is to concentrate the power of AI (which we’re only beginning to explore) in the hands of the sorts of people who are the least likely to want to do anything good with it.
I know I’m posting this in a hostile space, and I’m sure a lot of people here disagree with my opinion on how copyright should (and should not) apply to AI training, and that’s fine (the jury is literally still out on that). What I’m interested in is what your end game is. How do you expect things to actually work out if you get the laws that you want? I would personally argue that an outcome where Mark Zuckerberg gets AI and the rest of us don’t is the absolute worst possibility.
But if it makes you happy, how about a license à la Creative Commons that allows an individual to create an AI using the copyrighted work for non-profit reasons, restricts corporations from doing so with an AI used for profit, and considers any work created by this AI to be non-copyrighted?
Honestly, I think keeping the output of AI non-copyrighted is probably the best of both worlds, because it allows individuals to use AI as an expressive tool (you keep separating “creatives” from “average people”, which I take issue with) while making it impractical for large media companies to use.
At any rate, the reason copyright restrictions would just kill open source AI is that it strikes me as incredibly unlikely that you’re going to be able to stop corporations from training AI on media that they own outright. Disney has a massive library of media that they can use as training data, and no amount of stopping open source AI users from training AI on copyrighted works is going to prevent Disney from doing that (same goes for Warner Bros, etc). Disney, which is known for exploiting its own workers, will almost certainly use that AI to replace their animators completely, and they’ll be within their legal rights to do so since they own all the copyrights on it.
Now consider companies like Adobe, ArtStation, and just about any other website you can upload art to. When you sign up for those sites, you agree to their user agreement, which has standard boilerplate language that gives them a sublicensable right to use your work however they see fit (or “for business purposes”, which means the same thing). In other words, if you’ve ever uploaded your work anywhere, you’ve already given someone else the legal right to train an AI on your work (even under a creative interpretation of copyright law that allows concepts and styles to be copyrighted), which means they’re just going to build their own AI and then sell it back to you for a monthly fee.
But artists and writers should be compensated every time someone uses an AI trained on their work, right? Well, let’s look at ChatGPT for a moment. I have open source code out there on GitHub, which was almost certainly included in ChatGPT’s training data. Therefore, when someone uses ChatGPT for anything (since the training data doesn’t go into a database; it just makes tiny little changes to neuron connection weights), they’re using my copyrighted work, and thus they owe me a royalty. Who better to handle that royalty check than OpenAI? So now you get on there and use ChatGPT, making use of my work, and some of the “royalty fee” they’re now charging goes to me. Similarly, ChatGPT has been trained on some of whatever text you’ve added to the internet (comments, writing, whatever, it doesn’t matter), so when I use it, you get royalties. So far so good.

Now OpenAI charges us both, keeps a big commission, and we both pay them $50/month for the privilege of access to all that knowledge, and we both make $20/month because people are using it, for a net of -$30/month. Who wins? OpenAI. With a compensation scheme, the big corporations win every time and the rest of us lose, because it costs money to run, and open source can’t do it at all. Better to skip the middleman and say: here’s an AI that we all contributed to and we all have access to.
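To make that arithmetic concrete, here’s a minimal back-of-the-envelope sketch of the hypothetical royalty scheme; the $50 subscription and $20 payout are just the illustrative numbers from this post, not anyone’s real pricing.

```python
# Back-of-the-envelope model of the hypothetical royalty scheme above.
# The $50 subscription and $20 royalty payout are illustrative numbers
# from this post, not real pricing.

def contributor_net(subscription: float, royalties_earned: float) -> float:
    """Net monthly cash flow for one person whose work is in the training data."""
    return royalties_earned - subscription

def platform_net(subscribers: int, subscription: float, royalties_paid_each: float) -> float:
    """What the middleman keeps after collecting fees and paying out royalties."""
    return subscribers * (subscription - royalties_paid_each)

print(contributor_net(50, 20))   # -30: each contributor is $30/month in the hole
print(platform_net(2, 50, 20))   # +60: the platform keeps $30 per subscriber
```

However you tweak the numbers, the structure is the same: the individual contributors roughly break even or lose, and the entity collecting the fees keeps the spread.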
So again, what specifically is your plan to slow down deployment? Because onerous copyright restrictions aren’t going to stop any of the people who need to be stopped, but they will absolutely stop the people competing with those people.
@IncognitoErgoSum Honestly? Arguing against AI to anyone I can find and supporting any legal action to regulate the industry. That includes my boss when he considers purchasing an AI service.
If I find that something of mine has been used to train an AI, I am willing to join a class action suit. The next work contract renegotiation I have will take into account the possibility of my writing being used for training, and it’ll be a no. I’m supporting the SAG-AFTRA and WGA strikes because those contracts will set important precedents for how AI can be used in creative industries at least, and will likely spread to other industries.
And I think if enough people don’t buy into the hype, stay skeptical, and public opinion remains against it, then it’s less likely AI will be used in industries that need strict safety standards, at least until we get a regulatory agency for it.
I get it, then.
It’s more about the utilitarian goal of convincing people to believe something because it’s convenient for you if they believe it, in order to protect yourself and your immediate peers from automation, as opposed to actually seeking the truth and going with established legal precedent.
Legally, your class action lawsuit doesn’t really have a leg to stand on, but you might manage to win anyway if you can depend on the ignorance of the judge and the jury about how AI actually works, and prejudice them against it. If you can get people to think of computer scientists and AI researchers as “tech bros” instead of scientists with PhDs, you might be able to get them to dismiss what they say as “hype” and “fairy tales”.
I still say you’re wrong about how the AI actually works, man. You’re looking at it through rose-colored glasses, head filled with sci-fi catchphrases. But it’s just a math machine.