Authors using a new tool to search a list of 183,000 books used to train AI are furious to find their works on the list.
No, but the training data does contain a copy. And making a model is not criticising, commenting upon, or creating a parody of it.
It’s not. The humans that trained it (assumably) purchased the material used to train it. What’s the problem?
That list is not exclusive, it’s just a list of examples of fair use.
The training data is not distributed with the AI model.
it’s just a list of examples of fair use.
Yes, it’s a list of quite similar ways of commenting upon a work. Please explain how training an LLM is like any of those things, and thus, how Fair use would apply.