Setting aside the usual arguments on the anti- and pro-AI art debate and the nature of creativity itself, perhaps the negative reaction that the Redditor encountered is part of a sea change in opinion among many people that think corporate AI platforms are exploitive and extractive in nature because their datasets rely on copyrighted material without the original artists’ permission. And that’s without getting into AI’s negative drag on the environment.
You should check out this article by Kit Walsh, a senior staff attorney at the EFF, and this one by Katherine Klosek, the director of information policy and federal relations at the Association of Research Libraries.
Using things “without permission” forms the bedrock on which artistic expression and free speech as a whole are built upon. I am glad to see that the law aligns with these principles and protects our ability to engage openly and without fear of reprisal, which is crucial for fostering a healthy society.
I find myself at odds with the polarized argumentation about AI. If you don’t like it, that’s understandable, but don’t make it so that if someone uses AI, they have to defend themselves from accusations of exploiting labor and the environment. Those accusations are often times incorrect or made without substantial evidence.
I’m open to that conversation, as long as we can keep it respectful and productive. Drop a reply if you want, it’s way better than unexplained downvoting.
Yes, using existing works as reference is obviously something that real human artists do all the time, there’s no arguing that is the case. That’s how people learn to create art to begin with.
But, the fact is, generative AI is not creative, nor does it understand what creativity is, nor will it ever. Because all it is doing is performing complex data statistical analysis algorithms to generate a matrix of pixels or a string of words.
Im sorry, but the person entering in the prompt to instruct the algorithm is also not doing anything creative either. Do you think it is art to go through a fast food drive through and place an order? That’s what people are objecting to - people calling themselves artists because they put some nonsense word salad together and then think what they get out of it is some unique thing that they feel they created and take ownership of. If not for the AI model they are using and the creative works it was trained on, they could not have created it or likely even imagined it without it.
People are actively losing their livelihoods because AI tech is being oversold and overhyped as something that it’s not. Execs are all jumping on the bandwagon and because they see AI as something that will save them a bunch of money, they are laying off people they think aren’t needed anymore. So, just try to incorporate that sentiment into your understanding of why people are also upset about AI. You may not be personally affected, but there are countless that are. In fact, over the next two years, as many as 203,000 entertainment workers in the US alone could be affected
You want to have fun creating fancy kitbashed images based off of other people’s work, go right ahead. Just don’t call it art and call yourself an artist, unless you could actually make it yourself using practical skills.
Also, good luck trying to copyright it because guess what, you can’t.
Yes, using existing works as reference is obviously something that real human artists do all the time, there’s no arguing that is the case. That’s how people learn to create art to begin with.
But, the fact is, generative AI is not creative, nor does it understand what creativity is, nor will it ever. Because all it is doing is performing complex data statistical analysis algorithms to generate a matrix of pixels or a string of words.
Im sorry, but the person entering in the prompt to instruct the algorithm is also not doing anything creative either. Do you think it is art to go through a fast food drive through and place an order? That’s what people are objecting to - people calling themselves artists because they put some nonsense word salad together and then think what they get out of it is some unique thing that they feel they created and take ownership of. If not for the AI model they are using and the creative works it was trained on, they could not have created it or likely even imagined it without it.
I’d like to ask you what experience you have with generative art, because I’d like to explain a bit of what I know,
There’s also a spectrum of involvement depending on what tool you’re using. I know with web based interfaces don’t allow for a lot of freedom due to wanting to keep users from generating things outside their terms of use, but with open source models based on Stable Diffusion you can get a lot more involved and get a lot more freedom. We’re in a completely different world from March 2023 as far as generative tools go. Take a quick look at things work.
Let’s take these generation parameters for instance:
sarasf, 1girl, solo, robe, long sleeves, white footwear, smile, wide sleeves, closed mouth, blush, looking at viewer, sitting, tree stump, forest, tree, sky, traditional media, 1990s \(style\), <lora:sarasf_V2-10:0.7>
Negative prompt: (worst quality, low quality:1.4), FastNegativeV2
Steps: 21, VAE: kl-f8-anime2.ckpt, Size: 512x768, Seed: 2303584416, Model: Based64mix-V3-Pruned, Version: v1.6.0, Sampler: DPM++ 2M Karras, VAE hash: df3c506e51, CFG scale: 6, Clip skip: 2, Model hash: 98a1428d4c, Hires steps: 16, "sarasf_V2-10: 1ca692d73fb1", Hires upscale: 2, Hires upscaler: 4x_foolhardy_Remacri, "FastNegativeV2: a7465e7cc2a2",
ADetailer model: face_yolov8n.pt, ADetailer version: 23.11.1, Denoising strength: 0.38, ADetailer mask blur: 4, ADetailer model 2nd: Eyes.pt, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur 2nd: 4, ADetailer confidence 2nd: 0.3, ADetailer inpaint padding: 32, ADetailer dilate erode 2nd: 4, ADetailer denoising strength: 0.42, ADetailer inpaint only masked: True, ADetailer inpaint padding 2nd: 32, ADetailer denoising strength 2nd: 0.43, ADetailer inpaint only masked 2nd: True
To break down a bit of what’s going on here, I’d like to explain some of the elements found here. sarasf
is the token for the LoRA of the character in this image, and <lora:sarasf_V2-10:0.7>
is the character LoRA for Sarah from Shining Force II. LoRA are like supplementary models you use on top of a base model to capture a style or concept, like a patch. Some LoRA don’t have activation tokens, and some with them can be used without their token to get different results.
The 0.7 in <lora:sarasf_V2-10:0.7>
refers to the strength at which the weights from the LoRA are applied to the output. Lowering the number causes the concept to manifest weaker in the output. You can blend styles this way with just the base model or multiple LoRA at the same time at different strengths. Furthurmore you can adjust the UNet and Text Encoder by adding another colon like so : <lora:sarasf_V2-10:1:0.7>
for even more varied results. Doing this allows you to separate the “idea” from the “look” of the LoRA. You can even use a monochrome LoRA and take the weight into the negative to get some crazy colors.
The Negative Prompt is where you include things you don’t want in your image. (worst quality, low quality:1.4),
here are quality tags and have their attention set to 1.4. Attention is sort of like weight, but for tokens. LoRA bring their own weights to add onto the model, whereas attention on tokens works completely inside the weights they’re given. In this negative prompt FastNegativeV2
is an embedding known as a Textual Inversion. It’s sort of like a crystallized collection of tokens that tell the model something precise you want without having to enter the tokens yourself or mess around with the attention manually. Embeddings you put in the negative prompt are known as Negative Embeddings.
In the next part, Steps
stands for how many steps you want the model to take to solve the starting noise into an image. More steps take longer. VAE
is the name of the Variational Autoencoder used in this generation. The VAE is responsible for working with the weights to make each image unique. A mismatch of VAE and model can yield blurry and desaturated images, so some models opt to have their VAE baked in, Size
are the dimensions in pixels the image will be generated at. Seed
is the number representation of the starting noise for the image. You need this to be able to reproduce a specific image.
Model
is the name of the model used, and Sampler
is the name of the algorithm that solves the noise into an image. There are a few different samplers, also known as schedulers, each with their own trade-offs for speed, quality, and memory usage. CFG
is basically how close you want the model to follow your prompt. Some models can’t handle high CFG values and flip out, giving over-exposed or nonsense output. Hires steps
represents the amount of steps you want to take on the second pass to upscale the output. This is necessary to get higher resolution images without visual artifacts. Hires upscaler
is the name of the model that was used during the upscaling step, and again there are a ton of those with their own trade-offs and use cases.
After ADetailer
are the parameters for Adetailer, an extension that does a post-process pass to fix things like broken anatomy, faces, and hands. We’ll just leave it at that because I don’t feel like explaining all the different settings found there.
https://youtu.be/-JQDtzSaAuA?t=97
https://www.youtube.com/watch?v=HtbEuERXSqk
Not all selfies are art, but you can make art with cameras. I think the same applies here.
People are actively losing their livelihoods because AI tech is being oversold and overhyped as something that it’s not. Execs are all jumping on the bandwagon and because they see AI as something that will save them a bunch of money, they are laying off people they think aren’t needed anymore. So, just try to incorporate that sentiment into your understanding of why people are also upset about AI. You may not be personally affected, but there are countless that are. In fact, over the next two years, as many as 203,000 entertainment workers in the US alone could be affected
This EFF article by Katharine Trendacosta and Cory Doctorow touches on this. I think it’s worth a read.
You want to have fun creating fancy kitbashed images based off of other people’s work, go right ahead.
This is misinformation, and not how the technology works. Here’s a quick video explanation,
Just don’t call it art and call yourself an artist, unless you could actually make it yourself using practical skills.
This is just snobbery that people have always used to devalue the efforts of others. Punching down and gatekeeping won’t solve your problems, the people you’re really mad at are above you.
Art is about bringing your ideas into the world, anything beyond that is fetish. Spending hundreds of hours learning a skill isn’t art, it’s work. While I believe the effort invested in a work can contribute to its depth and meaning, that doesn’t make them better than works without as much effort.
cont.
Here is an alternative Piped link(s):
https://piped.video/-JQDtzSaAuA?t=97
https://piped.video/1d_jns4W1cM
https://www.piped.video/watch?v=HtbEuERXSqk
Piped is a privacy-respecting open-source alternative frontend to YouTube.
I’m open-source; check me out at GitHub.
I appreciate that you taken time to explain the technical aspects into what generative AI is processing under the hood, but the reality is that no amount of programming will ever be able to recreate the uniqueness and infinite variability of human creativity, emotion, imagination or consciousness. There is an immeasurable difference between true creativity and producing variations on a data set. I say this as both an artist and a programmer. I’m not just talking out of my ass.
I agree with you that a goal of art is to express ideas and that there’s are a lot of people in the art world that fetishize art in to being something more important than it is in certain contexts, but art is also a core component and something unique to humanity(and sometimes even to other species.) In that way, it’s something to be cherish and regarded - and throughout history it has been extremely culturally significant. Trying to translate these concepts into an algorithm, in my mind, nothing but an extremely arrogant waste of effort and time. Why not spend your time automating the boring shit no one wants to do rather than the creative things people actually enjoy doing?
I am not gatekeeping. I am just stating simple facts. I find it offensive and demeaning that you are devaluing the immense amount of effort that artists undergo to hone their crafts and produce art. You’re damn right it’s work - if you want to get proficient at something, that’s what it takes. I don’t care how boomer-ey that sounds. Yes, some artists have natural talent and don’t need as much effort as others. But, nonetheless, effort is required to create. Anyone can create art, not just some elite select few. But, not everyone can create art that is universally recognized as great or masterful, and it’s not a problem that need to be solved by technology. Unfortunately, art is subjective, so not everything one creates is perceived the same. That’s why some are more successful than others. You may argue that AI levels the playing field, but the fact is that it leverages the work of “successful” artists or artworks, and generates results that are perceived as successful or appealing as a result. It’s a shortcut. You are bypassing the effort otherwise needed by using a tool, which allows most users to to be totally ignorant of the basic knowledge required to create an art work - shape language, color theory, composition, lighting, appeal, posing, etc.
Entering a prompt into an AI model is akin to directing, producing or acting as a muse. It’s a very similar argument as to the validity or artistic merits of factory artists like Andy Warhol or Jeff Koons - While you are responsible for the idea that produces a result, you are still relying on the work and effort of not only the numerous team of people creating the AI model and its algorithms , but also the immeasurable amount of man-hours and creativity involved in creating the source content for the model training materials.
It’s one thing to use generative AI as tool, with intent to make use of the output as reference for your own work in a larger context. But to take the direct output and call it art is morally and ethically wrong. In my eyes, it makes you look like a total hack who doesn’t want to put the effort in to make things for themselves…no matter how much time you put into coming up with the prompt for the output.
I still stand by my original arguments - coming up with a prompt or a training data set to create an image is not art, because you are not actively involved with the creation of the imagery, itself. What an AI model generates is not a creative work and it is not your creation. If that is offensive to you, there’s nothing I can do about that, because it’s apparent that your arguments only serve to make yourself feel better about using generative AI.
It’s also apparent that you have an extremely skewed view of what art is and what it means to be an artist. Art, at its base level is about expressing HUMAN creativity, not what an algorithm interprets it to be. It’s about making countless, specific choices for each step of the creative process and having complete control of the final outcome. It’s those choices that make your art truly unique and an expression of your creative vision. It doesn’t matter if it is objectively bad or good, just that it came from you, and that every detail, every color, every line, was your choice, not an interpretation of your words.
Unless you are creating your own AI model from scratch and training it purely on your own artworks, I don’t see how you can, in good conscience, claim the results to be your own.
Any one can create art, but an artist is someone who dedicates themselves to their craft, as with any other craftsman. That passion is what separates an artisan from a hobbyist. You may view this as snobbery, but I view it as respect and honoring a tradition that spans all of human kind, back to the earliest cave paintings tens of thousands of years ago. I know my limits and what I’m capable of and I have come to terms with those deficiencies in my work. I’m not delusional enough to think that by generating an image through AI, it somehow makes up for those shortcomings and makes me into something I am not.
Mate, that’s not art, that’s coding. Congratulations on learning a new coding skill and how inputs can affect outputs. Frankly, it’s barely coding, it’s adding degrees of specification so a program can do all the work. I get that it took you a while to learn what all of it means and how it works, sort of, but something being hard to do doesn’t make it art.
And don’t cheapen photography by comparing it to generating an AI image. There’s physical labor involved in photography on top of composition and patience.
Part 2
Also, good luck trying to copyright it because guess what, you can’t.
This looks like it’s set to change. The US Copyright Office is proactively exploring and evolving its understanding of this topic and are actively seeking expert and public feedback. You shouldn’t expect this to be their final word on the subject.
It’s also important to remember Copyright Office guidance isn’t law. Their guidance reflects only the office’s interpretation based on experience, it isn’t binding in courts or other parties. Guidance from the office is not a substitute for legal advice, and it does not create any rights or obligations for anyone. They are the lowest rung on the ladder for deciding what law means.
Let’s keep it civil and productive. Jeering dismissive language like “Also, good luck trying to copyright it because guess what, you can’t.” isn’t helping your argument, they’re just mean spirited. Let’s have a civil discussion, even if we disagree. I’m open to keep talking, but I will quit replying if you continue being disrespectful.
It’s clear where you hold your stakes in the matter and where I hold mine. Whether or not you want to continue the conversation is up to you, but I’m not going to go out my way to be polite in the matter, because I don’t really give a shit either way or if you’re offended by what I say. AI personally affects me and my livelihood, so I do have passionate opinions about its use, how companies are adapting it and how it’s affecting other people like me.
All the article you linked shows is that they held a meeting, which doesn’t really show anything. The government has tons of meetings that don’t amount to shit.
So, instead of arguing whether or not the meeting actually shows they are considering anything different, I will explain my personal views.
In general, I’m not against AI. It is a tool that can be effective in reducing menial tasks and increasing productivity. But, historically, such technology has done nothing but put people out of work and increased profits for the executives and shareholders. At every given chance, creatives and their work are devalued and categorized as “easy”, “childish” or not a real form of work, by those who do not have the capacity to do it themselves.
If a company wants to adapt AI technology for creative use, it should solely trained off of content that they own the copyright to. Most AI models are completely opaque and refuse to disclose the materials they were trained on. Unless they can show me exactly what images were used to generate the output, then I will not trust that the output is unique and not plagiarizing other works.
Fair use has very specific use cases where it’s actually allowed - parody, satire, documentary and educational use, etc. For common people, you can be DMCA’ed or targeted in other ways for even small offenses, like remixes. Even sites like archive.org are constantly under threat by lawsuits. In comparison, AI companies are seemingly being given free pass because of wide adoption, their lack of transparency, and the vagueness as to where specifically the output is being derived from. A lot of AI companies are trying to adapt opt-out to cover their asses, but this is only making our perception of their scraping practices worse.
As we are starting to see with some journalism lawsuits, they are able to specifically point out where their work is being plagiarized, so I hope that more artists will speak up and also file suit for models where their work is blatantly being trained to mimic their styles. Because If someone can file copyright suit against another person for such matters, they should certainly be able to sue a company for the same unauthorized use of their work, when being used for profit.