this post was submitted on 10 Jul 2023
89 points (100.0% liked)

Technology

19 readers
1 users here now

This magazine is dedicated to discussions on the latest developments, trends, and innovations in the world of technology. Whether you are a tech enthusiast, a developer, or simply curious about the latest gadgets and software, this is the place for you. Here you can share your knowledge, ask questions, and engage in discussions on topics such as artificial intelligence, robotics, cloud computing, cybersecurity, and more. From the impact of technology on society to the ethical considerations of new technologies, this category covers a wide range of topics related to technology. Join the conversation and let's explore the ever-evolving world of technology together!

founded 2 years ago
 

In addition to the possible business threat, forcing OpenAI to identify its use of copyrighted data would expose the company to potential lawsuits. Generative AI systems like ChatGPT and DALL-E are trained using large amounts of data scraped from the web, much of it copyright protected. When companies disclose these data sources it leaves them open to legal challenges. OpenAI rival Stability AI, for example, is currently being sued by stock image maker Getty Images for using its copyrighted data to train its AI image generator.

Aaaaaand there it is. They don’t want to admit how much copyrighted materials they’ve been using.

you are viewing a single comment's thread
view the rest of the comments
[–] chemical_cutthroat@kbin.social 9 points 2 years ago* (last edited 2 years ago) (18 children)

If I do a book report based on a book that I picked up from the library, am I violating copyright? If I write a movie review for a newspaper that tells the plot of the film, am I violating copyright? Now, if the information that they have used is locked behind paywalls and obtained illegally, then sure, fire ze missiles, but if it is readily accessible and not being reprinted wholesale by the AI, then it doesn't seem that different from any of the other millions of ways we use data in every day life. Just because a machine learned it instead of a human, I don't believe that it makes it inherently wrong. I am very open to discussion on this, and if anyone has a counter-argument, I'd love to hear it, because this is a new field of technology that we should all talk about and learn to understand better.

Edit: I asked GPT-4 what it thought about this, and here is what it said:

As an AI developed by OpenAI, I don't access any copyrighted databases, subscription sites, confidential information, or proprietary databases for my learning or response generation. I was trained on a diverse range of internet text, which includes publicly available data in multiple languages and formats. The training also involves learning from large-scale datasets that contain parts of many kinds of texts (books, websites, scientific articles, etc.). However, I don't know specifics about which documents were in my training set or have access to any specific documents or sources.

Your question seems to pertain to how copyright law applies to AI. This is a complex and evolving area of law and policy, largely because AI systems like me do not "read" or "use" material in the same way a human would. A human might read a book and then consciously recall information from it when writing or speaking. In contrast, AI models do not have consciousness or awareness, they analyze statistical patterns in the data they were trained on and use these patterns to generate responses.

When a human uses copyrighted material to create something new, the law often considers this to be a derivative work. It is typically necessary to obtain permission from the copyright holder to create derivative works. However, AI doesn't use or recall copyrighted material directly; instead, it learns patterns from a huge amount of text data to predict or generate text based on a given prompt.

Dissemination of copyrighted material, such as through forum posts or Q&A platforms, without proper licensing or within the bounds of fair use, can be a violation of copyright law. AI doesn't 'disseminate' copyrighted material; it generates responses based on patterns it has learned.

That said, the legal interpretation of how copyright law applies to machine learning and AI is still evolving, and there can be a lot of complexities and nuances. Therefore, the information here should not be taken as legal advice.

As of my last update in September 2021, this was the case. However, I recommend checking the most recent legal standpoints and opinions, as this area is still evolving rapidly.

[–] mack123@kbin.social 2 points 2 years ago (3 children)

It is an area that will require us to think carefully of the ethics of the situation. Humans create works for humans. Has this really changed? Now consumption happens through a machine learning interface. I agree with your reasoning, but we have an elephant in the room that this line of reasoning does not address.

When we ask the AI system to generate content in someone else's style or when the AI distorts someone's view in its responses. It is in this area where things get very murky for me. Can I get an AI to eventually write another book in Terry Pratchett's style? Would his estate be entitled to some form of compensation? And that is an easier one compared to living authors or writers. We already see the way image generating AI programs copy artists. Now we are getting the same for language and more.

It will certainly be an interesting space to follow in the next few years as we develop new ethics around this.

[–] chemical_cutthroat@kbin.social 2 points 2 years ago* (last edited 2 years ago) (2 children)

@mack123

Can I get an AI to eventually write another book in Terry Pratchett's style? Would his estate be entitled to some form of compensation?

No, that's fair use under parody. Weird Al isn't compensating other artists for parody, so the creators of OpenAI shouldn't either, just because their bot can make something that sounds like Pratchett or anyone else. I wrote a short story a while back that my friend said sounded like if Douglas Adams wrote dystopian fiction. Do I owe the Adams' estate if I were to publish it? The same goes for photography and art. If I take a picture of a pastel wall that happens to have an awkward person standing in front of it, do I owe Wes Anderson compensation? This is how we have to look at it. What's good for the goose must be good for the gander. I can't justify punishing AI research and learning for doing the same things that humans already do.

[–] mack123@kbin.social 1 points 2 years ago (1 children)

That is the current stand of affairs yes. But it something that I think we will need to resolve as AI becomes better. When it becomes impossible to say which work was created by the human original and which by the AI.

I do think it would be ethically wrong for a company to profit by mimicking someone's style exactly. What incentive remains for the original style or work to exist if you cannot earn a living from it.

[–] chemical_cutthroat@kbin.social 1 points 2 years ago

I do think it would be ethically wrong for a company to profit by mimicking someone's style exactly. What incentive remains for the original style or work to exist if you cannot earn a living from it.

That's where we differ in opinion. I create art because it's what I enjoy doing. It makes me happy. Would I like to profit from it? Sure, and I do, to some extent. However, you are conflating two ideas. Art created for profit is no longer art, it is a product. The definition fundamentally changes.

I'm a writer, a photographer, and a cook. The first two I do for pleasure, the last for profit. If I write something that someone deems worthy to train an AI on, first, great, maybe I'm not as bad as I think I am. Second, though, it doesn't matter, because when I wrote what I wrote, it was a reflection of something that I personally felt, and I used my own data set of experience to create that art.

The same thing goes for photography, though slightly differently. When I'm walking around with my camera and taking shots, I do it because something has made me feel an emotion that I can capture in a camera lens. I have also done some model shoots, where I am compensated for my time and effort. In those shoots, I search for art in composition and theme because that's what I'm paid for, but once I finish the shoot, and I give the photographs to the model, what they do with them is their own business. If they use them to train AI, then so be it. The AI might be able to make some 99% similar to what I've done, but it won't have what I had in the moment. It won't have the emotional connection to the art that I had.

As far as the third, cooking, goes, I think it's the most important. When I follow a recipe, I'm doing exactly what the AI does. I use my data set to create something that is a copy of something someone else has done before. Maybe I tweak it here and there, but so does AI. I do this for profit. I feed people, and they pay me. Do I owe the man who created the Caesar Salad every time I sell one? It's his recipe. I make the dressing from scratch just like he did. I know that's not a perfect example, but I'm sure you can see the point I'm making.

So, when it comes to Art v. Product, there are two different sides to it, and both have different answers. If you are worried about AI copying "art" then don't. It can't. Art is something that can only be created by the artist in the moment, and may be replicated, but can never truly be copied, in the same way that taking a photo of the Mona Lisa doesn't make me DaVinci. However, if it's a product, then we are talking about capitalism, and here we can see that there is no argument against AI, because it is only doing what we have been doing for forever. McDonalds may as well be the AI of fast food burgers. Pizza Hut the AI of pizza. Taco Bell the AI of TexMex. Capitalism is about finding faster, cheaper ways of producing products that people want. Supply and demand. If someone is creating a product, and their product can be manufactured faster, and cheaper, by the competition, then the onus is on the original creator to find a way to stand out from the competition, or lose their marketshare to the competitor. We can't hamper AI just because some busker is having a hard time selling his spray paint on bowl planet scape art. If you mass produce for the sake of profit, you can't complain when someone out-mass produces you, AI or human. That's the way of the world.

load more comments (14 replies)