>> If you think OpenAI is less valuable because it can't use copyrighted content, then it should give some of that value back to the content.
But we are allowed to use copyrighted content. We are not allowed to copy copyrighted content. We are allowed to view and consume it, to be influenced by it, and under many circumstances even outright copy it. If one doesn't want anyone to see/consume or be influenced by one's copyrighted work, then lock it in a box and don't show it to anyone.
I have some, but diminishing, sympathy for artists screaming about how AI generates images too similar to their work. Yes, the output does look very similar to your work. But if I take your work and compare it to the millions of other people's works out there, I'd bet I can find some preexisting human-made art that looks similar to your stuff too.
This is why clothing doesn't qualify for copyright. No matter how original you think your clothing seems, someone in the last many thousands of years of fashion has done it before. Visual art may be approaching a similar point. No matter how original you think your drawings are, someone out there has already done something similar. They may not have created exactly the same image, but neither does AI literally copy images. That reality won't kill the visual arts, just as it didn't kill off the fashion industry.
I firmly believe that training models qualifies as fair use. I think it falls under research, and is used to push the scientific community forward.
I also firmly believe that commercializing models built on top of copyrighted works (which is what all works start out as) does not qualify as fair use (or at least shouldn't), and that commercializing models built on copyrighted material is nothing more than license laundering. Companies that commercialize copyrighted work in this manner should be paying for a license to train on the data, or should stick to using the licenses that the content was released under.
I don't think your example is valid either. The reason AI models generate content similar to other people's work is that those models were explicitly trained to do that. That is literally what they are and how they work. That is very different from people having similar styles.
> I firmly believe that training models qualifies as fair use
There's a hell of a lot of money to be made from this belief, so of course the HN crowd will hold it.
Some of us here who have been around the copyright hustle a little longer laugh at this bitterly and pray that the courts and/or Doctorow's activism save us. But there's so much money to be made from automated plagiarism, and the forces against it are so weak, that there's not much hope.
The world will be a much, much poorer place once all the artists this view exploits stop making art because they need to make a living.
I literally met and worked with Doctorow on a protest back in 2005, so I'm not exactly new to this. I also think that the only way you could have written your comment was by grossly misinterpreting my comment.
No, I have to disagree here. I'm not an artist, but I respect the creations of others. OpenAI does not. They could have trained on free data, but they did not want to because it would cost more (paying humans to find and vet said data, etc.).
> Sure. It's just a tool. That needs other people's art to work.
So does a human brain.
Which brings us to the other side of the reasoning: tools like Midjourney and OpenAI enable idiots (when it comes to drawing/animating... that includes me) to create engaging artwork.
Recently, generating artwork like that went from almost impossible to "great, but easily recognizable as AI artwork". Frankly, I expect the discussion will end when it stops being recognizable.
I hate Andreessen Horowitz's reasoning, but they're right about one thing: once we have virtual artists that are not easy to distinguish from "real" ones, the discussion will end. It does not really matter what anyone's opinion on the matter is, as it will not make a difference in the end.
A major difference between a human training himself by looking at art and a computer doing it is that the human ends up working for himself, while the computer is owned by some billionaire.
One enhances the creative potential of humanity as a whole, the other consolidates it in the hands of the already-powerful.
Another major difference is that a human can't use that knowledge to mass-produce that art at a scale that will put other artists in the poorhouse. The computer can.
Copyright exists to benefit humanity as a whole... And frankly, I see no reason why a neural network's output should be protected by copyright. Only humans can produce copyrightable works, and a prompt is not a sufficient creative input.
No, the beneficiaries of generative AI are the users because they set the prompt and use the outputs. Providers make cents per million tokens. It is a people empowerment technology.
Visual artists cannot create without tools, whether that tool is a brush and paint, a camera, or a neural network.
Whether an artist pays for a subscription to OpenAI or buys paint pots on Amazon.com, money is going to a billionaire; that is not a difference between AI and other art.
You are also ignoring the existence of non-commercial open-source AI models; they exist.
Regarding copyright, we copyright output not input. Otherwise most photography would be uncopyrightable.
There's a substantive difference in whether the artist is using the tool, or the tool works on its own. A paintbrush doesn't produce a painting by itself, a human needs to apply an incredibly specialized creative skillset, in conjunction with the paintbrush to do so.
An LLM takes a prompt and produces a painting. No sane person would say that I 'drew' the painting in question, even if I provided the prompt.
> Regarding copyright, we copyright output not input. Otherwise most photography would be uncopyrightable.
We copyright things that require creative input. A list of facts or definitions does not require creative input, and is therefore not copyrightable.
Using an LLM does not meet the bar for creative input.
> There's a substantive difference in whether the artist is using the tool, or the tool works on its own. A paintbrush doesn't produce a painting by itself, a human needs to apply an incredibly specialized creative skillset, in conjunction with the paintbrush to do so.
That sounds like a kinder restatement of the opinion at the top of the thread: "Artists are mad because this tool empowers other people, who they view as less talented, to make art too."
Artists might not like the phrasing, but scratch the surface and there's a degree of truth there. It's an argument from self-interest, at core.
One small nitpick: It is completely possible for an artist to make all of their own tools, and indeed for the majority of history that is exactly how things went.
But today, the artist who can also create a robust version of Photoshop on their own doesn't really exist. Maybe some can write code at that level, but certainly not a majority, and it's certainly not the same as sanding wood to make a paintbrush.
> I see no reason why a neural network's output should be protected by copyright. Only humans can produce copyrightable works, and a prompt is not a sufficient creative input.
If you graduated from school and only used work that was public domain, would you have all the knowledge you currently have? Have you learned anything from anybody since graduating?
Where is the line? It's ok for humans to learn from others' work but not a machine?
It is NOT okay for a machine's owner to profit from that learning, unless the machine's owner compensates the owners of the training data. That is where the line should be drawn.
If I read a lot of stories in a certain genre that I like, and I later write my own story, it’s almost by definition going to be a mish-mash of everything I like.
Should I pay the authors of the books I read when I sell mine?
We shouldn't hold individual humans and ML models to the same standards, because ML models themselves are products capable of mass production and individual humans are not even remotely at the same scale.
If you write that book, chances are you will gain some fans that are also fans of other authors in that genre.
If ML models write in that genre, they can flood it so thoroughly that human artists won't be able to compete.
I feel like the issue here is that you are giving AIs agency.
AIs are not magic. They are tools. They are not alive, they do not have agency. They do not do things by themselves. Humans do things, some humans use AI to do those things. Agency always rests with a combination of the tool's creator and operator, never the tool itself.
Is there really a difference between a human flooding the market using AI and a human flooding the market using a printing press?
Even if humans can't compete (an obviously untrue premise from my perspective, but let's assume it for the sake of argument), is that a bad thing? The human endeavor is not meant to be a make-work project. Humans should not be forced to pointlessly toil out of protectionism when they could be turning their attention to something that can't be automated.
>Is there really a difference between a human flooding the market using AI and a human flooding the market using a printing press?
A magnitude of difference, yes. Even a printing press will be limited by natural resources, which require humans to procure.
A computer server can do a lot more with a lot less. And is much easier to scale than a printing press.
>Even if humans can't compete (an obviously untrue premise from my perspective, but let's assume it for the sake of argument), is that a bad thing?
When the AI can be argued to be stealing humans' work, yes. A printing press didn't need to copy Shakespeare to be useful. And it would have benefited Shakespeare anyway, because more people got to read his works.
So far I don't see how AI benefits artists. Optimistically:
- An artist can make their own market? Doubtful; they will be outgunned by SEO-optimized ads from corporations.
- They can make commissions faster? To begin with, commissions aren't a sustainable business. Even if they 5x the labor and somehow keep the same prices, they aren't living well. But in reality, they will get less business, as people will AI-generate their "good enough" art and probably won't pay as much for something not fully hand-drawn.
- Okay, they can make bigger commissions? There's a drama about spending 50k on a 3-minute AMV; imagine if that could be done by a single artist in a day now!... Well, give it another 10 years. A lot of gen AI is static assets. Rigging and animating are still far from acceptable quality, and a much harder problem space. I also wouldn't be surprised if by then AI models have their own phase of enshittification and you end up blowing hundreds or thousands anyway.
-----
>Humans should not be forced to pointlessly toil out of protectionism when they could be turning their attention to something that can't be automated.
Until someone conceptualizes a proper UBI scheme, pointlessly toiling is how most of the non-elite live. I have yet to hear of a real alternative for these displaced artists to move towards.
So what? So we all just become managers in meetings in 30 years?
> A magnitude of difference, yes. Even a printing press will be limited by natural resources, which require humans to procure.
A computer server can do a lot more with a lot less. And is much easier to scale than a printing press.
AI runs on some of the most power-hungry and expensive silicon on the planet. Comparing a GPU cluster to a printing press and then stating that the GPU cluster is not limited by natural resources is just silly. Where do the materials to make the processors come from?
> When the AI can be argued to be stealing humans' work, yes. A printing press didn't need to copy Shakespeare to be useful. And it would have benefited Shakespeare anyway, because more people got to read his works.
The same can be true for AI as well. I could see a picture and then ask AI whose style it is. Then I could go look up more work by that artist, increasing their visibility.
> - They can make commissions faster? To begin with, commissions aren't a sustainable business. Even if they 5x the labor and somehow keep the same prices, they aren't living well. But in reality, they will get less business, as people will AI-generate their "good enough" art and probably won't pay as much for something not fully hand-drawn.
Is this a complaint that something got cheaper to make? This affects more than just artists. For instance, the quality of code output from LLMs is quite high. So wages across the board will decrease, yet capabilities will increase. This is a problem external to AI.
> Until someone conceptualizes a proper UBI scheme, pointlessly toiling is how most of the non-elite live. I have yet to hear of a real alternative for these displaced artists to move towards.
Again, this is not just artists, and the path forward is the same as it's always been with technological advancements: increase your skill level to above the median created by the new technology.
>Comparing a GPU cluster to a printing press and then stating that the GPU cluster is not limited by natural resources is just silly. Where do the materials to make the processors come from?
Probably mined by slaves in third-world countries (in the literal "owning people" sense). But still, these servers already exist and scale up way more than a tree.
>The same can be true for AI as well. I could see a picture and then ask AI whose style it is. Then I could go look up more work by that artist, increasing their visibility.
Sure, and you can use p2p to download perfectly legal software. We know how the story ends.
>Is this a complaint that something got cheaper to make... not just artists, and the path forward is the same as it's always been with technological advancements: increase your skill level to above the median created by the new technology.
It's a complaint that people, even with more efficiency, still can't make a living, while the millionaires become billionaires. I'm not even concerned about software wages; some principal SWE going from 400k to 200k will still live fine.
Artists going from 40k to 40k (but now working more efficiently) is exactly how we ended up with wages stagnating for 30 years. And yes, it is affecting everyone even pre-AI. The median is barely a living wage anymore, which is what "minimum wage" used to be.
If we lived in a work optional world I don't think many would care. But we don't and recklessly taking jobs to feed the billionaires is just going to cause societal collapse if left unchecked.
I'm not. But I'm going to hold a company that's responsible for the production more accountable than a consumer who can't research the sourcing of every single part of their personal device.
>Was your comment written by some new flamebait AI
IIRC AI focuses on formal language and discourages slang. Either way:
>Please respond to the strongest plausible interpretation of what someone says, not a weaker one that's easier to criticize. Assume good faith.
> Is there really a difference between a human flooding the market using AI and a human flooding the market using a printing press?
Yes. A printing press only floods the market with copies. An AI floods the market with new derivative works.
A human producing a single creative work and then flooding the market with copies leaves lots of room for other humans to produce their own novel work. An AI flooding the market with new derivative works leaves no such room.
I work with DNNs a lot professionally and remain a proponent of the technology, but what OpenAI et al are doing is highly exploitative and scummy. It’s also damaging their social licence and may end up setting the field back.
It’s potentially nice for the consumer. If I could get personalised audio and video content created on demand for me, that would be pretty amazing. But it does disincentivise people from creating content rather than just consuming it, and I think that could end up taking away a lot of the magic from life.
It'd be a good point if it weren't for the fact that search engines didn't exist until Google, because of technology, and that courts didn't need to consider the issue until then. So where does your point get us? We are here now.
When I was in university, I remember a humanities professor who had a concordance for the Iliad on his shelf. As a CS person, it was so cool to see the ancient version of a search engine.
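A concordance is essentially an inverted index: for each word, the list of places it occurs. A toy sketch in Python (my own illustration, with made-up sample lines) of how little machinery that "ancient search engine" needs:

    from collections import defaultdict

    def build_concordance(lines):
        # Map each normalized word to the line numbers where it appears.
        index = defaultdict(list)
        for lineno, line in enumerate(lines, start=1):
            for word in line.lower().split():
                index[word.strip(".,;:!?")].append(lineno)
        return index

    text = [
        "Sing, O goddess, the anger of Achilles",
        "son of Peleus, that brought countless ills",
    ]
    print(build_concordance(text)["of"])  # -> [1, 2]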
I think the post is making reference to the "thing" which made Google stand out in a sea of existing search engines.
"Using the idea of relevancy, they built Google in a way that - in comparison to other search engines at the time - was simply better at connecting users with more pertinent results.
"A query typed in Google provided more utility and relevancy than did Excite, Yahoo and other search engines.
Why? I don't see any way the ranking algorithm would affect this discussion, and if it did, I think old-style search engines (ranking based on keywords) would make the comparison slightly better than Google-style (PageRank) ones.
Computers and machines have been capable of mass production for decades, and humans have used them as tools. In the past 170 years, these tools of mass production have already diminished many thousands of professions that were staffed by people who had to painstakingly craft things one at a time.
Why is art some special case that should be protected, when many other industries were not?
Why should we kill this technology to protect existing artistic business models, when many other technologies were allowed to bloom despite killing other existing business models?
>Why is art some special case that should be protected, when many other industries were not?
Because in this case the art is still necessary for the machine to work. You don't need horse buggies to make a car, nor existing books to make a printing press. You DO need artists' art to make these generative AI tools work.
If these worked purely off of open-source art or truly from scratch, I wouldn't personally have an issue.
>Why should we kill this technology to protect existing artistic business models,
We don't need to kill it. Just pay your dang labor. But if we are treating proper compensation as stifling technology, I'm not surprised people are against it.
Maybe in the 2010s tech would have had the goodwill to pull this off in PR, but the 2020s have drained that goodwill and then some. Tech made so many promises to make lives easier, and now it has joined the very corporations it claimed to fight against.
>Nobody can really answer these questions.
Well, it's in the courts, so someone is going to answer it soon-ish.
> We don't need to kill it. Just pay your dang labor.
> But if we are treating proper compensation as stifling technology, I'm not surprised people are against it.
That's just it, nobody looking to get paid by OpenAI actually did any labor for OpenAI. They did labor for other reasons, and were happy with it.
OpenAI found a way to benefit by learning from these images. The same way that every artist on the planet benefits by learning from the images of their fellow artists. OpenAI just uses technology to do it much more efficiently.
This has never been considered labor in the past. We've never asked artists to "properly compensate" each other for learning/inspiration in the past. I don't know why it should be considered labor or proper compensation now.
There are many ways an artist can compensate their influences. Some of them are monetary.
When discussing our work, we can name them.
When one of our influences comes out with a new body of work, we can gush about it to our own fans.
When we find ourselves in a position of authority, we can offer work to our influences. No animation studio is really complete without someone old enough to be a grandfather hanging out helping to teach the new kids the ropes in between doing an amazing job on their own scenes, and maybe putting together a few pitches, for instance.
We can draw fan art and send it to them.
None of these are mandatory, but artists tend to do this, because we are humans, and we recognize that we exist in a community of other artists, and these all just feel like normal human things to do for your community.
And if an artist suddenly starts wholesale swiping another artist's style without crediting them, their peers get angry. [1]
OpenAI isn't gonna tell you that it was going for a Cat & Girl kind of feel in this drawing. OpenAI isn't gonna offer Dorothy Gambrell a job. OpenAI isn't going to tell you that she just came out with a new collection and she's still at the top of her game, and that you should buy it. OpenAI's not going to send her a painting of Cat & Girl that it did for fun. OpenAI isn't going to do anything for her unless the courts force it to, because OpenAI is a corporation who has found a way to make money by strip-mining the stuff people post publicly on the Internet because they want other humans to be able to see it.
Most people know 20,000-40,000 words. Let's call it 30,000. You've learned 99.999% of those 30,000 words from other people. And don't get me started on phrases, cliches, sentence structures, etc.
How many of those words do you remember learning? How many can you confidently say you remember the person or the book that taught you the word? 5? 10? Maybe 100?
That's how brains work. We ingest vast amounts of information that other people put out into the world. We consume it, incorporate it, and start using it in our own work. And we forget where we even got it. My brain works this way. Your brain works this way. Artists' brains work this way. GPT-4 works this way.
The idea that a visual artist can somehow recall where they first saw many of the billions of images stored in their brain -- the photos, movies, architecture, paintings, and real-life scenes that play out every second of every day -- is laughable. Almost all of that goes uncredited, and always will.
I tend to fall more on the "training should be fair use" side than most, but your comment seems to be missing the point. Nobody is arguing that models are violating copyright or social norms around credit simply because they consume this information. Nobody ever argued/argues that the traditional text generation in Markov models on your phone's keyboard runs afoul of these issues. The argument being made is that these particular models are now producing content that very clearly does run into these norms in a qualitatively different way. You cannot convincingly make the argument that the countless generated "X, but in the style of Y" images, text, and video going around the internet are exclusively the product of some unknowable mishmash of influences; there is clearly some internalized structure of "this work has this name" and "these works are associated with this creator".
To take it to an extreme: you obviously can't use one of the available neural-net lossless compression algorithms to circumvent copyright law or citation rules (e.g., distributing a local LLM that helpfully displays the entirety of some particular book when you ask it to), and you can't just tweak it to be a little lossy by changing one letter, or a little more lossy than that, etc. On the other hand, any LLM that performs exactly the same as a Markov model would presumably be fine. So there is a line somewhere.
A company hires an artist. That artist has observed a ton of other artists' work over the years. The company instructs that artist to draw, "X but in the style of Y", where Y is some copyrighted artwork. The company then prints the result and puts it on their packaging.
A company builds an AI tool. That AI tool is trained on a ton of artists' work over the years. The company opens up the AI tool and asks it to draw, "X but in the style of Y," where Y is some copyrighted artwork. The company then prints the result and puts it on their packaging.
What's the difference?
I'd argue there isn't one. The copyright infringement isn't the ability of the artist or the AI tool to make a copy. It's the act of actually using it to make a copy, and then putting that out into the world.
Ultimately, only high courts in each jurisdiction can decide. I can imagine a case where some highly advanced nations decide different interpretations that cause conflict. Then, we need an amendment to the widely accepted international copyright rules, the Berne Convention. Ref: https://en.wikipedia.org/wiki/Berne_Convention
Okay, but then that's an argument subject to the critiques made upthread that you were initially trying to dismiss. You can't claim that AI doesn't need to worry about citing influences because it's just doing a thing humans wouldn't cite influences for, then proceed to cite an example where you would very much be expected to cite your influences, and AI wouldn't, as evidence.
I never argued that AI doesn't need to worry about citing influences. If I am a person using a tool to create a work, and the final product clearly resembles some copyrighted work that I need to reference and give credit to, what does it matter if my tool is a pencil, a graphics editing program, a GPT, or my own mind? I can cite the work.
Like I said, this is exactly what the comment you first replied to was explaining. It is very clearly not the same as a pencil or a graphics editing program, because those things do not have a notion of Cat & Girl by Willem de Kooning embedded in them that they can utilize without credit. It is clearly not the same as your mind, because your mind can and, assuming you want to stay in good standing, will provide credit for influence.
Again, take it back to basics: do you believe it is permissible to share a model itself (not the model output, the model), either directly or via API, that can trivially reproduce entire copyrighted works?
I'd say that a tool itself can't be guilty of copyright infringement, only the person using the tool can. So it doesn't matter if the GPT has some sort of "notion" of a copyrighted work in it or not. GPTs aren't sentient beings. They don't go around creating things on their own. Humans have to sit down and command them, and at that point, whoever issued the command is responsible for the output. Copyright violation happens at the point of creation or distribution, not at the much earlier point of inspiration or learning.
So yeah, of course imo it should be permissible to share a model that can reproduce copyrighted works. Being "capable of being used" to violate a law is not the same thing as violating a law.
A ton of software on my computer can copy-paste others' work, both images and words. It can trivially break copyright. Hell, there are even programs out there that can auto-generate code for me, code that various companies have patent claims for. Do I think distributing any of this software should be illegal? No. But I think using that software to infringe on someone's copyright should be.
(Note: This is different than if the program distributed came with a folder that included bunch of copyrighted works. To me, sharing something like that would be a copyright violation.)
I'm not sure how to explain this any clearer. I am talking about neural net compression algorithms. As in, it is literally just a neural net encoding some copyrighted work, and nothing else. It is ultimately no more intelligent than a zip file, other than the file and program are the same. You can't seriously believe that these programs allow you to avoid copyright claims, can you? Movie studios, music producers, and book publishers should just pack it in, pirates just need to switch to compressing by training a NN, and seeding those instead, and there's no legal precedent to stop them? If you do think that, do you at least understand why nobody is going to take your position seriously?
A neural net designed to do nothing other than compress and decompress a copyrighted work is completely different than GPT-4, unless I'm uninformed. To me that sounds like comparing a VCR to a brain. GPT-4's technology is clearly something that "learns" in order to be able to produce novel thoughts and ideas, rather than merely compressing. A judge or jury would easily understand that it wasn't designed just to reproduce copyrighted works.
> It is clearly not the same as your mind, because your mind can and, assuming you want to stay in good standing, will provide credit for influence
I forgot to respond to this, but it's not true. Your mind is incapable of providing credit for 99.9% of its influence and inspiration, even when you want it to. You simply don't remember where you've learned most of the things you've learned. And when you have a seemingly novel idea, you can't always be aware of every single influential example of another person's work/art that combined to generate that new idea.
> A neural net designed to do nothing other than compress and decompress a copyrighted work is completely different than GPT-4, unless I'm uninformed.
Compression and the output from LLMs are cousins. The model tries to predict what continuations are likely, given context. Indeed, it takes a lot of effort to make LLMs less willing to just output training data verbatim. And conversely, you can get compression algorithms to do things similar to what LLMs do (poorly).
Whether this also describes most of the human cognitive process is subject to debate.
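To make the "cousins" claim concrete, here's a minimal sketch (my own toy code, not anyone's actual system): a model that predicts the next symbol with probability p lets an entropy coder store that symbol in about -log2(p) bits, so the better the model predicts the text, the fewer bits the text costs.

    import math
    from collections import Counter, defaultdict

    def train_bigram(text):
        # Count next-character frequencies for each single-character context.
        counts = defaultdict(Counter)
        for prev, nxt in zip(text, text[1:]):
            counts[prev][nxt] += 1
        return counts

    def ideal_bits(text, counts):
        # Sum of -log2(p) over the text: the size an entropy coder
        # driven by this predictive model could approach.
        bits = 0.0
        for prev, nxt in zip(text, text[1:]):
            ctx = counts[prev]
            # Laplace smoothing over a 256-symbol alphabet so unseen
            # continuations still get nonzero probability.
            p = (ctx[nxt] + 1) / (sum(ctx.values()) + 256)
            bits += -math.log2(p)
        return bits

    sample = "the cat sat on the mat. the cat sat on the hat."
    model = train_bigram(sample)
    print(len(sample) * 8, "raw bits")
    print(round(ideal_bits(sample, model)), "model-coded bits")

Note that the model here is scored on its own training data, which is the memorization concern from above in miniature.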
Individual words aren't comparable to the things people are worried about getting copied. People are much more able to tell you where they learned about more sophisticated concepts and styles.
The same principle applies, though. They can tell you maybe a dozen, maybe a few dozen, concepts they've learned and use in their work. But what about the thousands of concepts they use in their work that they can't tell you about? The patterns they've noticed, the concepts that don't even have names, but that came from seeing things in the world that were all created by other people?
For example, how many artists drawing street scenes credit the designer at Ford Motors for teaching them what a generic car looks like? How many even know which designers created their mental model of a car?
> That's just it, nobody looking to get paid by OpenAI actually did any labor for OpenAI.
To me this is a strong point in favor of the idea that OpenAI has no business using their work. How can you even think it's ok for OpenAI to use work that was not done for them without paying some kind of license? They aren't entitled to the free labor of everyone on the internet!
> How can you even think it's ok for OpenAI to use work that was not done for them without paying some kind of license?
At the risk of answering a rhetorical question: because copyright covers four rights: copying, distribution, creation of derivative works, and public performance, and LLM training doesn't fit cleanly into any of these, which is why many think copying-for-the-purpose-of-training might be fair use (courts have yet to rule here).
I think the most sane outcome would be to find that:
- Training is fair use
- Direct, automated output of AI models cannot be copyrighted (I think this has already been ruled on[0] in the US).
- Use of a genAI to create works that would otherwise be considered a "derivative work" under copyright law can still be challenged under copyright.
The end result here would be that AI can continue to be a useful tool, but artists still have legal teeth to come after folks using the tool to create infringing works.
Of course, determining whether a work is similar enough to be considered infringing remains a horribly difficult challenge, but that's nothing new[1], and will continue to hinge on how courts assess the four factors that govern fair use[2].
The entire point of the patent system was to say inventors can publish their designs without them being stolen, so that future inventors can build on their work.
>They did labor for other reasons, and were happy with it.
True; sadly, most of those copyrights are probably owned by other megacorps. So they either collude to suppress the entire industry or eat each other alive in legal clashes. The latter is happening as we speak (the writers for the NYT are probably long retired, but the NYT still owns the words), so I guess we'll see how that goes.
>OpenAI found a way to benefit by learning from these images. The same way that every artist on the planet benefits by learning from the images of their fellow artists.
If we treat AI like humans, art historically has an equally thin line between inspiration and plagiarism. There are simply more objective metrics to measure now because we can indeed go inside an AI's proverbial brain. So the metaphor is pretty apt, except with more scrutiny able to be applied.
> Why is art some special case that should be protected, when many other industries were not?
It shouldn't be.
As soon as someone makes an AI that can produce its own artwork without requiring ingesting every piece of stolen artwork it can, then I'm on board.
But as long as it needs to be trained on the work of humans it should not be allowed to displace those people it relied on to get to where it is. Simple as that.
Are there any humans that can produce artwork without ingesting inspiration from other art? Do you know any artists that lived in a box their whole life and never saw other art? Do you know any writers who'd never read a book?
Are there any human artists who can't, if requested, draw or write something that's a copy of some other person's drawings or writings?
Also, FYI, you can't steal digital artwork. You can only commit copyright infringement, which is not the same crime as theft, because theft requires depriving the owner of something in their possession.
> Are there any humans that can produce artwork without ingesting inspiration from other art? Do you know any artists that lived in a box their whole life and never saw other art? Do you know any writers who'd never read a book?
> Are there any human artists who can't, if requested, draw or write something that's a copy of some other person's drawings or writings?
This is still pretending that humans and AI models are equivalent actors that should have the same rights.
Emphatically no they shouldn't. The capabilities are vastly different. Fair use should not apply to AI.
This isn't about giving "rights" to machines. Machines are just tools. The question is about what humans are allowed to do with those tools. Are humans using AI models and humans not using AI models equivalent actors that should have the same rights? I'd argue emphatically yes they should.
The thing is, we already have doctrine that starts to encompass some of these concepts with fair use.
The four pronged test in US case law:
- the purpose and character of use (is a machine doing this different in purpose and character? many would say yes. is "ripping-off-this-artist-as-a-service" different than an isolated work that builds upon another artist's art?)
- the nature of the copyrighted work
- the amount and substantiality of the portion taken (can this be substantially different with AI?)
- the effect of the use upon the potential market for the original work (might mechanization of reproducing a given style have a larger impact than an individual artist inspired by it?)
These are well balanced tests, allowing me as a classroom teacher to duplicate articles nearly freely but preventing me from duplicating books en masse for profit (different purpose; different portion taken; different impact on market).
The problem with this conversation is that it's being had by people who make the top-level comment here stating that clothing is not copyrightable. It is. Clothing design is copyrightable. This was a huge recent case, Star Athletica. They know nothing about copyright law; they just build intuitions from the world around them, but those intuitions are complete nonsense because they are formed in ignorance of the actual law, what it does, and why. I find it exhausting.
Your sentiment is probably correct in that there are many aspects of copyright law that are not strictly aligned with the public's intuition. But your example is a bit of a reach. Star Athletica was a relatively novel holding that a specific piece of clothing, when properly argued, can qualify as copyrightable as a semi-sculptural work of art; however, that quality of a given piece is separate from its character as clothing. In fact, the USSC in Star Athletica explicitly held a designer/manufacturer has "no right to prohibit any person from manufacturing [clothing] of identical shape, cut, and dimensions" to clothing which they design/manufacture. That quote is directly from a discussion of the ability to apply copyright protections to clothing design. I think the end result is that trying to argue technical legal issues around a poorly implemented statutory regime is always fraught with errors. That really leaves moral and commercial arguments outstanding, and advocacy should try to focus on those when not fighting to effect change in the law these copyright determinations are based on.
And just to be clear, this post does not constitute legal advice.
You're dismissing my comment because of what someone else said upthread?
I hate the desire to meta-comment about the site rather than argue on the merits.
We obviously don't know so much about how courts will interpret copyright with LLMs. There's a lot of arguments on all sides, and we're only going to know in several years after a whole lot of case law solidifies. There are so many questions, (fair use, originality, can weights be copyrighted? when can model output be copyrighted? etc etc etc). Not to mention that the legislative branch may weigh in.
This discourse by citizens who are informed about technology is essential for technology to be regulated well, even if not all participants in the conversation are as legally informed as you'd wish. Today's well-meaning intuitions about what deserves copyright and why inform tomorrow's case law and legislation.
> Emphatically no they shouldn't. The capabilities are vastly different. Fair use should not apply to AI.
Fair use applies even to use of traditional algorithms, like the thumbnailing/caching performed by search engines. If I make a spam detector network, why should it not be covered by fair use?
Fair use applies to humans and the things they do (including with AI). It is not something that applies to algorithms in themselves. AIs are not people; the people who use them are people, and fair use may or may not apply to the things they do depending on the circumstances of whatever it is they do. The agent is always the human, not the machine.
True; consider the "it" in my question ("If I make a spam detector network, why should it not be covered by fair use?") as "my making (and usage) of the network".
No idea on the legality, but common sense suggests that the difference would be that a spam detector doesn't replace the products that it was trained on, while AI-generated "art" is intended to replace human artists.
> common sense suggests that the difference would be that a spam detector doesn't replace the products that it was trained on
The extent to which it supplants the original work is one of the fair use considerations.
I think it'd make more sense to have a stance of "current LLMs and image generators should be judged by fair use factors and I believe they'd fail", though I'd still disagree, instead of having machine learning models subject to a different set of rules than humans and traditional algorithms.
That is indeed the most common stance. There isn't nearly as much outcry over, say, image classification by LLMs, as there is over AI "art" generation.
The question is "is it a derivative work of the original?" - not if it is a generative work.
If that were the distinction to be made, using ChatGPT as a classifier would be acceptable, while using it to write new spam (see the "I am sorry" Amazon listings of the other day) would be unacceptable.
If two different uses of a tool allow for both infringing and non-infringing uses (are photocopiers allowed to make copies(!) of copyrighted works?) it has generally been the case that the tool is allowed and the person with agency to either use the copyrighted work in an infringing or a non-infringing way is the one to come under scrutiny.
I believe that if OpenAI is found to have committed copyright infringement in training the model, then an argument that training a model on spam constitutes copyright infringement could be reasonably constructed.
If, on the other hand, OpenAI is found to have been sufficiently transformative in its creation of the model and only some uses are infringing, then it is the person who did the infringing (as with a photocopier, or a printer printing off a copy of a comic from the web) who should face legal consequences.
> Are there any humans that can produce artwork without ingesting inspiration from other art?
Logically, the answer to this is (almost certainly) yes, so you’ll need to discount this argument.
If the answer were no, then either an infinite number of humans have lived (such that there was always a previous artist to learn from), or it was true in the past but false in the present, which seems unlikely given human brains have generally become more and not less sophisticated over time.
I presume what you’re missing here is that the brain can be inspired from other sources than human art. For example: nature; life experience; conversation.
Not making any other comment about what machines can or can't do; I just wanted to point out that this argument is invalid, as it comes up a lot and is probably grounded in ignorance of the artistic process. It's such a strange idea to suggest that the artistic process is ingesting lots of art to make more art. That's such a weird worldview. It's like insisting every artist makes art the way Quentin Tarantino makes films.
I’ve spent a lot of time with artists, I’ve worked with them, I’ve been in relationships with artists, and I can tell you the great ones see the world differently. There’s something about their brains that would cause them to create art even if born on a desert island without other human contact. Some of them don’t even take an interest in other art.
In fact, those artists that _do_ make art heavily based on other artists’ work as suggested are often derided as “derivative” and “unoriginal”.
> When a little child draws a tree for the first time, where do they draw inspiration? Do you think they were reviewing works of Picasso?
Are we going to discount the hundreds to thousands of artistic pictures children are exposed to? Or how about the teacher sitting up front demonstrating to the class how to draw a tree?
> Do you not have eyes, ears, do you not perceive and get inspiration from the natural world around you?
Learning to see as an artist is a distinct skill. Being able to take the super-compressed, simplified world view that the mind sees and put something recognizable on paper is a specialized skill that has to be developed. That skill is developed by doing it over and over again, often by copying the style of an artist that someone enjoys.
Or to put it another way: go to any period in history prior to the mid-20th century, and art in a given region starts to share the same style, dramatically so, because people were inspired by each other, almost to a comical extent. (Financial reasons also had something to do with it, of course; artists paint/carve/engrave/etc. what sells!)
Yeah, but that’s not really your sole source of inspiration. My son has been ‘inspired’ by the art of all other kids in his kindergarden. Certainly by the time he gets to the age where he does it professionally he’s been inspired by an uncountable number of people.
Being inspired isn't against the law; copying is. It'd be one thing if this conversation could be had with useful terminology that's actually on point. Instead we have you, insisting that there is no creative process, that there is only experiencing other art and inevitably copying (because apparently you think that's the only thing humans can do!). It's all so telling. Yet it's tragic, because so many here don't even realize it. I'm sad for your inability to engage with creativity and creative acts.
We don't know what percentage is independent inspiration for a person using the AI to create art.
Once upon a time it was a contentious idea that humans had significant authorship in photographs, which merely mechanically captured the world. What % is the camera's independent inspiration?
Here, we have humans guiding what's often a quite involved process of synthesis of past human (and machine) creation.
> The person using the AI doesn't matter in the equation. They aren't an artist, they're a monkey with a typewriter.
That's an opinion.
Does your opinion hold in all circumstances? If I spend 20 hours with an AI, iterating prompts, erasing portions of output and asking it to repaint and blend, and combining scenes-- did I do anything creative?
Of course the person using the AI matters. It's literally the same as holding a brush. You can give it a prompt, get a result and be unhappy with it, modify it or remove it, and proceed doing that until you are happy with what you have.
No matter how great the AI is, a monkey with an AI will never generate anything useful.
> But as long as it needs to be trained on the work of humans it should not be allowed to displace those people it relied on to get to where it is. Simple as that.
Do you feel the same way about tools like Google Translate?
Tbh I'm not familiar enough with how Google Translate is built, but if it's ingesting tons of people's work without their permission so it can be used to replace them, then yes, I do.
For what it's worth: that's pretty much how Translate works.
Translate operates at a large-chunk resolution, and one of the insights in solving the problem was the idea that you can often get a pretty-good-enough translation by swapping a whole sentence for another whole sentence. So they ingest vast amounts of pre-translated content (the UN publications are a great source, because they have to be published in the language of every member nation), align it for sentence- and paragraph-match, and feed the translation engine at that level.
It's created an uncanny amount of accuracy in the result, and it's basically fed wholesale by the diligent work of translators who were not asked their consent to feed that beast. Almost nobody bats an eye about this because the value (letting people using different languages communicate with each other) grossly outstrips the opportunity cost of lost human translator work, and even the translators are, in general, in favor of it; they aren't going to be displaced because (a) it doesn't really work in realtime (yet), (b) it can't handle any of the deeper signal (body language, tone, nuance) of face-to-face negotiation, and (c) languages are living things that constantly evolve, and human translators handle novel constructs way better than the machines do (so in high-touch political environments, they matter; the machines have replaced translators in roles like "rewriting instruction manuals" that were always pretty under-served in the first place).
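A grossly simplified sketch of that whole-sentence-swapping idea (my illustration; the real pipeline is statistical, not a literal lookup): given a sentence-aligned parallel corpus, "translation" can start out as retrieval of the nearest known source sentence.

    from difflib import SequenceMatcher

    # Hypothetical aligned pairs, e.g. mined from documents that are
    # published in both languages (the UN-corpus idea described above).
    parallel = [
        ("The meeting is adjourned.", "La séance est levée."),
        ("The committee approved the report.", "Le comité a approuvé le rapport."),
    ]

    def translate(sentence):
        # Return the stored translation of the most similar known source
        # sentence: swapping a whole sentence for a whole sentence.
        best = max(parallel,
                   key=lambda p: SequenceMatcher(None, sentence, p[0]).ratio())
        return best[1]

    print(translate("The meeting is adjourned"))  # -> "La séance est levée."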
I would argue that Translate being fed by paid UN translators, who likely agreed to the use of their translations in a TOS or something, is not an equal comparison to unpaid artists whose art, submitted online to various sites, became part of a training set used in for-profit models such as OpenAI's, a use they never consented to. OpenAI is a nonprofit parent company, but it spawned a for-profit child company, OpenAI LP, which most of its staff work for and which is meant to deliver many-fold returns to shareholders who are effectively profiting from the labor of all the artists and sources in its training data.
Google translate is very basic and not even close to something good if you already know both languages. Useful if you're translating to your language (you do the correction when reading), but can lead to confusion the other way.
If you can do the correction when reading, it seems reasonable to assume the reader in the opposite direction has the same correction capability.
I would expect the chance of confusion to be identical. The only difference is a matter of perspective, where in one case you are the reader and in one case you are the author.
Yes, they are identical. But I believe the reader is better armed to deal with the confusion, or at least to recognize the error, because it does not fit the context. When producing, you don't know the target language, so there's a better chance for errors to slip in unnoticed.
It's better for me to receive a text in the original language and translate it myself than to try to decipher something translated automatically.
Code has licenses too. And we've had very high profile lawsuits based on "copying code".
>what about if we eventually have robot labourers that are trained by observing human labourers?
Interesting point, but by that point in time I don't think generative art will even be in the top 10 ethical dilemmas to solve for "sentient" robots.
As it is now, robots aren't the ones at the helm grabbing data for themselves. Humans give orders (scripts) and provide data and what/where to obtain that data.
Just the people in this discussion thread, devs and entrepreneurs, have probably automated a huge amount of work. But here we are, bickering about AI and copyright like it's a new thing.
>What would you buy? A $10 H&M shirt or a $100 hand-made one? - (My guess: the latter, if you could afford it.)
This is an interesting example, because even in the $100 case you are still talking about machine augmentation. You can have a seamstress or a tailor customize patterns, using off-the-shelf textiles, for that order-of-magnitude price; but if you want custom-built, exotic materials, or many kinds combined, the cost is on the order of thousands, not hundreds. There is also a large industry of just printing designs on stock shirts, which sits at a different effort-scale equilibrium.
Thinking about how automation disintermediates is very important. In animation, productions often have key-frame artists in the pipeline who define scenes, and then others who flesh out all the details of those scenes. GenAI can potentially automate that process: you could still have the artist producing a keyframe, and render that into a video.
Another big factor is style. One hypothesized reason that impressionism, absurdism, and abstract art all became styles is photography. Once cheap machine-produced photography became available, there was less need for portrait artists. But further, portraiture was no longer high-status, and others pushed trends in alternative directions.
All the experimentation and innovation going on right now will definitely settle into a different set of roles for artists, and trends that they will seek to satisfy. Art style itself will change as a result of both what is technically possible and what is _not_ easily automatable, in order to gain prestige.
What's killing art is this idea by a vocal minority of "artists" that they need to mass produce their work, enter the market, and attempt to make millions of dollars by selling and distributing it to millions.
That's not art. That's capitalism. That's competing to produce something that customers will want to buy more than what your competitors offer.
If you want to compete on the capitalistic marketplace, then compete on the capitalistic marketplace. But if you want to be an artist, be an artist.
Art is still alive and well and always will be. Every day I see people singing because they love singing, making pottery because they love making pottery, writing because they love writing. Whether other people love or enjoy their art, the artist may or may not care. Whether they can profit from their art, the artist may or may not care. But many billions of artists will keep creating, crafting, and designing day after day, and they will never be stopped by AI or anything else.
People do whatever they want with their own property. You have no right to steal it just because they want to monetise it. What’s killing art is stealing it en masse using procedural generators.
Jobs have never been less soul crushing, or more creative, in the history of humanity. And that becomes increasingly true every decade.
Do you know what a job does? What a company does? It contributes to society! It produces something that someone else values. That they value so much they're willing to pay for it. Being part of this isn't a bad thing. It's what makes society work.
A job/company entertains. It keeps things clean. It transports people to where they need to go. It produces. It gives people things they want. It creates tools, and paints, and nails, and shirts. I look out my window, and I see people delivering furniture, chefs cooking food and selling it out of trucks, keepers maintaining grounds, people walking dogs.
Being useful to the fellow members of your society for 40 hours a week is not "soul crushing."
(This is a response to your comment before you edited it.)
Find the intersection of something that people increasingly value, that you enjoy, and that you can compete at.
The best proof that people value something is that they're spending money for it. If people aren't spending money, they don't value it, and you probably don't want to go into it. If people aren't spending more and more money on it every year, then it's not increasing in value, and you probably don't want to go into it.
The best proof that you enjoy something is that you enjoyed it in the past. Things you liked as a kid, activities that excited you as a young adult, etc., are often the best candidates.
Look for intersections of the two things above. Do some Googling, do some research.
Finally, you need to be able to compete at it. If you do something worse than everyone else does it, then no one will pick you, because you're probably not being helpful. The simple answer to this is to practice to make yourself better. But most people don't want to do that. A better answer to this is to be more unique, so you can avoid the competition. Don't do a job that has a title, a college major, and millions of talented applicants. It's not that helpful to society to do something a hundred million other people can already do, which is why there's more competition and lower wages.
When you find the intersection of what's valued and what you enjoy, call up some people in those fields and ask what's rare. What in their area is needed. What are they missing. What is no one else doing.
Or just start your own company. That's the easiest way to be unique. But it's hard.
Finally, if you feel you're too "mid," then make sure your standards aren't crazy. Don't let society tell you that you need to be a millionaire with a yacht and designer clothes to be happy. Get a normal 9 to 5 with some purpose in it, that you can be proud of, that others appreciate. Live within your means and don't stress yourself out financially. Spend your free time doing things you like. Take care of your health, find good relationships, and treasure them. That's a happy life at any income. I know a bunch of miserable depressed rich people who are very good at making money and very bad at health/relationships/etc., which is the real stuff that life is made out of.
It's an interesting predicament. Assuming these stories between person and machine are indistinguishable and of the same quality, then the difference here is the ability to scale. Setting aside bias in favor of humanity, why should we privilege output derived from a human over something else of the same quality?
I hate making analogies, but if we make humans plant rows of potatoes, should that command a higher price and be seen as more valuable than planting potatoes with a tractor 20 rows wide?
Exactly; their flesh, blood, energy, etc. does matter. This is my argument for it, not for your argument against it, lmao. There's nothing more remarkable about my planted potato row vs the tractor-planted rows, and my energy can be spent elsewhere. I am not entitled to make a living hand-planting potatoes if there's no market for it.
People have the choice to continue making stories, and they'll have a fanbase for it, and always will, because that's ultimately a part of freedom and choice. Many are less what I'll call purist here and don't care about how it came to be; they just want a quality story.
What you're loosely proposing is making art a protected class of output, when we have tools that can match it, and soon potentially surpass it. Is that not a terrific way to stunt the very thing you're trying to defend?
For transparency, I am an advocate for human made art, but I am against stunting tooling that can otherwise match said creativity. I see that as an artform in itself.
This is just gatekeeping. Art is not better because it was made by hand as opposed to with technology. If I use a generative model to make art then I’m an artist.
I would argue art is better when it's the result of the effort and vision of an individual.
Prompting a search engine to stitch images together on your behalf might result in an image you can call art, but IMO all the art generated wholecloth like this sucks: necessarily derivative, put into the world without thought.
My favorite critique of LLM work: "why would I bother to read a story that no one bothered to write"
This is just the fallacy of the Protestant work ethic with different words. Things don’t need to be difficult to be good. You can’t tell how hard an artist worked just by looking at the piece. There’s a lot of truly terrible art that has had a ton of work put into it.
It’s very easy to make bad art quickly with powerful tools. It’s also possible to carefully craft prompts which generate amazing results that win awards. Source: I’ve done this. You should see the reactions when people have heaped flowery accolades on a drawing and then find out it’s Dall-e. The irony of the transition from “art is rebellion” to pearl-clutching is almost the best part.
That critique says more about your understanding than it does about the work.
These models are not conscious; they’re not acting on their own. If I make art using a generative model, it’s no more the model doing it than it is the sketchbook doing it if I were to use that. I’m making art using whatever tool; sometimes that tool is more or less powerful. But I’m the one doing it.
How many books per second can you read to influence and change your personal style?
I don't think any person who has actually worked on anything creative in their life would compare a personal style to a model that can output in nearly any style at extreme speeds. And even if you're inspired by a specific author, invariably what happens is it becomes a mix of yourself + those influences, not a damn near-copy.
With visual mediums it's even worse, because you have to take the time [months, years] to specialize in that specific medium/style.
> I don't think any person who has actually worked on anything creative in their life would compare a personal style to a model that can output in nearly any style at extreme speeds. And even if you're inspired by a specific author, invariably what happens is it becomes a mix of yourself + those influences, not a damn near-copy.
I don't think anyone who has ever read a novel in their life would say that an AI can write literature at all, in any style.
> not a damn near-copy.
The obvious solution is to just treat it as if a human did it. If you did not know the authorship of the output and thought it was a human, would you still consider it copyright infringement? If yes, fair enough. If no, then I think it is clearly not a "damn near-copy".
On my laptop, using modern tools backed by AI? ... many.
>> How many books per second can you read to influence and change your personal style?
Thanks now to AI, hundreds. I can plug the output of the book-reading AI into the input of the tool I use to write my books and thereby update my personal style to incorporate all the latest trends. Blame the idiots who are paying me for my books.
You should read the response more carefully. Generative models are just tools. If I use one to write a story it’s no less a story that I wrote than if I’d chiseled it into a Persian mountainside.
There are two problems with this (very common) line of argument.
First, the law is pretty clear that yes if your story is too similar to another work, they have rights. Second, it's not at all obvious we can or should generalize from "what a human can do" and "what a bunch of computers can do" in areas like this.
Did you not pay them when you bought their book to read it in the first place? That dead trees don't lend themselves to that sort of payoff is a limitation of the technology. In music, sampling is a well-accepted mechanism for creating new music, and the original authors of the music they used do get paid when the new one is used.
>> the authors did not benefit from my secondary market transaction.
But they did. The presence of a secondary market for used books increased the value of some new books. People buy them knowing that they might one day recoup some costs by selling them. Would people pay more, or less, for a new car if they were told they could never sell or trade it away as a used car?
They actually do exactly this, they’re just not thinking about it. You buy a book because you want the physical possession, which gives you the ability to sell it or give it to someone or display it. Not because you want to read the contents - else you would just borrow it from a library.
> You buy a book because you want the physical possession, which gives you the ability to sell it or give it to someone or display it. Not because you want to read the contents - else you would just borrow it from a library.
Please back up this statement.
I for one, buy primary market books for the content; and to support creators whom I wish to support. Not to display the book or resell it. That is some capitalist jive.
People typically use libraries because
A) they are too poor to afford the books, and libraries therein provide a valuable community function
B) they are doing research and only need the books for a short time
Gee I don't know, but I'm glad that digital goods do not incur the same material costs as a car. "You wouldn't download a car", we've come full circle.
"In 2012, the Court of Justice of the European Union (ECJ) held in UsedSoft
GmbH v. Oracle International Corp that the first sale doctrine applies to
used copies of [intangible goods] downloaded over the Internet and sold in the
European Union." [0]
Arguably the U.S. courts are in the wrong here. We can only hope first sale doctrine is extended to digital goods in the U.S. in the future, as it has been in the EU for over a decade.
That depends on a variety of factors. You may find yourself in trouble if you write about a wizard boy called Perry Hotter going to Elkwood school of magic and he ends up with two sidekicks (a smarter girl and a redhead boy).
It could be argued quite convincingly that stories like Brooks's Shannara and Eddings's Belgariad are LOTR with the serial numbers filed off, but there is more than enough difference in how the various pieces work for those series to be unique creations that neither infringe on the properties nor hew too closely to the story. (Although I cringe at putting the execrable Belgariad books in any class with either LOTR or Shannara.)
The "best" modern example of this is the 50 Shades series. These are Twilight fan fiction (it is acknowledged as such) with the vampire bits filed off. They are inspired by Twilight, but they are not identifiably Twilight in the end. It might be hard to tell the quality of writing from that which an LLM can produce, and frankly Anne Rice did it all better decades before (both vampires and BSDM).
Humans can be influenced by writers, artists, etc. LLMs cannot. They can produce statistically approximated mishmashes of the original works themselves, but there is no act or spark of creation, insight, or influence going on that makes the sort of question you’re asking silly. LLMs are just math. Humans may be just chemistry, but there’s qualia that LLMs do not have any more than `fortune` does.
I'm with all your other arguments ... but not this point. What is the special magic property that machine-generated art doesn't have? Both human and machine generated art can be banal, can be crap. And I think there is plenty of machine generated art that is quite beautiful, and if well prompted even very insightful. Non-GenAI art can be this way too: Conway's Game of Life has a quality of beauty to it that rivals many forms of modern art (see the toy sketch below). If you wanted to argue that there is still the need for a human to provide some initial inspiration as input, or programming, before something of value can be generated, then I would agree, at least for now, though there is a meta-argument about asking LLMs to generate their own prompts that makes this an increasingly gray area.
But I don't think the stochastic parrot argument holds water. Most _human_ creation is derivative. A unique mix of pre-existing approaches, techniques, and substance often _is_ the creative act. True innovation with no tie to existing materials seems vanishingly rare to me and is a really high bar, one most humans never clear.
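To make the Conway point concrete: here's roughly all the machinery the Game of Life needs. A toy Python sketch, nothing more; the starting cells are just the standard "glider" pattern.

    # Toy sketch of Conway's Game of Life. Three rules, zero training data,
    # and the output can still be strangely beautiful.
    from collections import Counter

    def step(live):
        """Advance one generation; `live` is a set of (x, y) live cells."""
        # Count live neighbors for every cell adjacent to a live cell.
        counts = Counter(
            (x + dx, y + dy)
            for (x, y) in live
            for dx in (-1, 0, 1)
            for dy in (-1, 0, 1)
            if (dx, dy) != (0, 0)
        )
        # A cell lives next turn with exactly 3 neighbors,
        # or with 2 neighbors if it was already alive.
        return {c for c, n in counts.items() if n == 3 or (n == 2 and c in live)}

    # The classic "glider": it crawls diagonally across the grid forever.
    cells = {(1, 0), (2, 1), (0, 2), (1, 2), (2, 2)}
    for _ in range(4):
        cells = step(cells)
    print(sorted(cells))  # the same glider shape, shifted one cell diagonally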
> Humans can be influenced by writers, artists, etc. LLMs cannot.
This is literally wrong; LLMs are influenced by their users. And people can input new ideas and facts, and explore new directions, by choosing how they interact with the LLM.
An LLM writing a book all on its own, start to finish, would be a different story; the output would be derived 100% from its training set. An LLM being prompted with a book chapter would risk borrowing too much.
But if you prompt an LLM without copy-pasting protected content into the prompt, then you are the main influence. And LLMs can explore outside their training distribution in this way, helped by humans.
Many people think a trained LLM is frozen, but they do in-context learning; they can even acquire new concepts/words and properly use them in the same session (a toy example below). They don't keep the memory of this until retraining, but that doesn't mean they are locked up. There is plenty of space in the context buffer you can use to add new material after training.
This kind of thinking is like saying you can't drive a nail unless you own a licensed hammer, so someone using a shoe to drive a nail would be infringing. Maybe the shoe is also a hammer if the user says so.
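To make "in-context learning" concrete, here's a minimal sketch using the OpenAI Python client. The model name and the made-up word are just placeholders I picked, not anything special.

    # Minimal in-context-learning sketch: teach the model a made-up word
    # purely through the prompt. The weights never change; the "learning"
    # lives entirely in the context window.
    from openai import OpenAI

    client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder; any chat model would do
        messages=[
            # A word the training set cannot contain, defined on the spot:
            {"role": "user", "content": (
                "A 'glorbix' is a teapot that whistles in minor keys. "
                "Write one sentence using 'glorbix' correctly."
            )},
        ],
    )
    print(resp.choices[0].message.content)
    # The model will use the new concept correctly for the rest of the
    # session, and forget it the moment the context is gone.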
Have you noticed that authors and artists love sharing their inspirations? Let's say you're an up-and-coming author. In an interview, you list your sources of inspiration.
Using your logic, why does the creative community celebrate you and your inspirations instead of crying foul like they are with LLMs?
> If I read a lot of stories in a certain genre that I like, and I later write my own story, it’s almost by definition going to be a mish-mash of everything I like.
But it's also going to be affected by the teachers you had in pre-school, the people you hang around with, your relatives, films you've seen, adverts you watched, good memories and bad memories of events. You bring your lived experience to your story, and not just a mish-mash of stories in a particular genre, but everything.
Whereas when you train a model, you know the exact input, and that exact input may be 100% copyright material.
I feel like the keyword is 'almost' and then you begin pulling on that thread:
How closely is this the case?
What blind spots exist?
How do you measure this?
What capacity for original idea generation does the human mind have, and how does it inspire a unique spin?
This is one of those areas where 'thought experiments' are never going to pass muster against genuine experiments with metrics, trials, and robust scientific research.
But with the stakes as they are, I don't have faith there exists a good-faith dialogue in this arena.
However, AI models require copying and reproducing copyrighted works, whereas when I read a book I’m not copying it, and I’ve secured some sort of license to use it.
The collection, mass copying, and redistribution of work to create these models seems quite obviously to be a violation of IP laws.
If OpenAI paid usage fees for the training material for each user it generates content for, it would never be profitable, and artists would be fine. But as it is, all the shares are owned by people who have given this system none of its knowledge.
>If OpenAI paid usage fees for the training material for each user it generates content for, it would never be profitable
In that case, good? I thought if nothing else, these past year or two would teach companies about sinking money into unsustainable businesses and then price gouging later (I know it won't; the moment interest rates fall we are back to square one). If it isn't profitable, scale down the means of production (which may include paying C-suite executives one less yacht per year, tragic), charge customers more upfront, or work out better deals with your third parties (which is artists in this case).
I also find some schadenfreude in that these companies are trying to sell "fewer employees" to other companies but would also benefit from said scaling down, as they throw out defenses like "we can't afford to pay every copyright holder".
> The reason that AI models are generating content similar to other people's work is because those models were explicitly trained to do that.
Ah, just like humans who train against the output of other humans. AI models are not fundamentally different in kind in this regard, only scope, and even that isn't perfectly obvious to me a priori.
Humans usually add their own style to things, and it’s hard to discuss copyright without that larger context along with the question of scale (me making copies of your paintings by hand is not as significant a risk to your livelihood as being able to make them unimaginably faster than you can at much lower cost). Just as rules about making images of people in certain ways or places only became critical when photography made image reproduction an industrial-scale process, I think we’ll be seeing updates to fair-use rules based on scale and originality.
Humans can also come up with their own styles and can draw things they’ve never seen, which ML models as they currently exist are not capable of (and likely will never be). A human artist who has lived their entire life in the wilderness and has never trained themselves with the work of another artist will still be able to produce art with styles produced entirely by personal experimentation.
ML models have a long way to go before comparisons to humans make any kind of sense.
I really don't get why so many people seem to think that an AI model training on copyrighted work and outputting work in that same style is exactly the same thing (morally, ethically, legally, whatever dimension you want) as a human looking at copyrighted work and then being influenced by that work when they create their own work.
The first thing is the output of a mathematical function as computed by a computer, while the second is an expression of human creativity. AI models are not alive. They are not creative. They do not have emotion. These things are not even in the same ballpark, let alone similar or the same.
Maybe someday AI will be sophisticated enough to be considered alive, to have emotion, and to be deserving of the same rights and protections that humans have. And I hope if and when that day comes, humanity recognizes that and doesn't try to turn AI into an enslaved underclass. But we are far, far from that point now, and the current computer programs generating text and images and video are not exhibiting creativity, and do not deserve these kinds of protections. The people creating the art that is used to feed the inputs and outputs of these computer programs... those are the people that should have their rights protected.
> The first thing is the output of a mathematical function as computed by a computer, while the second is an expression of human creativity.
The mathematical function is the expression of human creativity. It consumes other expressions of human creativity, a creative act in and of itself, to output a different expression of human creativity. Other humans consume the output of the system, and the system alike, to generate further creative works. An AI model is a higher-order creative process, much like humans, and is fundamentally predicated on the creative involvement of humans.
Consider the seminal work _Designing Programmes_ by Karl Gerstner if you'd like further consideration of optimizing creative output via self imposed, systematized restrictions and permutations (design programs as meta art). Or alternatively, consider aleatoric music or Toshiko Takaezu for the incorporation of _chance_ into art.
There really isn't anything too new here in my book (AI) - just increased scope and fruition.
> They do not have emotion.
Art does not require an input or output of emotion nor an emotional affect.
Well, mostly because of corporate greed of ownership. But the underlying issue is that training AI on AI output is a recipe for ruining the entire training set. At least in these early stages.
Not just greed; they want to silence copyright holders whose works they freely use, and at the same time prevent others from using theirs. It is like having a different set of rules for themselves. I don't believe training itself is ruining anything; it is the proposed model of value capture and the marginalizing of content creators that poses the greater threat.
Yes, you've condensed the problem on display quite well here. It's not even just hypocrisy, but also short-sighted behaviour.
Artists will learn not to trust the web, if they haven't already. The greatest time to train a model was yesterday; eventually no novel ideas, expressions, or art will prosper on the "open" web. Just a regurgitation of some statistical idea of words and pixels.
Oh, right. It just reads a million books in a couple of days, removes all the source information, mixes and matches it the way it sees fit, and sells the output for $10/month to anyone who comes with a credit card.
It's the same thing with GitHub's copilot.
A book publisher would seize everything I have and shoot me in a back alley if I did 0.0001% of this.
Yeah, fair use implicitly relies on the constraints of a typical human lifetime and ability to moderate how much damage is done to publishers with it. That wasn’t an issue until recently, as humans were the only ones who could create output under fair use laws.
AI models are fundamentally different because a computer is a lump of silicon which is neither a moral subject nor object. A human author is a living sentient being that needs to earn a living and is deserving of dignity and regard.
I'm sorry, but I'm going to fundamentally disagree with you. One does not get a morality pass because "the computer did it". People are creating these AI models, selecting data and feeding the models data on which to be trained. The outcome of that rests upon _both_ the creators of the models and the users prompting the models to achieve a result.
I interpret the comment I was replying to as basically saying "We let humans do it, so therefore we should let machines do it." And my response is basically, "we let humans do it because it provides benefits to an actual living being that deserves them".
When Midjourney serves up an image, it does not collect a paycheck that enables it to feed its family. It doesn't go home and sleep well at night with the satisfaction that it has created a piece of art that meant something to others.
It may be the case that the executives and engineers who own Midjourney feel some of that, but I think the experience of making a machine to make X is fundamentally different from making an X.
It may also be the case that the person who wrote a prompt to ask Midjourney to produce an image generates some value from that and feels good about it. I get that. But I think the amount of creative effort they put into doing that, and the amount of value in the result that is derived from uncompensated other artists, is profoundly different from sitting down and actually drawing a picture.
A large enough difference in degree is a difference in kind.
As other already pointed out, that's not how human artists learn or produce art. Everyone who uses this brain-dead argument outs themselves as someone who knows nothing about the subject.
So what about the fact that these cartoons look like Keith Haring meets Cathy Guisewite meets Scott Adams? These cartoons are artistically derivative. They are obviously not derivative from the perspective of copyright as style is an idea, not an expression.
These models were not trained on just the cartoonist in question, nor just their inspirations. The intent was to train on all images and styles. The expression of the idea using these models is not going to match the expression of the idea of all images, even those conforming to a certain bounded prompt.
For the life of me I can't get DALL-E or Stable Diffusion to produce anything like Cat and Girl, nor anything coherent for the above-mentioned inspirations. DALL-E flat out refuses to create things in the style of the above, and Stable Diffusion has insane-looking outputs, overwhelmed by Haring.
Most importantly, copyright is concerned with specific works that specifically infringe, and whose damages are either statutory or based on quantifiable earnings from the infringement. Copyright does not cover styles across all works, especially when, again, the intent is to learn all styles, a process that rarely, if at all, reproduces direct expressions.
The only point at which these images are directly copied is when they are in the machine's memory, which already has case law allowing it, followed by back-propagation that begins the process of modifying the direct copies toward the underlying formal qualities.
It seems like a lot of people are going to be upset when the courts eventually rule in favor of the training and use of these models, if only because the defendant has a lot of resources to throw at a legal team.
your argument is that it's not infringing because they copied everything at once?
I get that there's case law on copying in memory on the input side not being infringing but can't for the life of me understand how they get away with not paying for it. At least libraries buy the books before loaning them out, OpenAI and midjourney presumably pirated the works or otherwise ignored the license of published works and just say "if we found it on the internet it's fair game"
1. Google's unauthorized digitizing of copyright-protected works, creation of a search functionality, and display of snippets from those works are non-infringing fair uses. The purpose of the copying is highly transformative, the public display of text is limited, and the revelations do not provide a significant market substitute for the protected aspects of the originals. Google's commercial nature and profit motivation do not justify denial of fair use.
2. Google's provision of digitized copies to the libraries that supplied the books, on the understanding that the libraries will use the copies in a manner consistent with the copyright law, also does not constitute infringement.
Nor, on this record, is Google a contributory infringer.
Focus on the protected aspects of the originals. This would be their expressions. Unless an LLM is reproducing the exact expression and not merely the idea, then there is no market substitute. The market substitute for a Picasso painting is the facsimile of one of Picasso's actual paintings, not just some other cubist artwork.
I think it's worth noting that one of the things that makes this question so vexing is that this topic really is pretty novel. We've only had a few machines like this in history and almost no legal precedent around how they should be treated. I can't remember anyone ever bringing suit over a Markov chain engine, for example, and fabricating one is basically "baby's first 'machine intelligence' project" these days (partially because the output sucks, so nobody has ever felt they have something to lose from competing with a Markov engine; see the toy sketch below).
Existing copyright precedent serves this use-case poorly, and so the question is far more philosophical than legal; there's a good case to be made that there's no law clearly governing this kind of machine, only loose-fit analogies that degenerate badly upon further scrutiny.
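For the curious: a word-level Markov text generator really is just a few lines, which is part of why nobody ever bothered to sue over one. A toy sketch, with a made-up one-line corpus standing in for real training text:

    # Toy word-level Markov chain generator -- "baby's first machine
    # intelligence project," as described above.
    import random
    from collections import defaultdict

    def train(text):
        """Map each word to the list of words observed right after it."""
        follows = defaultdict(list)
        words = text.split()
        for a, b in zip(words, words[1:]):
            follows[a].append(b)
        return follows

    def generate(follows, start, length=15):
        out = [start]
        for _ in range(length - 1):
            options = follows.get(out[-1])
            if not options:
                break
            # random.choice over a list with repeats = frequency weighting
            out.append(random.choice(options))
        return " ".join(out)

    corpus = "the cat sat on the mat and the dog sat on the cat"
    print(generate(train(corpus), "the"))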
That’s literally what human artists do, and how they work. Art is iteratively building on the work of others. That’s why it’s so easy to trace its evolution.
The poster probably meant "fair use", which is an American term of copyright law. The UK, Canada, and other Commonwealth countries have a concept known as "fair dealing", which is similar to, but different from, fair use[1]. EU copyright law has explicitly permitted uses[2] which are exceptions to copyright restrictions. Research is one of them, but it requires explicit attribution.
> I firmly believe that training models qualifies as free use. I think it falls under research, and is used to push the scientific community forward.
I don't think this is as cut-and-dried as you make it out to be here. If I train a model on, say, every one of the New York Times' articles and release it for free, and it finds use as a way of circumventing their paywall, I have difficulty justifying that as fair use/fair dealing. The purpose/character of the model should indeed be a factor, but certainly not nearly as dispositive a one as I think you're suggesting.
To the extent that training the model serves a research purpose I think that the general use / public release of the trained model does not in general serve the same research purpose and ought to have substantially lower protection on the basis of, e.g., the effect on the original work(s) in the market.
Wouldn't that depend on the use case? If you just had the model regenerate articles that roughly approximate its source material, that is a much more clear-cut violation of a paywall. But if you use that data as general background knowledge to synthesize aggregative works, such as a history of the Vietnam War, or trends in musical theatre in the 1980s relative to the 1970s, or shifts in the usage of formal honorifics, then those seem to me to be clearly fair-use categories. There are gray areas, such as aggregating the opinions of a certain op-ed writer over a short timeframe, which while it might produce a novel work is basically a mishmash of recent articles. But would that be unfair, especially if not done in the original author's style?
Technical distinctions like these probably will matter in whatever form regulation eventually takes.
Quite a lot of what news publications like the New York Times do is precisely regenerating articles that roughly approximate source material from some other publication. If I remember rightly, a lot of smaller, more local news organisations aren't happy about this because of course it's a more or less direct substitute for the original and a handful of big news organisations (particularly the New York Times) are taking so much of the money that people are willing to pay for news that it's affecting the viability of the rest - but it's not illegal, since facts cannot be copyrighted.
Yes, I think this is a rather fact-specific inquiry. My main point is that the research/commercial distinction is not the only factor (and not even the most important one).
> if you use that data as general background knowledge to synthesize aggregative works, such as a history of the Vietnam War, or trends in musical theatre in the 1980s relative to the 1970s, or shifts in the usage of formal honorifics, then those seem to me to be clearly fair-use categories
I don't think this is clear. If someone were to train a model on several books about the Vietnam War and then publish their own model-created book on the history of the Vietnam War, I would be inclined to say that that is infringement. And if they changed the dataset to include a plurality of additional books which happen to not be about Vietnam, I don't think that changes the analysis substantially.
I think it is hard to earnestly claim that in that instance the output (setting aside the model itself, which is also a consideration) is transformative, and so I would think, absent more specific facts, that all four fair use factors are against it.
> And if they changed the dataset to include a plurality of additional books which happen to not be about Vietnam, I don't think that changes the analysis substantially.
I think the question is if it changes analysis if the dataset DOES include a bunch of books and articles related to Vietnam beyond your specific book.
In the first case, where it is just rewriting a single book's content, the unfairness is clear.
But in the case where it is producing a new synthesis and analysis of the data, derived in part from, but not regurgitating, the source material, is that unfair?
Sorry, but these arguments by analogy are patently ridiculous.
We are not talking about the eons old human practice of creative artistic endeavor, which yes, is clearly derivative in some fashion, but which we have well established practices around.
We are discussing a new phenomenon of mass replication or derivation by machine at a scale impossible for a single individual to achieve by manual effort.
Further, artists tend to either explicitly or implicitly acknowledge their priors in secondary or even primary material, much like one cites work in an academic context.
Also, the claim:
>But if I take your work and compare it to millions of other people's work...
Is ridiculous. A. You haven't, nor will you ever, actually do this. B. This is never how the system of artistic practice up to this point has worked, precisely because this sort of activity is beyond the scale of human effort.
In addition, plagiarism exists and is bad. There's no reason that concept can't be extended and expanded to include stochastic reproduction at scale.
If you want a future in which artists don't have a say, and in which capital concentrates even further into the hands of a few technological elite who make their money off of flouting existing laws and the labor of thousands, by all means. But this argument that, by analogy to human behavior, companies should somehow not be responsible for the vast use of material without permission is absolutely preposterous. These are machines owned by companies. They are not human beings and they do not participate in the social systems of human beings the way human beings do. You may want to consider a distinction in the rules that adequately reflects this distinction in participatory status in a social system.
>> a future in which capital concentrates even further into the hands of a few technological elite who make their money off of flouting existing laws and the labor of thousands
I think this is the central issue and is not limited to just AI generated art. Wealth concentrates to the few from each technological development. When robots replaced factory workers, the surplus profit went to the capital holders, not the workers who lost their jobs. AI generated art will be no different but I don't think it will replace the creative art that people will want to make, just the art that people are making to pay the bills.
So your argument is predicated on the scale of inspired work being the problem?
> They are not human beings and they do not participate in the social systems of human beings the way human beings do
I don't think this adds anything to the argument besides you using this as a reason analogies with humans can't be used to compare the specific concept of inspired works? I don't think this holds up.
Algorithms participating in social systems has nothing to do with whether inspired works have a moral claim to existence for some. The fact that your ethics system values the biological classification of the originator of inspired works is something that can't be reconciled into a general argument. I could make the claim that the prompt engineer is the artist in this case.
> capital concentrates even further into the hands of a few technological elite who make their money off of flouting existing laws
That can be said by the development of any technology. Fear of capital concentration is more a critique on capitalism than it is on technological development.
> That can be said by the development of any technology. Fear of capital concentration is more a critique on capitalism than it is on technological development.
Technology does not exist in a vacuum. All of the utility and relevance of technology to humans is dependent on the social and economic conditions in which that technology is developed and deployed. One cannot possibly critique technology without also critiquing a social system, and typically a critique of technology is precisely a critique about its potential abuses in a given social system. And yes, that's what I'm attempting to do here.
> I don't think this adds anything to the argument besides you using this as a reason analogies with humans can't be used to compare the specific concept of inspired works? I don't think this holds up.
This is a fair point. One could argue that an LLM, properly considered, is just another tool in the artist's toolbox. I think a major distinction, though, between an LLM and, say, a paintbrush, or even a text editor or Photoshop, is that those tools do not have content baked into them. An LLM is in a different class insofar as it is not simply a tool, but is also partially the content.
The use of two different LLMs by the same artist, with the same prompt, will produce different results regardless of the intent of the so-called artist/user. The use of a different paintbrush, by the same artist, with the same pictorial intention may produce slightly different results due to material conditions, but the artist is able to consciously and partially deterministically constrain the result. In the LLM case, the tool itself is a partial realization of the output already, and that output is trained on masses of works of unknown individuals.
I think this is a key difference in the "AI as art tool" case. A traditional tool does not harbor intentionality, or digital information. It may constrain the type of work you can produce with it, but it does not have inherent, specific forms that it produces regardless of user intent. LLMs are a different beast in this sense.
Law is a realization of the societal values we want to uphold. Just as we can't in principle claim that training of LLMs on scores of existing work is wrong solely due to the technical function of LLMs, we cannot claim that this process shouldn't be subject to constraints and laws due to the technical function of LLMs and/or human beings, which is precisely what the arguments by analogy try to do. They boil down to "well it can't be illegal since humans basically do the same thing" which is a hyper-reductive viewpoint that ignores both the complexities and novelty of the situation and the role of law in shaping willful societal structure, and not just "adhering" to natural facts.
> They are not human beings and they do not participate in the social systems of human beings the way human beings do.
Your original quote was not using the impact of the technology, it was disparaging the algorithmic source of the inspired work (by saying it does not participate in social systems the way humans do).
> I think a major distinction, though, between an LLM and, say, a paintbrush, or even a text editor or Photoshop, is that those tools do not have content baked into them
LLMs, despite being able to reproduce content in the case of overtraining, do not store the content they are trained from. Also, the usage of "content" here is ambiguous so I assumed you meant the storage of training data.
To me, the content of an LLM is its algorithm and weights. If the weights can reproduce large swaths of content to a verifiable metric of closeness (and to an amount that's covered by current law) I can understand the desire to legally enforce current policies. The problem I have is against the frequent argument to ban generative algorithms altogether.
> The use of a different paintbrush, by the same artist, with the same pictorial intention may produce slightly different results due to material conditions, but the artist is able to consciously and partially deterministically constrain the result.
I would counter this by saying the prompts constrain the result. How deterministically depends on how well one understands the semantic meaning of the weights and what the model was trained on. Also, as a disclaimer, I don't think that makes prompts proprietary (for various different reasons).
> I think this is a key difference in the "AI as art tool" case. A traditional tool does not harbor intentionality, or digital information
Assigning "intent" is an anthropomorphism of the algorithm in my opinion as they don't have any intent.
I do agree with your last paragraph though, one (or even a group of) individual's feelings don't make something legal or illegal. I can make a moral claim as to why I don't think it should be subject to constraints and laws, but of course that doesn't change what the law actually is.
The analogies are trying to make this appeal in an effort to influence those who would make the laws overly restrictive. There are many laws that don't make sense, and logic can't change their enforcement. The idea is to make a logical appeal to those who may have inconsistencies in their value system, to try and prevent more nonsensical laws from being developed.
Because we are humans and our capability of abusing those rights is limited. The scale and speed at which LLMs can abuse copyrighted work to threaten the livelihoods of the authors of those works is reason enough to consider it unethical.
"abusing those rights" is a subjective phrase. What about it is "abuse"? If I learned how to draw cartoon characters from copying Family Guy and released a cartoon where the characters are drawn in a similar style, would that be abuse (assuming my show takes some of Family Guy's viewership)? Is your ethical hangup with the fact it's wrong to use the data of others to influence one's work (which could potentially be an algorithm) or that people are losing opportunities based on the influenced work?
If it's the latter how do we find the line between what's acceptable and what's not? For example, most people wouldn't be against the creation and release of a cure for cancer developed in this way. It would lead to the loss of opportunities for cancer researchers but I believe most people would deem that an acceptable tradeoff. A grayer area would be an AI art generator used to generate the designs for a cancer research donation page. If it could potentially lead to a 10% increase in donations, does that make it worth it?
>For example, most people wouldn't be against the creation and release of a cure for cancer developed in this way.
Intellectual property law does presently restrict the development of cancer treatments and demands in many cases exorbitant royalties from patients and practitioners, so I'm not convinced that this is accurate. If people believed that the loss of opportunities would constrain innovation in the field of cancer research, I think they'd expect the AI users to pay royalties as well.
>If people believed that the loss of opportunities would constrain innovation in the field of cancer research, I think they'd expect the AI users to pay royalties as well.
This comes down to the product of AI.
If the AI produces a cancer treatment identical to what is already covered by patent, I think commercialization would be contingent on the permission of the IP holder.
If the AI produced a novel cancer treatment, using a transformative synthesis of available knowledge, most people would not expect royalties.
I never made a legal appeal in my previous comment, so legalities are irrelevant. My argument is also about derivative/transformative works rather than specific works.
What I was questioning was whether people would think it's morally right or not to generate inspired works. For example, if someone made an algorithm to read the relevant papers and produce a cancer treatment that addresses the same areas/conditions as a method under IP law but doesn't equate to the exact method, I don't see that as a morally wrong action by itself.
I don’t think it is. What you describe is similar to any other industry disruption, and I don’t think those are unethical. I’d actually argue that preventing disruption is often (not always) unethical, because you artificially prolong an inefficient or inferior alternative.
So you're saying that we should stop pursuing art and prose? Because when you fine-tune Midjourney with 30 or so images by an artist, it can create any image in that artist's style.
You removed the value and authenticity of that artist in 30 minutes, you applauded it, and you defended that it should be the norm.
OK then, we can close down the entire entertainment business and generate everything with AI, because it can mimic styles, clone sounds, animate things with Gaussian splats, and so on.
Maybe we can hire coders to "code" films? Oh sorry, ChatGPT can do that too. So we need a keypad then, that only the most wealthy can press. Press 1 for a movie, 2 for a new music album, 3 for a new book, and so on.
We need 10 buttons or so, as far as I can see. Maybe I can ask ChatGPT 4 to code one for me.
Doesn't matter. You pay the artist for their style of rendering things. Consider XKCD, PHD Comics, Userfriendly, etc. At least 50% of the charm is the style, remaining 50% is the characters and the story arc.
You can't copyright the style of a Rolex, but people pay a fortune to get the real deal. Same thing.
> My word, the lawsuits that would arise between artists...
Artists imitate/copy artists as a compliment, at least in the illustration and comics world. Unless you do it in bad faith, I don't think artists are gonna do that. Artists have a sense of humor to begin with, because art is making fun of this world, in a sense.
No, you pay them for the finished product. The STYLE is independent. Lots of artists have similar styles. They don't all pay each other for copying their styles.
Every artist has their own style, because it's their way of creating the product.
Pixar, Disney and Dreamworks have different styles, same for actors, writers, and designers, too. You can generally tell who made what by reading, looking, listening, etc.
I can recognize a song by Deep Purple or John Mayer or Metallica, just by their guitar tone, or their mastering profile (yes, your ear can recognize that), in a couple of seconds.
If style were that easy, we could have 50 Picassos, 200 John Mayers, or 45 Ara Gulers (a photographer) that you couldn't tell apart, but it doesn't work that way.
XKCD brought in a couple of guest artists for personal reasons. It was very evident, even though the drawing style was the same.
People, art, and hand-made things are much more complex than they look. Many programmers forget this because everything is rendered in their favorite font, but no two hand-made things are ever the same. Eat the same recipe from two different cooks, even if you measure out the ingredients and hand them over beforehand, and you'll get different tastes.
Style is a reflection of who you are. You can maybe imitate it, but you can't be it.
Heck, even two people implementing the same algorithm in the same programming language don't write the same thing.
> Style is a reflection of who you are. You can maybe imitate it, but you can't be it.
Isn't this an argument that AI-generated artwork will never be more than a lesser facsimile? That'd suggest that human-made works will always be more sought-after, because they're authentic.
It will be, and human-made things will always be better and more sought-after; however, capitalism doesn't work that way.
When the replacements become "good enough", they'll push out the better things by being cheaper and 90% of the way there. I have some hand-made items and they're a treat to hold and use. They perform way better than their mass-produced counterparts, they last longer, they feel human, and no, they're not inferior in quality. In fact it's the opposite, but most of them are not cheap, and when you want to maximize profits, you need to reduce your costs, ideally to zero.
Honestly, that'll be boring. I don't want to be a star of a movie, that's not what pulls me in.
I want to see what the person has imagined, what the story carries from the author, what the humans in it added to it and what they got out of it.
When I read a book, I look from another human's eyes, with their thoughts and imagination. That's interesting and life-changing actually. Also, the author's life and inner world leaks into the thing they created.
The most notable example for me is Neon Genesis Evangelion. Its psychological aspects (which hit very hard, actually) are a reflection of Hideaki Anno's clinical depression. You can't fake this even if you want to.
This is what makes human creation special. It's a precipitation of a thousand and one things in an unforeseen way, and this is what feeds us, even if we are not aware of it and love to deny it at the same time.
"This is what makes human creation special.", that's a load of garbage. There is nothing inherently special about human creation. Some AI artwork I've seen is incredible, the fact it was AI generated didn't change its being an incredible piece of art.
Thinking our creation has some kind of 'specialness' to it is like believing in a soul, or some other stupid thing. It's pure hubris.
Actually, I'm coming from a gentler point of view: "Nature and living things are much more complex than we anticipate".
There are many breakthroughs and realizations in science which excite me more than "this thing called AI": Bacteria have generational memory. Bees have a sense of time. Mitochondria (and cells) inside a human body communicate and try to regulate aging and call for repairs. Ants have evolved antibiotics, and expel the ones with incurable and spreadable diseases. Bees and ants have social norms; they have languages. Plants show more complex behavior than we anticipated. I'm not even entering the primates' and birds' territory, because the titles alone would fill a short chapter.
While some of them might be very simple mechanisms at the chemical level, they make up a much more complex system, and the nature we live in is much more sophisticated than we know, or want to acknowledge.
I'm not looking from a "humans are superior" perspective. Instead, I'm looking from an "our understanding of everything is too shallow" perspective. Instead of trying to understand or acknowledge that we're living in a much more complex system on a speck of dust in vast emptiness, we connect a bunch of silicon chips, dump everything we ever babbled into a "simulated neural network", and it gives us semi-nonsensical, grammatically correct half-truths.
That thing can do it because it puts word after word via a very complex and weighted randomization learned from how we do it, imitating us blindly, and we think we have understood and unlocked what intelligence is. Then we applaud ourselves because we're one step closer to stripping a living thing of its authenticity and making Ghost in the Shell a reality.
Living things form themselves over a long life with sight, hearing, communication, interaction, and emotions, at least, and we assume that a couple of million lines of code can do much better because we poured in a quadruple-distilled, three-times-diluted version of what we have gone through.
This is pure hubris if you ask me, if there ever was any.
Then the market will decide, won't it? Why the fuss about generative AI then? If you're so confident about its inferiority, you shouldn't have to worry about it, right? The better product will win, right?
The market does not choose the superior product. It might choose the least common denominator, the cheapest product, the product that got on the market the earliest, or the one with the richest backers, but not "the superior product".
The first part is debatable, unless you qualify it as "superior at making their creator money".
The market selects for that, and only that. Other qualities of the product are secondary, making any statements to the effect of "the best product [outside the context of simply making the most money] will win" misguided at best.
What will actually happen is people will think "meh good enough", shitty AI art will become the norm, and we'll be boiling frogs and not realize how shitty things have become.
Yes, that is true. I 100% agree. It is needed without a doubt.
For one moment, let's think of it this way. You are a 20-year experienced engineer making whatever money you are making. Suddenly, your skills are invalidated because of a new disruption. And you have another friend in the same situation.
Fortunately for you, luck played out and you could transition! You found your way back to a life with meaning and value. Your joy and your everyday life continued as before.
But the other friend enjoyed the process, liked doing what they were doing, and there was no suitable transition for them. Humans are adaptable, but to them, nothing mattered because their whole existence no longer offered any value. The sole act of doing was robbed from them WITHOUT ANY ALTERNATIVE. The experience and value of a person rendered worthless.
Can you relate to that feeling? If yes, thank you.
If no, your words are empty and hold no value.
Artists went through a similar phase during the invention of photography. Now it is rather soul-crushing, because anything an artist makes can easily be replicated, rendering the whole artistic journey moot.
> Can you relate to that feeling? If yes, thank you.
> If no, your words are empty and hold no value.
Being sympathetic towards those people doesn't mean you should bend to their will if you don't believe it's the right thing to do. I can be sympathetic to a child who cries over not being able to ride a roller coaster because they aren't tall enough without thinking the height requirement should be removed.
I think the big difference is that it's not a direct replacement - it feeds off of the existing people while making it much harder for them to make a living.
It would be as if instead of cars running on gasoline, they ran on chopped up horseflesh. Not good for the horses, and not sustainable in the long term.
Some "disruptions" are unethical, some are not. It's about what they actually consist of. Labelling many things as "industry disruption" abstracts beyond usefulness.
Do you really feel that way universally? Would it be ethical to disrupt the pharmaceutical industry by removing all restrictions around drug trials? Heck, you could probably speed things up even further if you could administer experimental drugs to subjects without their consent.
Obviously this is a bit facetious, but basing your ethical framework on utilitarianism and _nothing_ else is pretty radical.
If having those restrictions makes the world worse overall, then it would be ethical to remove them. But I assume the restrictions are designed by intelligent people with the intention of making the world better, so I don’t see any reason to think that’s the case.
I agree that the current crop of artists are worse off with AI art tools being generally available. But consumers of art, and people who like making art with AI art tools, are better off with those tools being available. To me it’s clear that the benefit of the consumers outweighs the cost to the artists, and I would say the same if it was coders being put out of jobs instead. You can prove this to yourself by applying it to anything else that’s been automated. Recording music playback put thousands of musicians out of work, but do you really regret recorded music playback having been invented?
P.S. Adobe Firefly is pretty competent and is only trained on material that Adobe has the license to. If copyright were the real reason people didn’t like AI art tools, you would see artists telling everyone to get Adobe subscriptions instead of Midjourney.
> If having those restrictions makes the world worse overall, then it would be ethical to remove them
Worse how? As defined by whom?
You could make a pretty compelling argument that "the world" would be better off by, e.g., forcing cancer patients through drug trials against their will. We could basically speedrun a cure for cancer!
These longtermist, ends justify the means, ideas can easily turn extremely gross.
Don't even try to stop my grocery-store-sample-hoarding robot army, Wegmans! You're being unethical in your pathetic attempt to prevent your sampling disruption!
Are photocopy machines illegal? Are CD-ROM burners illegal? Both allow near-unlimited copies of copyrighted material at a scale much faster than a human could do alone.
The tools are not the problem, it's how humans use them.
Same as an LLM, they can be used in an illegal way if used to copy copyrighted material. So I can't tell it to reproduce a copyrighted work. But it can create new material in the style of another artist.
The difference is that the LLM is still copying copyrighted material in your case, but if I burn a Linux ISO, that is not happening.
You do not have to produce an exact copy of something to violate copyright, and I think anything the LLM outputs is violating copyright for everything it has ever trained on, unless the operator (the person operating the LLM and/or the person prompting it) has rights to that content.
No, and I don't think anyone is arguing that LLMs should be illegal either.
I personally am not against LLMs training on things the operator has rights to, and even training on copyrighted things, but I am against it laundering those things back out and claiming it's legal.
Because we are humans and our capability of abusing those rights is limited. The scale and speed at which looms can abuse copyrighted work to threaten the livelihoods of the seamstresses of those works is reason enough to consider it unethical.
Replace loom with printing press, etc., and you realize you're a Luddite?
Ned Ludd was onto something. He wasn't anti-progress. He was anti-labour theft. The problem was not that people were losing their jobs, but that they were being punished by society for losing their jobs and not being given the ability to adapt, all to satisfy the greed of the ownership class.
I am hearing a strong rhyme.
Commercialized LLMs are absolutely labour theft even if they are useful.
We do not want our labour stolen. We want to labour less, and we want to be fairly compensated for when we have to labour.
The Luddites and the original saboteurs (from the French sabot) had a problem where the capital class invested in machines that let them (a) get more work done per person, (b) employ fewer people, and (c) pay those fewer people less because now they weren't working as hard. The people they fired? They (and the governments of the day — just like now) basically told them to go starve.
> The Luddites were members of a 19th-century movement of English textile workers which opposed the use of certain types of cost-saving machinery, and often destroyed the machines in clandestine raids. They protested against manufacturers who used machines in "a fraudulent and deceitful manner" to replace the skilled labour of workers and drive down wages by producing inferior goods.[1][2] Members of the group referred to themselves as Luddites, self-described followers of "Ned Ludd", a legendary weaver whose name was used as a pseudonym in threatening letters to mill owners and government officials.[3]
Yes, we want to work less. But fair work should result in fair compensation. Ultimately, this is something that the copyright washing of current commercialized LLMs cannot achieve.
I agree with the other commenters that the scale of this "deriving inspiration from others" is where this feels wrong.
It feels similar to the ye olden debates on police surveillance. Acquiring a warrant to tail a suspect, tapping a single individual’s phone line, etc all feels like very normal run-of-the-mill police work that no one has a problem with. Collating your behavior across every website and device you own from a data broker is fundamentally the same thing as a single phone’s wiretap, but it obviously feels way grosser and more unethical because it scales way past the point of what you’d imagine as being acceptable.
In that example it's not the scale that makes it right or wrong, the scale of people impacted just affects the degree of wrongs that have been committed.
> Acquiring a warrant to tail a suspect, tapping a single individual’s phone line, etc all feels like very normal run-of-the-mill police work that no one has a problem with.
If acquiring a warrant is the basic action being scaled, I'd be okay with that ethically if it was done under what I define as reasonable pretenses. Regardless of how it scales, I still think it would be the right thing to do, assuming the pretenses for the first action could be applied to everyone wiretapped. Now if I thought the base action was morally wrong (someone was tailed or wiretapped without proper pretenses), I'd think it's wrong regardless of the scale. The number of people it affected might impact how wrong I saw it, but not whether it was right or wrong.
I'm not as interested in making a technical/legal argument as I am in sharing my feelings on the topic (and eventually, what I think the law should be), but during training, copies are made of copyrighted material, even if the model doesn't contain exact copies of work. Crawling, downloading, and storing (temporarily) for training all involve making copies, and thus are subject to copyright law. Maybe those copies are fair use, maybe they aren't (I think they shouldn't be).
My main point is that OpenAI is generating an incredible amount of value all hinging on other people's work at a massive scale, without paying for their materials. Take all the non-public domain work off Netflix and Netflix doesn't have the same value they have today, so Netflix must pay for content it uses. Same goes for OpenAI imho.
> Take all the non-public domain work off Netflix and Netflix doesn't have the same value they have today, so Netflix must pay for content it uses. Same goes for OpenAI imho
I'd feel a lot better about that argument if we had sane copyright laws and anything older than 7-10 years was automatically in the public domain. Suddenly, Netflix is looking a lot more valuable with just public domain works, and there'd be a ton of public domain art to train AI models with. I suspect the technology would still leave a lot of artists concerned in that situation, though, because even once the issue of copyright is largely solved, the fact remains that AI enables people who aren't artists to create art.
Assume I agree that copyright holders should be compensated for their works (because I do in some sense).
How would this compensation work? Let's say a portion of profits from LLMs that were trained on copyrighted work should be sent to the copyright holders.
How would we allocate which portion of the profits goes to which creators? The only "fair" way here would be if we could trace how much a specific work influenced a specific output, but this is currently impossible and will likely remain impossible for quite some time.
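To make the problem concrete, here's a minimal sketch (in Python, and entirely hypothetical) of what a pro-rata scheme would have to look like. Note that `influence(work, output)` is precisely the attribution function we don't know how to compute; everything else is trivial arithmetic once you assume it exists:

    # Hypothetical pro-rata royalty allocation. The `influence` callable is
    # a stand-in for the attribution capability that doesn't exist today.

    def allocate_royalties(profit, works, outputs, influence):
        # Score each work by its total influence across all outputs.
        scores = {w: sum(influence(w, o) for o in outputs) for w in works}
        total = sum(scores.values())
        if total == 0:
            return {w: 0.0 for w in works}
        # Split the profit in proportion to those scores.
        return {w: profit * s / total for w, s in scores.items()}

Even this toy version shows the dependency: without a defensible `influence`, any split degenerates into flat per-work payouts or negotiated lump sums.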
This is what licensing negotiations are for. One doesn't get to throw up their hands and say "I don't know how to fairly pay you so I won't pay you at all".
Your argument is ridiculous, because it could identically be applied to "every human artist should have to pay a license to every artist whose work they were inspired by". That would obviously be a horrible future, but megacorps like Disney would love it.
Calling it copyright today is a misnomer: it's not actually the act of copying the work that's the problem. It should really be called "Performance Rights" or "Redistribution Rights." The part where this gets complicated is that OpenAI has (presumably; if they haven't, that's a different matter) acquired the works through legal means. And having acquired them, they're free to do most anything with them so long as they don't redistribute or perform the works.
The big question is where "training an AI on this corpus of works and then either distributing the weights or performing the work via API" falls. Should the weights be considered derivative works? I personally don't think so, and although the weights can be used to produce obviously infringing works, I don't think this meets the bar of being a redistribution of the work via a funny lossy compression algo like some are claiming. But who knows? Copyright is more political than logical, so I think which way this bends will really come down to a balance of the tangible IRL harms artists can demonstrate vs. the desires of unrelated industries that wish to leverage this technology and are better off for having all this data available.
I do think it's worth remembering there's a difference between "legal" and "good".
It's entirely legal for me to leave the pub every time it comes up to my round. It's legal for me to get into a lift and press all the buttons.
It's not unreasonable, I think, for people to be surprised at what is now possible. I'm personally shocked at the progress in the last few years - I'd not have guessed five years ago that putting a picture online might result in my style being easily recreated by anyone, mostly for the benefit of a profitable company.
Another example is proprietary software that may have its source available, either intentionally or not. If you view this and then work on something related to it, like WINE for example, you are definitely at risk of being successfully sued.
If you worked at Microsoft and worked on Windows, you would not be able to participate in WINE development at all without violating copyright.
If you viewed leaked Windows source code you also would not be able to participate in WINE development.
An interesting question that I have is whether training on proprietary, non-trade-secret sources would be allowed. Something like Unreal Engine, where you can view the source but it's still proprietary.
Another question is whether training on leaked sources of proprietary and private but non-trade-secret code, like source dumps of Windows, is legal.
The way this works is the way many of us are arguing that AI and copyright should work.
Viewing (or training on) copyrighted work isn't copyright infringement.
What can be copyright infringement is using an employee who has viewed (or a model that was trained on) copyrighted work to create a duplication of that work.
In most of the examples of infringing output that I've seen, the prompt is pretty explicit in its request to duplicate copyrighted material.
Models that produce copyrighted content when not explicitly asked to will have trouble getting traction among users who are concerned about the risk of infringement (such as the examples you listed).
I also see this approach opening an opportunity for models that acquire specific licenses for the content they train on, licenses that would grant the users of the model the right to duplicate some or all of the copyrighted works.
The responsibility for how a model is used should rest primarily on the user, not the model trainers.
Let's say I'm an artist. I have, thus far, distributed my art for consumption without cost, because I want people to engage with and enjoy it. But, for whatever reason, I have a deep, irrational philosophical objection to corporate profit. I want to preclude any corporation from ever using my art to turn a profit, when at all possible. I have accepted that in some sense, electrical and internet corporations will be turning a profit using my work, but cannot stomach AI corporations doing so. If I cannot preclude AI corporations from turning a profit using my work, I will stop producing and distributing my work.
Do you think it's reasonable for me to want some legal framework that allows me to explicitly deny that use of my work? Because I do.
Copyright is a bad idea in the first place, and should just be thrown out entirely; but that isn't the whole picture here.
If OpenAI is allowed to be ignorant of copyright, then the rest of us should be allowed, too.
The problem is that OpenAI (alongside a handful of other very large corporations) gets exclusive rights to that ignorance. They get to monopolize the un-monopoly. That's even worse than the problem we started with.
Who is "we" here? Are you making a distinction between people and machines? If I built a machine that randomly copied from a big sample of arts that I wanted, would that machine be ok?
OpenAI built a machine that does exactly that. They just sampled _everyone_.
Does copyright law say you can ingest copyrighted work at very large scale and sell derivatives of those works to gain massive profit / massive market capitalizations? Honestly wondering. This seems to be the crux of the issue here.
Google is not producing something that competes with or is comparable to what it's parsing and displaying, which makes it very different.
Google is displaying the exact content and a link to the source, and is functioning as a search engine.
Copying music (or whatever), and then outputting music based on the copied music is not the same thing as a search engine, it's outputting a new "art" that is competing with the original.
Another way to put it is that you can't use a search engine to copy something in any meaningful way, but copying music to produce more music is actually copying something.
The goal of my post was not to answer what differentiates Google search from LLMs and other generative models; it was to respond to the original post above:
> Does copyright law say you can ingest copyrighted work at very large scale and sell derivatives of those works to gain massive profit / massive market capitalizations
The reasons why I don't think training on copyrighted data is wrong are stated in my other comments, replying to people who have made arguments about its immorality.
Authors Guild, Inc. v. Google, Inc. determined that Google's wholesale scanning and indexing of books was fair use; the books were borrowed from the University of Michigan library, which had paid for them (or a donor had, at some point). Here's a book of bedtime stories available in its entirety: https://www.google.com/books/edition/Picnics_in_the_Wood_and...
In your first two examples, Google is still not providing an alternative to the original content; the people who have uploaded the content are, and they are doing so illegally.
The Google Books thing is a lot more interesting, I think. I guess the idea is that Google is acting as a library, and they are lending you the book? I'm not sure how I feel about this, and I would need to do a lot more research on it before having a strong opinion.
The other big difference here is that you can't use the content linked from Google search as if it were your own. If you Google search "nes emulator in javascript" and get a link to a GitHub repo, you can't copy-paste the code as if it were your own, and even basing your code on what you have seen could be risky depending on the license of the repo. LLMs are acting as a sort of search engine, pretty similar to how Google search does, but people are using the output from the "search" as if it were their own work that they have full rights to!
If crawling the web, ingesting copyrighted content, and ranking it does not produce a derivative work of that content, then using it to change the values of a mathematical expression should likewise exempt the expression from being a derivative work.
> Does copyright law say you can ingest copyrighted work at very large scale and sell derivatives of those works
In that case, the OP should never have posed this irrelevant question, because access to the expression doesn't give access to a derivative work.
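To make the "values of a mathematical expression" framing concrete, here's a toy sketch (hypothetical, reduced to a single weight and a squared-error loss) of what training does: it nudges numeric parameters rather than storing the material it sees.

    # Toy illustration: one gradient step for the model y = weight * x,
    # minimizing the squared error (weight * x - target) ** 2.

    def train_step(weight, x, target, lr=0.01):
        prediction = weight * x
        gradient = 2 * (prediction - target) * x  # derivative of the loss
        return weight - lr * gradient

    w = 0.0
    for x, target in [(1.0, 2.0), (2.0, 4.0)]:  # stand-ins for training data
        w = train_step(w, x, target)
    # `w` now holds a statistical trace of the data, not a copy of it.

Whether that numeric trace should count as a derivative work is, of course, exactly what's being argued.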
>> ingest copyrighted work at very large scale and sell derivatives of those works to gain massive profit / massive market capitalizations?
Entire industries exist dedicated to such things. News aggregators. TV parody shows. Standup comedians. Fan fiction. Biography writers. Art critics. Movie critics. Sometimes the derivative work even outsells the original, especially when the original was horrible or unappreciated. I have never played Among Us or the new Call of Duty, but I do enjoy watching NeebsGaming do their youtube parodies of them.
No, copyright law prohibits that. The best example so far is Google's image search being considered fair use; notably, it's not commercial insofar as they do not sell the derivative work, though they might sell ads on the image search results. OpenAI sells their service, which is the result of the copies, i.e., a derivative work. It's also probably true that the AI weights themselves are derivatives of the works they are based on.
Yes, I believe that is correct. If you do something "transformative" with the material, then you are allowed to treat it as something new. There's also the idea of using a portion of a copyrighted work (like a quote or a clip of a song or video); this would be "fair use".
I sue people for copyright infringement frequently. It's rare that I have a defendant whose defense is "the internet is full of other infringers, why should I be held responsible?" Never have they won. This debate would go better if people didn't base it on assumptions they glean from the world around them, but with regard to the actual law, and not with specious reasoning like "well, they did it too!!"
Style is not copyrightable. Please make an actual effort to engage with copyright law instead of asking me smarmy questions because you think you're right, having made no effort past looking at what's immediately in front of you.
The artist in the article clearly states that his work was free to use only if it was not used to make a profit, those were the terms of their license. In the artist's opinion, OpenAI violated that license by training their tool on their work and then selling that tool.
This artist doesn't complain about work similar to their own being generated, and their artwork is very clearly not clothing.
>> In the artist's opinion, OpenAI violated that license...
So? Why does the author's opinion even enter into the equation? Authors cannot claim ownership beyond the bounds of copyright. If what AI is doing qualifies as fair use, the artist cannot do anything about it. I'm sure that lots of artists would not want anyone to lampoon or criticize their work. They cannot stop such things. I'm sure lots of artists would never want anyone to ever create anything in any way similar to their work. They cannot do that either.
It is not clear that training an LLM falls under "fair use". We are then left with the license of the work; in this case, that license forbids re-selling the work for a profit. It is the artist's license for their work that is at issue, not their opinion.
> "...clearly states that his work was free to use only if it was not used to make a profit"
Replace "use" with "copy". No one may copy the work to make a profit. Fair Use has long been an exemption to copyright, with Learning an example of Fair Use. But no one expected AIs to learn so quickly. I don't think it is clear either way, and will end up in SCOTUS.
>> Fair Use has long been an exemption to copyright,
The proper construction is that copyright is an exemption from the freedom of speech. Fair use is a partial description of freedom of speech, a description to narrow the limits of copyright rather than to broaden the already limitless bounds of freedom of speech.
The default for expression is that it is allowed except if copyrighted, as opposed to copyrighted except when covered by fair use.
I disagree that a person learning is the same as an AI model being trained. That aside, fair use typically covers the use of an excerpt or a portion of the material, not reproduction of the work in its entirety.
Agreed: in the end, courts will make the decision.
Well, not exactly. Certain uses are fair. The question is whether OpenAI's use counts as fair, and I don't think your immediate response comes close to addressing that question, despite your conviction that it does.
Also, elements of clothing designs are copyrightable: prints, patterns, and other separable artistic features are protected. The conviction expressed by some participants in this debate is exhausting in light of their unfamiliarity with actual copyright law.
Copyright is just made up for pragmatic purposes: to incentivize creation. It does not matter that training models is not the same as reproducing something exactly; if we decide that it's unfair, or even just desirable for economic incentives, to disallow it, then we are free to make that decision. The trade-offs are fairly profound in both directions, I think, and likely some compromise will need to be made that is fair to all parties and does not cripple economic and social progress.
>But we are allowed to use copyrighted content. We are not allowed to copy copyrighted content. We are allowed to view and consume it, to be influenced by it, and under many circumstances even outright copy it.
It's important to consider in any legalistic argument over copyright that, unlike conventional property rights, which are to some degree prehistoric, copyright is a recent legal construct that was developed for a particular economic purpose.
The existing standards of fair use are what they are because copyright was developed with supporting the art industry as an intentional goal, not because it was handed down from the heavens or follows a basic human instinct. Ancient playwrights clipped each other's ideas liberally; late medieval economists observed that restricting this behavior seemed to encourage more creativity. Copyright law is a creation of humans, for humans, and is subordinate to moral and economic reasoning, not prior to it.
Copyright only makes any sense for goods with a high fixed cost of production and low to zero marginal cost. Any use beyond solving that problem is pure rent-seeking behavior.
Also, with computers being functional, copyright has become a tool of social control; any function in a physical object can be taken away from you at a whim, with no recourse, so long as a computer can be inserted into the object. Absent a major change in how society sees copyright, I envision a very bleak totalitarian future arising from this trend.
No, copyright only makes sense insofar as it provides a net positive value for society: that it promotes/protects more creativity leading to economic output than it prevents.
That is, does the amount of creative/economic output dissuaded by allowing AI (from people who would not be able to, or would not want to, create art if they couldn't get paid) exceed the creative/economic output of letting people develop and use such AIs?
GenAI reduces the fixed cost of creating images/text/whatever, which, all else equal, will increase the amount created. Whether or not you think that is a good thing is probably mostly a function of whether you make money creating these things or pay money to have them created.
> Copyright only makes any sense for goods with a high fixed cost of production and low to zero marginal cost. Any further use beyond solving that problem is pure rent seeking behavior
100% agree. But even then it's not very good. Abolish copyright, severely limit patents, and leave trademarks as they are. The IP paradigm needs an overhaul.
The future of good AI art is Adobe Firefly: a tool in a picture editor that gives users great productivity for certain tasks. Artists won't go extinct; they will be able to produce a lot more art.
That's the future of AI art - but is AI art the future of art? If AI artists can't maintain any profit from their work, how are they going to afford the compute time?
> Copyright only makes any sense for goods with a high fixed cost of production and low to zero marginal cost.
If that's the case, then novels, news articles, digital images, etc. are things that copyright absolutely makes sense for. If you think that they have a "low cost of production", you are sadly misinformed about the artistic process.
Some of these have vanishingly low marginal costs when it comes to reproduction, but in light of their high fixed cost of production, I don't see how that matters.
Clothes are inherently consumable goods. If you use them, they will wear out. If you do not use them, they still age over time. You cannot "copy" a piece of clothing without a truly astonishing amount of effort. Both the processes and the materials may be difficult or impossible to imitate without a very large investment of effort.
Compare this to digital art: you can copy it literally for free. Before AI, at least, you had to copy it mostly verbatim (modulo some relatively boring transforms, like up/down-scaling, etc.). That limited artists' incomes, but not their future works. But in a post-AI world, you can suck in an artist's life's work and generate an unlimited number of copycats. Right now, the quality of those might be insufficient to be true replacements, but it's not hard to imagine a world, not so far off, where it will be sufficient, and then artists will be truly screwed.
GP compared copying a piece of clothing to copying digital art. I'd say that setting up a factory to make knockoffs - or even "just" buying a sewing machine, finding and buying the right fabric, laying out the piece you want to copy, tracing it, cutting the fabric, sewing it, and iterating until it comes out right - would qualify as "a truly astonishing amount of effort" for a person.
Yes. Now you - a generic person without any specific skills - just have to search for someone who is willing to sew knockoff clothing to your specification, ship them the item, and wait for them to get back to you - a month or longer, probably, as they are likely based in China and shipping things by boat - while hoping that your cursory search led you to an honest knockoff manufacturer, who won't just take your money and disappear.
I'd say that it still qualifies as a "truly astonishing amount of effort" when compared to the right-clicking and pressing save method of copying digital art.
> We are allowed to view and consume it, to be influenced by it, and under many circumstances even outright copy it.
In theory: sure
In practice: not really, especially when you're small and the other side is big and has lots of lawyers and/or lawmakers in their pockets.
Disney ("In 1989, for instance, the company even threatened to sue three Florida daycare centers unless they removed murals featuring some of its characters") and Deutsche Telekom[1][2] ("the company's actions just smack of corporate bully tactics, where legions of lawyers attempt to hog natural resources — in this case a primary color — that rightfully belong to everyone") are just two examples that spring to mind.
People and companies are copying copyrighted content when they use datasets that contain copyrighted content (datasets which also repackage and distribute copyrighted content - not just as links but as actual works/images too), download the linked copyrighted content, and store that copyrighted content. Plenty of copies created and stored, it seems to me.
And what, do you think they're trying their damnedest to keep datasets clean and to not store any images in the process? How do you think they retrain on datasets over and over? It's really simple: by storing terabytes of copyrighted content. For ease of use, of course - why download something over and over if you can just download it once and keep it. And if they really wanted to steer clear of copyright infringement, if there's truly "no good solution" (which is bullshit for compute - oh, they can compute everything but not that part), why can't they just refrain from recklessly scraping everything, if something were to just 'slip in'? Like, if you know it's kinda bad, just don't do the thing, right? Well, maybe copyright infringement is just acceptable to them. If not the actual goal.
What they generate is kinda irrelevant - there's plenty of copyright infringement happening even before any training is done. Assembling datasets, and bad datasets containing copyrighted content, is the start and the core of the copyright problems.
There's a really banal thing at the core of this, and it's just multi-TB storage filled with pirated works.
If training a model is fair use, then model output should also satisfy fair use criteria. The very first thing you can find on the internet about fair use is the Wikipedia article on the topic. It lists a number of factors for deciding whether something is fair use. The very first one has a quote from an old copyright case:
> [A] reviewer may fairly cite largely from the original work, if his design be really and truly to use the passages for the purposes of fair and reasonable criticism. On the other hand, it is as clear, that if he thus cites the most important parts of the work, with a view, not to criticise, but to supersede the use of the original work, and substitute the review for it, such a use will be deemed in law a piracy.
Most uses of LLMs and image generation models do not produce criticism of their training data. The most common use is to produce similar works. There's a very common "trick" to get a specific style of output: add "in style of <artist>". Is this a direct way "to supersede the use of the original work"?
You can certainly see how the other factors more or less put gen AI output into the grey zone.
The fact that clothing doesn't qualify for copyright doesn't mean text and images don't. Or if you advocate that they don't, then you pretty much advocate for the abolition of copyright, because those are the major areas of copyright applicability at the moment. That is a stance one can take, but you'd probably do better to actually say so, because claiming that copyright applies to some images and text but not others is a much harder position to defend.
>I have some, but diminishing sympathy for artists screaming about how AI generates images too similar to their work. Yes, the output does look very similar to your work. But if I take your work and compare it to the millions of other people's work, I'd bet I can find some preexisting human-made art that also looks similar to your stuff too.
Just like the rest of AI, if your argument is "humans can already do this by hand, why is it a problem to let machines do it?", it's because you are incorrectly valuing the labor that goes into doing it by hand. If doing X has potentially negative side effect Y, then the human labor required to accomplish X is the principal barrier to Y, which can be mitigated via existing structures. Remove the labor barrier, and the existing mitigation structures cease to be effective. The fact that we never deliberately established those barriers is irrelevant to the fact that our society expects them to be there.
I feel the emotionally charged nature of the topic prevents a lot of rational discussion from taking place. That's totally understandable, too; it's the livelihood of some of those involved. Unless we start making specific regulations for generative AI, current copyright law is pretty clear: you can't call your art a Picasso, but you can certainly say it was inspired by Picasso. The difference is that GAI can do it much faster and cheaper. The best middle ground, in my opinion, is to allow GAI to train on copyrighted data, but the output cannot be copyrighted, and the model weights creating it can't be copyrighted either. Any works modified by a human attempting to gain copyright protection should have to be substantive and transformative, just as fair use requires now.
I think there is a case to be made when AI models do produce copies. For instance, I think the NYT has a right to take issue with the near-verbatim recall of NYT articles. It's not clear-cut, though: when these models produce copies, they are not functioning as intended. Legally that might produce a quagmire. Is it fair use when you intend to be transformative but by accident aren't? Does it matter if you have no control over which bits are not transformative? Does it matter if you know in advance that some bits will be non-transformative but you don't know which ones?
I presume there are people working on research into how to prevent models from outputting raw training data; what is the state of the art in this area? Would it be sufficient to prevent output of the training data, or should the models be required to have no significant internal copies of training examples?
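One naive baseline that gets discussed is an output-side filter: compare each generation against the training corpus and reject anything with a long verbatim overlap. The sketch below is hypothetical Python; the 13-token window and the precomputed `training_ngrams` set are assumptions for illustration, not anyone's actual deployed system.

    # Hypothetical check: flag output that reproduces any 13-token window
    # of the training data verbatim. `training_ngrams` is assumed to be a
    # set of token 13-grams built once over the whole corpus.

    def ngrams(tokens, n=13):
        return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

    def looks_memorized(output_tokens, training_ngrams, n=13):
        # True if any n-token window of the output appears in the corpus.
        return not ngrams(output_tokens, n).isdisjoint(training_ngrams)

Even setting aside the storage cost of the n-gram set, this only catches verbatim copies; near-verbatim recall with small edits slips through, which is one reason filtering outputs is generally considered weaker than deduplicating the training data itself.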
> This is why clothing doesn't qualify for copyright. No matter how original you think your clothing seems, someone in the last many thousands of years of fashion has done it before.
Most every fashion company has a legal team that reviews print and pattern, as well as certain other aspects of design, relative to any source of inspiration. My husband works in the industry and has to send everything he does for review in this way. I’m not sure where you got the idea that there are no IP protections for fashion, but this is untrue.
AI doing things that humans laboriously learned and drew inspiration from is just different. After all, sheer quantity can be its own quality, especially with AI learning.
Now, I am worried about companies like OpenAI monopolizing the technology by making it proprietary. I think their output should be public domain, and copyright should apply only to human authors, if it should apply at all.