If artificial intelligence uses your work, it should pay you (wapo.st)

submitted 1 year ago by silence7@slrpnk.net to c/technology@lemmy.world

[–] fubo@lemmy.world 6 points 1 year ago (1 children)

Oh sure, if a copyright holder can demonstrate that a specific work is reproduced. Not just "I think your AI read my book and that's why it's so good at carpentry."

[+] silence7@slrpnk.net -9 points 1 year ago (2 children)

The thing is that they're all reproduced, at least in part. That's how these models work.

[–] fubo@lemmy.world 12 points 1 year ago* (last edited 1 year ago) (2 children)

Reproducing a work is a specific thing. Using an idea from that work, or a transformation of that idea, is not reproducing that work.

Again: If a copyright holder can show that an AI system has reproduced the text (or images, etc.) of a specific work, they should absolutely have a copyright claim.

But "you read my book, therefore everything you do is a derivative work of my book" is an incorrect legal argument. And when it escalates to "... and therefore I should get to shut you down," it's a threat of censorship.

[–] silence7@slrpnk.net -5 points 1 year ago (1 children)

The problem is that the LLMs (and image AIs) effectively store pieces of works as correlations inside them, occasionally spitting some of them back out. You can't just say "it saw it," but you can say "it's like a scrapbook with fragments of all these different works."

[–] fubo@lemmy.world 7 points 1 year ago (1 children)

I've memorized some copyrighted works too.

If I perform them publicly, the copyright holder would have a case against me.

But the mere fact that I could recite those works doesn't make everything that I say into a copyright violation.

The copyright holder has to show that I've actually reproduced their work, not just that I've memorized it inside my brain.

[+] silence7@slrpnk.net -8 points 1 year ago* (last edited 1 year ago) (2 children)

The difference is that your brain isn't a piece of media which gets copied. The AI is. So when it memorizes, it commits a copyright violation.

[–] fubo@lemmy.world 7 points 1 year ago* (last edited 1 year ago) (1 children)

If that reasoning held, then every web browser, search engine bot, etc. would be violating copyright every time it accessed a web page, because doing so involves making a copy in memory.

Making an internal copy isn't the same as publishing, performing, etc. a work.

[+] silence7@slrpnk.net -8 points 1 year ago (1 children)

There's an implied license to copy web content for the purpose of displaying it. Copies for other purposes... not so much. There has been a whole series of lawsuits over the years over just how much you can copy for what purpose.

[–] fubo@lemmy.world 5 points 1 year ago* (last edited 1 year ago) (1 children)

There isn't an "implied license". Rather, copyright is simply not infringed until the work is actually republished, performed, etc. without the copyright holder's permission.

Making internal in-memory copies — e.g. for search-engine indexing — is simply not an infringement to begin with; just as it's not an infringement for me to memorize a copyrighted work, but it would be an infringement if I were to recite it in a public performance without permission.

Copyright simply does not grant the copyright-holder absolute & total control of everything downstream from the work. It restricts republishing, performing, etc.; it does not restrict memorization, indexing, summarizing in a review, answering questions about the work, etc.

Again: if the AI system is made to regurgitate the actual text of the work, that's still a copyright infringement. But merely having learned from it is not.

[–] silence7@slrpnk.net -3 points 1 year ago (1 children)

This is different from those, and not at all tested in the courts. It will likely take a whole bunch of lawsuits and several years before this is settled.

[–] conciselyverbose@kbin.social 3 points 1 year ago

There is no possible basis in law for a copyright infringement claim here.

Copyright infringement isn't "you can do these things with copyrighted materials and everything else is banned". It's "these specific things (redistributing substantial portions of published works) are disallowed, unless you meet exceptions, and anything not explicitly disallowed is legal".

You are unconditionally allowed to learn from copyrighted works. There is no legal basis for preventing it. There is no possible basis in copyright law preventing it. It would take new legislation restricting doing so, and it would be impossible to apply to any training that happened before this new crime against humanity of a law was written.

[–] conciselyverbose@kbin.social 5 points 1 year ago

No, it doesn't. Learning from copyrighted material is black-and-white fair use.

The fact that the AI isn't intelligent doesn't matter. It's protected.

[–] FaceDeer@kbin.social 4 points 1 year ago

No, that's not how these models work. You're repeating the old saw about these being "collage machines", which is a gross mischaracterization.