this post was submitted on 22 Jul 2023
156 points (91.5% liked)

Technology

59201 readers
4225 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
top 37 comments
sorted by: hot top controversial new old
[–] Dusty@l.dusty-radio.com 12 points 1 year ago

Why is crap from 2 weeks ago being posted like it's new news yet again

Is this one of those bot accounts that aren't marked properly or is OP just after karma (which doesn't exist on this site).

[–] HonorIsDead@lemmy.world 11 points 1 year ago (1 children)

I'm conflicted on a lot of this. At the end of the day it seems like these LLMs are simulating human behavior to an extent - exposure to content and generating similar content from that. Could Sarah Silverman be sued by comedians who influenced her comedy style and routines? generally no. I do understand the risk with letting these 'AI' run rampant to displace a huge portion of the creative space which is bad but where should the line be drawn? Is it only the fact they were trained material they dont own people are challenging? What recourse will they have when a LLM is trained on wholly owned IP?

[–] Granite@kbin.social 14 points 1 year ago (2 children)

She’s suing for copyright infringement, basically, not the LLM emulating her style.

The LLMs read books from her and many, many others that they didn’t buy, because unauthorized copies had been uploaded to the web (happens to every popular book).

Honestly, I don’t know if she has a case. Going after the people who illegally uploaded her book would be the proper route, but that’s always nearly impossible.

Long and short, LLMs benefited from illegal copies.

[–] billstickers@aussie.zone 1 points 1 year ago

I see a lot of people claim the training model included copyrighted works particularly books because it can provide a summary of it. But it can provide a summary of visual media too, and no one is claiming it’s sitting there watching films.

If the argument is it has quite a detailed knowledge of the book, that’s not convincing either. All it needs is a summary and it can make up the blanks, and get it close enough we can’t tell the difference. Nothing is original.

[–] hedgehog@ttrpg.network -1 points 1 year ago (1 children)

If you upload an illegal copy of a book and I download it, not realizing or caring that it’s pirated, and then I re-upload it elsewhere, you and I have both committed copyright infringement. This feels like the same thing.

I suspect the case will depend largely on whether the ways that the models were trained using her works qualify as fair use.

[–] islandofcaucasus@lemmy.world 4 points 1 year ago (2 children)

Your example is faulty. If you upload an illegal copy of a book and I read it then tell people all about it, I am not committing copyright infringement

[–] kromem@lemmy.world 1 points 1 year ago* (last edited 1 year ago)

How did you read it?

Did you access it where it was illegally posted online?

And in so doing, copy it locally in order to read it?

Guess what? According to copyright laws in the US, you just committed copyright infringement.

There's two separate claims.

One, that training is infringement, will hopefully be found to be without merit or it's a slippery slope to the death of free use.

The other, that OpenAI committed copyright infringement by downloading pirated books, is not special in any way with the AI stuff. It doesn't matter how they used it. If they can be found to have downloaded it - even if they then never even opened the file - they are liable for civil damages that can be as high as $150,000 per work if they knew in advance that they were pirating it, and not less than $100 per work no matter if they knew or not.

This is the result of years of lobbying by the various digital rights owners over the past few decades. It's a very broad scope of law and OpenAI should rightfully be concerned if they didn't actually purchase the copyrighted material they used to train.

You can learn and share the knowledge from a book I might illegally upload, but if you are caught having made a copy of the pirated textbook I uploaded, you are liable for damages completely separate from what you did with the knowledge from the books.

[–] hedgehog@ttrpg.network -1 points 1 year ago

If you use that illegal copy to create a work, then your copy infringes copyright (unless it falls under fair use). LLMs don’t count as people in any legal sense, and training them doesn’t have a legal status comparable to a real person reading books.

[–] LexiconDexicon@lemmy.world 10 points 1 year ago (1 children)
[–] seasonone@opidea.xyz 6 points 1 year ago (1 children)

Me Too. I don't understand why aren't publisher like Macmillan suing them?

[–] Ensign_Crab@lemmy.world 4 points 1 year ago

Do you think publishers want to pay human authors?

[–] AngrilyEatingMuffins@kbin.social 5 points 1 year ago* (last edited 1 year ago)

none of the LLMs are funny or outrageous so I doubt she was highly cited.

it would be nice if the end result of all of these cases was that publicly sourced models had to be public services

ha! as if.

I'm still waiting on proof for any of these allegations. So far it's just been people suing for the sake of suing and hoping they strike gold. If anyone can point to any evidence at all (read: not hearsay) then I'll gladly review it, but as it stands, its nothing.

[–] feedum_sneedson@lemmy.world -2 points 1 year ago

I don't like Sarah Silverman.

[–] Marsupial@quokk.au -3 points 1 year ago (3 children)

I hope she loses.

No one should “own” words or concepts.

[–] Default@aussie.zone 7 points 1 year ago (1 children)

You don't think authors should get paid?

[–] pulaskiwasright@lemmy.ml 5 points 1 year ago (1 children)

So why are you ok with openai being paid for taking work from other people that you don’t think should be paid? If she loses, then that’s the situation.

[–] Marsupial@quokk.au 0 points 1 year ago* (last edited 1 year ago) (1 children)

So change the system that lets them be paid for, don’t paywall human culture and let that system continue.

You’re picking the wrong target here.

[–] pulaskiwasright@lemmy.ml 2 points 1 year ago

The system exists. We all have to live with it. Or change it, but this case won’t do that and you’re effectively siding with big tech over authors.

[–] mrmanager@lemmy.today -5 points 1 year ago (2 children)

Then why are we banned from using certain words on social media?

[–] jgardner10@lemmy.world 6 points 1 year ago (1 children)

Just because you can freely say something doesn’t mean I have to forced to listen to it.

[–] mrmanager@lemmy.today -2 points 1 year ago (2 children)

And that means that the words cannot be used, which means they are not owned by you. If you could use them, you would own them right?

[–] Cail@kbin.social 3 points 1 year ago

But the words can be used, just not in that specific space. If you're not allowed to bring a gun to a restaurant it doesn't mean the restaurant suddenly owns the bullets.

[–] Marsupial@quokk.au 2 points 1 year ago

That's a completely unrelated topic.

You're talking about censorship, I'm talking about ownership.

load more comments
view more: next ›