this post was submitted on 14 Jun 2023
26 points (100.0% liked)
Lemmy.World Announcements
29026 readers
8 users here now
This Community is intended for posts about the Lemmy.world server by the admins.
Follow us for server news ๐
Outages ๐ฅ
https://status.lemmy.world
For support with issues at Lemmy.world, go to the Lemmy.world Support community.
Support e-mail
Any support requests are best sent to info@lemmy.world e-mail.
Report contact
- DM https://lemmy.world/u/lwreport
- Email report@lemmy.world (PGP Supported)
Donations ๐
If you would like to make a donation to support the cost of running this platform, please do so at the following donation URLs.
If you can, please use / switch to Ko-Fi, it has the lowest fees for us
Join the team
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I thought it was obvious this was the case. Twitter and Reddit are unhappy that AI language models have used all this data for training, but didn't get paid for it. Personally I don't even consider it "their" data to begin with even if they can claim legal ownership of it. But they want to get paid obscene amounts of money for data that was created by the goodwill of their users.
If they were worried data being scooped up for AI training, they would have approached it differently. What they did was target 3rd party apps to drive people to their app so they can get tracking data and push ads.
Yes, but the special thing here is that OpenAI, which has a lot of shared stakeholders with Reddit, has already trained their models on its data, so they might have an interest in turning it off for the other companies. Also, they might be in a better position to negotiate with Reddit for special access to the data than smaller companies.
It's a pretty wild theory, but interesting nontheless.
It's not their data. If you scrape Reddit for the comments are reposted them somewhere else Reddit wouldn't be able to come after you with a copyright violation lawsuit.
Any potential copyright is still owned by the original user with Reddit having a license to sublicense for "syndication, broadcast, distribution, or publication by other companies, organizations, or individuals who partner with Reddit."
They would have to come after you with a ToS contract violation or maybe some kind of Computer Fraud and Misuse allegations.
I completely agree it isn't their data. They still want money for data that isn't theirs.
Sorry if I seemed argumentative. I was trying to state that it wasn't just your opinion that they don't own user data but it is a fact they don't own user data.