this post was submitted on 10 Aug 2023
1646 points (97.6% liked)

Technology

59092 readers
6622 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Vulnerabilities in Sogou Keyboard encryption expose keypresses to network eavesdropping.

you are viewing a single comment's thread
view the rest of the comments
[–] Goodie@lemmy.world 97 points 1 year ago (4 children)

It's stories like this that don't surprise me as much as make me ask: How the fuck do you store and process this much data to get anything useful out of it.

[–] toofpic@lemmy.world 51 points 1 year ago (1 children)

You just save the first 50 digits typed after some email is typed, and you have all the passwords you need!

[–] Goodie@lemmy.world 4 points 1 year ago (2 children)

This only applies if a username is a email

And if it is then what happens when people actually email someone? Autocorrect during login?

[–] ultimate_question@lemmy.world 7 points 1 year ago* (last edited 1 year ago) (1 children)

I don't think they're saying that method would yield 100% clean data but it would give you all the "necessary" data with the absolute bare minimum storage requirement. At some point people will log into their email and for most people if you have their email password you have the password they use for everything

[–] toofpic@lemmy.world 0 points 1 year ago

Yep, I only reacted to a "new requirement": save space :)

[–] WarmSoda@lemm.ee 2 points 1 year ago

They weren't describing a use case for every single type of situation.

[–] WarmSoda@lemm.ee 37 points 1 year ago (2 children)

I could be wrong, and this is a generalization of any country you can name, but my impression is data is stored on everyone so when they decide someday to look you up they already have all the data collected. It's not really processed until needed.

[–] TheYear2525@lemmy.world 12 points 1 year ago (1 children)

And in hopes of it being useful later, when processing power is better.

Hey GovGPT8, please rank the 10 citizens most likely to organize protests if we institute curfews.

[–] WarmSoda@lemm.ee 2 points 1 year ago
[–] perviouslyiner@lemm.ee 14 points 1 year ago (2 children)

And how can autosuggest / autocorrect be so bad with so much training data

[–] TheEntity@kbin.social 4 points 1 year ago

Did you ever see how an average person types? It's not the amount of data that is the problem. We have too much dumb data!

[–] Steeve@lemmy.ca 2 points 1 year ago

The real answer is compute power. At the moment it's very expensive to run the computations necessary for big LLMs, I've heard some companies are even developing specialized chips to run them more efficiently. On the other hand, you probably don't want your phone's keyboard app burning out the tiny CPU in it and draining your battery. It's not worth throwing anything other than a simple model at the problem.

[–] BobKerman3999@feddit.it 1 points 1 year ago

They can a "rollup" of the data to coalesce a lot of stuff and still maintaining precision