Okay, explain. What kinds of low-hanging fruit?
quants are pretty basic. switching from floats to ints (faster instruction sets) is the other well-known one. both of those are related to information theory, but there are other things I legally can't mention. shrug. suffice to say, model sizes are going to decrease dramatically.
edit: the first two points require reworking the base infrastructure to support them, which is why they haven't hit widespread adoption. but the research showing that 3 bits is as good as 64 is intuitive once you tie it back to the original inspiration for some of the AI designs. that reduction alone gets you a 21x cut in model size, which is pretty solid.
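(For concreteness: the "21x" figure is just the bit-width ratio, 64 / 3 ≈ 21.3. Below is a minimal sketch of what naive symmetric int8 quantization looks like, assuming NumPy; the scale-and-round scheme is illustrative, not any particular inference stack's method.)

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Naive symmetric quantization: map float32 weights onto int8.

    Toy sketch only -- no per-channel scales, no zero point, no calibration.
    Returns the int8 tensor plus the scale needed to dequantize.
    """
    scale = float(np.abs(weights).max()) / 127.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.normal(scale=0.02, size=(1024, 1024)).astype(np.float32)
q, scale = quantize_int8(w)

print(f"float32: {w.nbytes / 1e6:.1f} MB")                       # ~4.2 MB
print(f"int8:    {q.nbytes / 1e6:.1f} MB (4x smaller)")          # ~1.0 MB
print(f"max abs round-trip error: {np.abs(w - dequantize(q, scale)).max():.2e}")

# the "21x" number is just the bit-width ratio: 64-bit weights cut to 3-bit
print(f"64 / 3 = {64 / 3:.1f}x")
```

(This only shrinks storage; whether accuracy survives the cut is exactly what the reply below disputes.)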
hahahaha fuck off with this. no, the horseshit you’re fetishizing doesn’t fix LLMs. here’s what quantization gets you:
anyway speaking of basic information theory:
lol
It's actually super easy to increase the accuracy of LLMs.
I left out all the other details because it's pretty intuitive why it works if you understand why floats have precision issues.
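(The concrete version of "floats have precision issues", in plain Python, with nothing model-specific assumed:)

```python
# the usual demonstrations that binary floats are inexact
print(0.1 + 0.2 == 0.3)        # False -- the sum is 0.30000000000000004

# limited precision: adding a small value to a huge one can do nothing at all
print(1e16 + 1.0 == 1e16)      # True -- the 1.0 is absorbed

# and the errors accumulate when you sum many of them
print(sum([0.1] * 10) == 1.0)  # False -- the sum is 0.9999999999999999
```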
decimal is a severely underappreciated library
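(Assuming that's a nod to Python's standard-library decimal module: it does give exact decimal arithmetic, at a substantial speed cost. A quick illustration:)

```python
from decimal import Decimal, getcontext

# binary floats round; Decimal arithmetic on decimal strings does not
print(0.1 + 0.2)                        # 0.30000000000000004
print(Decimal("0.1") + Decimal("0.2"))  # 0.3

# precision is configurable (the default context is 28 significant digits)
getcontext().prec = 50
print(Decimal(1) / Decimal(7))          # 0.142857142857... out to 50 digits
```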