SniffDoctor

joined 7 months ago
[–] SniffDoctor@lemmy.ml 1 points 6 months ago* (last edited 6 months ago)

Yes. The Llama 70B derived models, as well as Mixtral 8x7B and the new Mistral Medium 70B are competitive with ChatGPT 3.5. Most of them can do 16,000 token context similar to ChatGPT as well.

You only NEED 40GB of free RAM to run them at decent quality, but it's slow.

With a 24GB GPU like a 3090 or 4090 you can run them at a reasonable speed with partial GPU offload. About 1-2 words per second. I run 70Bs in this manner on my computer.

With two 24GB GPUs you can run them very fast, like ChatGPT.


There's of course a whole world in between as well, but those are the rough hardware requirements to match ChatGPT in a self-hosted sort of way. There's also a new thing people are doing where they add layers from one model onto another one, like a merge but keeping >50% of the original layers from each model. "Goliath 120B" and the like. They're even better but it's a bit beyond reasonable consumer hardware now.

[–] SniffDoctor@lemmy.ml 15 points 7 months ago (5 children)

I hope to one day have enough free time to want to watch someone play video games unedited for hours at a time, let alone be willing to pay money for the privilege.

[–] SniffDoctor@lemmy.ml 1 points 7 months ago (4 children)

If this sort of thing is light enough to run on a Raspberry Pi or old laptop, is there anything wrong with just running it in the background on a modern(ish) computer? My desktop is a 5900x with 64GB of RAM. I already run Jellyfin, Sunshine, and a few other things in the background but kind of want to add a web server and mess around with some other stuff.

My main concern would be security I suppose if I'm hosting a web server on the same computer I store all my family backups and stuff. Would using virtual machines solve that?

[–] SniffDoctor@lemmy.ml 9 points 7 months ago

I'm not a secret agent or criminal, I just want an alternative to Google/Microsoft's suite that doesn't blatantly harvest all my information and try to sell me things based on it.