this post was submitted on 21 Jan 2024
286 points (96.7% liked)
Technology
My RTX 4060 has 16GB of RAM. What on earth makes them think people would go for 12GB?
Not being a power of 2 gives me displeasure.
It is in base 6.
And base 3 and 12. But we don’t really use those numbering systems.
I have a 2060 Super with 8GB. The VRAM is currently enough for FHD gaming, or at least it isn't the bottleneck, so 12GB might be fine for that use case. BUT I'm also toying around with AI models, and some of the current models already ask for 12GB of VRAM to run the complete model. It's not that I would never get a 12GB card as an upgrade, but you can be sure that I'd research all the alternatives first, and then it wouldn't be my first choice but a compromise, as it wouldn't future-proof me in this regard.
Do you think there is a large overlap of people who buy $600-$900 cards and like 1080p?
Personally, my 3080 10GB runs out of VRAM at 1440p. I would never get <16GB again.
I have a 4090 and I feel the pinch on VRAM with AI. It's never enough.
Thanks, that was going to be exactly my question. I don’t see anyone choosing low memory for video, but I had no idea what AI needs.
You can run Stable Diffusion XL on 8GB of VRAM (to generate images). For beginners, there's e.g. the open source software Fooocus, which handles quite a lot of the work for you: it sends your prompt to a GPT-2 model (running on your PC) to do some prompt engineering, then uses the result to generate your images. It also comes with several presets, etc. to get going easily.
Jan (basically open source software that resembles ChatGPT and lets you use several AI models) can run in 8GB, but only with 3B models or quantized 7B models. They recommend at least 16GB for regular 7B models (which they consider "minimum usable models"). Then there are larger, more sophisticated models that require even more.
Jan can also run on the CPU using your regular RAM. Since it's chatting with you, it's not too bad when it spits out words slowly, but a GPU is / would be nice here...
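To get a feel for why 3B and quantized 7B models fit in 8GB while regular 7B models want 16GB, here is a rough back-of-the-envelope sketch. It only counts the model weights and assumes 2 bytes per parameter for fp16 and about 0.5 bytes per parameter for 4-bit quantization; the function name and the precision labels are made up for illustration, and real usage is higher because of context cache, activations and framework overhead.

```python
# Rough estimate of the memory needed just to hold a model's weights.
# Assumption: weights dominate; context cache, activations and runtime
# overhead are ignored, so real requirements are somewhat higher.

# Approximate bytes per parameter (hypothetical labels for this sketch).
# Real 4-bit formats store extra scaling data, so "q4" is a lower bound.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "q4": 0.5}


def weight_memory_gb(params_billions: float, precision: str) -> float:
    """Return the approximate weight size in GB (1 GB = 10**9 bytes)."""
    return params_billions * BYTES_PER_PARAM[precision]


if __name__ == "__main__":
    for size in (3, 7):
        for precision in ("fp16", "q4"):
            gb = weight_memory_gb(size, precision)
            print(f"{size}B @ {precision}: ~{gb:.1f} GB of weights")
```

With these assumptions, a 7B model at fp16 already needs about 14 GB for the weights alone, which is why it doesn't fit on an 8GB card, while the same model at 4 bits is around 3.5 GB.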
Thanks
I’ve seen people say that card is absurd. I’m not sure who is right there.