this post was submitted on 17 Dec 2024
214 points (97.8% liked)

Technology

62816 readers
4986 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
all 50 comments
sorted by: hot top controversial new old
[–] billwashere 76 points 2 months ago (3 children)

Is 5090 the card model number or the price?

[–] JaddedFauceet 49 points 2 months ago

Uh it is both, for convenience, TLDR 5090

load more comments (2 replies)
[–] Kyrgizion 75 points 2 months ago (4 children)

Doesn't matter, I couldn't afford a 5090 anyway.

[–] [email protected] 59 points 2 months ago (2 children)

going by trend, I don't think my apartment could fit a 5090

[–] Drunemeton 21 points 2 months ago (1 children)

On the plus side you no longer need heating.

[–] mightyfoolish 5 points 2 months ago

Or you have to keep it in your other apartment to avoid heat stroke. If you can't afford another apartment, you probably can't afford this card.

[–] FireWire400 10 points 2 months ago

You'd need a separate apartment just to keep it from sagging...

[–] [email protected] 10 points 2 months ago* (last edited 2 months ago) (1 children)

Nvidia doesn't care. Their margins will be high enough, and there will be enough buyers that they'll be able to print money at whatever absurd price point they choose.

[–] [email protected] 8 points 2 months ago (3 children)

It's gonna be over $2,000. (Uneducated caveman ignorant guess)

[–] [email protected] 6 points 2 months ago

I'm seeing $1900, but guessing $2500+ once the scalpers get to scalping. I don't intend to buy one, but I hope those assholes get fucked over.

[–] Raxiel 4 points 2 months ago

If "Learning from history" counts as education, I wouldn't call it an uneducated guess.

[–] [email protected] 1 points 2 months ago

Close It will actually be ~$8000

[–] kn33 6 points 2 months ago

I got my 3080 and I plan on keeping using it for a good while still.

[–] Psythik 4 points 2 months ago* (last edited 2 months ago)

I got lucky and scored a 4090 at a discount on launch day. I think I'll just hang onto it for a few more generations. It still has plenty of power to run everything at 4K 120Hz especially with DLSS Balanced, (except Indiana Jones; that game is extremely demanding), which I use anyway cause IMO it looks better than AA.

[–] FireWire400 37 points 2 months ago (1 children)

Oh no, I'm sure no one could've guessed the names...

[–] Vash63 33 points 2 months ago (1 children)

32GB is news. It confirms either a 512 or (more likely) 256 bit bus, which would be a significant drop from the 384 bit on the 4090.

I'm sure the increased perf of G7 would fully offset that, but this means without some larger caches it will be difficult for this to be a massive performance jump from last gen.

Unless they're going 512 bit, in which case ignore all that and wow is this a monster.

[–] [email protected] 17 points 2 months ago (2 children)

More likely a 256 bit bus? Not with the monster of a card the 5090 would be. 256 bit on that massive of a card would be ridiculous. The card is set to draw 600 watts.

[–] [email protected] 15 points 2 months ago

For the record, in case anyone cared, I audibly laughed when I read 600 watts.

[–] [email protected] 8 points 2 months ago

sad PSU noises

[–] Ragdoll_X 37 points 2 months ago (1 children)

Knowing Nvidia's exorbitant pricing, I think I'll keep Intel's Arc B580 in my wishlist.

load more comments (1 replies)
[–] [email protected] 25 points 2 months ago (2 children)

Allegedly, the 5090 would have 32GB and the 5080 16GB, I don't see much room for the 5060 to have more than 8GB if the 5070 itself has 12GB?

I would have loved to see the 5080 with 24GB, the 5070 with 16GB and the 5060 with 12GB (at least). And for the 5060 to drop the 128 bit bus...

[–] KoalaUnknown 14 points 2 months ago (1 children)

I would have loved to see the 5080 with 24GB

They wouldn't do this because it would undercut their 5090 sales.

[–] CosmoNova 3 points 2 months ago (2 children)

Sales and probably pricing itself. With a gap this large, the price of the 5090 might be more than twice as high as that of the 5080 and people would still buy it.

[–] [email protected] 3 points 2 months ago

If the value proposition continues, and based on the leak it seems like it will, the 5090 would be a better value FPS per dollar than the 5080 and anything below it. The 4090 cost like 40% more than the 4080, but gave 60% better performance. The 5090 looks like it will be well over twice the performance of the 5080, so I'd expect twice the price.

[–] [email protected] 1 points 1 month ago

Great foreshadowing

[–] Raxiel 4 points 2 months ago

A 5060 with 8gb will have Intel wringing their hands with glee. Whatever performance gains the ($300-350) 5060 GPU has (and I don't doubt they'll be notable) will be choked off by an 8gb framebuffer. They might do a 16gb clamshell like they did with the 4060ti but like you said, where does that leave the 12gb 70 card?

[–] [email protected] 12 points 2 months ago (4 children)

I just want one to self host a 70B LLM model for fuck's sake. I don't want to be forced to take out a god damned mortgage/personal loan to buy one.

[–] Speculater 8 points 2 months ago

Damn, they require like 50GB vram, that's nuts.

[–] [email protected] 6 points 2 months ago (1 children)

I picked up a pair of old Tesla P40s. Right now I'm running a Q4 quant of Qwen 2.5 72B that fits in the combined 48GB of VRAM with 12k context. They aren't as fast as newer consumer cards, but it generates as fast as I can read while costing less than a used 3080.

[–] BatrickPateman 2 points 2 months ago (1 children)

interesting. They are cooled passively, right? What's your case and cooling setup?

[–] [email protected] 2 points 2 months ago

I have a dell power edge 730, which was about $200. It's CPU shrouds perfectly match the GPU intakes so air just flows through both from the server fans. I've seen a few 3d printable fan mounts for jury rigging them into a regular tower too.

[–] brucethemoose 4 points 2 months ago* (last edited 2 months ago) (1 children)

Qwen 2.5 32B is where it's at now. 24GB is affordable, and it fits perfectly.

Otherwise, stay on the lookout for AMD Strix Halo, which can reportedly allocate up to 96GB on its IGP, and you can run faster backends like vllm or exllama.

[–] [email protected] 1 points 2 months ago (1 children)

What's up with Qwen that makes it better than anything else?

[–] brucethemoose 4 points 2 months ago* (last edited 2 months ago)

It's just smarter with the same number of parameters. Try Qwen QwQ or Qwen coder 32B, see for yourself... it stacks up well against huge models like the 123B Mistral Large, or even GPT-4.

Why? Alibaba trained it well, presumably with better data than OpenAI or whomever else, though specifics are up for debate. Some suggests that bilingual training on English/Chinese (aka the two largest text corpuses in existance) significantly helps the model over mostly english. Some say the government just gave them better data. There's also suggestions that having so few GPUs compared to American AI companies made the Chinese "thrifty," and gave them far more incentive to be innovative rather than brute forcing models (which has diminishing returns).

[–] Ragdoll_X 1 points 2 months ago (1 children)

You might just want to use Kaggle tbh

[–] [email protected] 1 points 2 months ago

Never heard of it.