Sam Altman says ChatGPT should be 'much less lazy now' (www.businessinsider.com)

submitted 1 year ago by L4s to c/technology


ChatGPT users previously complained that the chatbot was slacking off and refusing to complete some tasks.


[–] akrot 2 points 1 year ago (1 children)

ROCm? Is that even supported now? Last time I checked it was still a dumpster fire. What are the RAM and VRAM requirements for Mixtral 8x7B?

[–] AlmightySnoo 0 points 1 year ago* (last edited 1 year ago) (1 children)

ROCm is decent right now: I can do deep learning stuff and CUDA programming with it on an AMD APU. ollama doesn't work out of the box with APUs yet, though users report that it works with dedicated AMD GPUs.

As for Mixtral8x7b, ~~I couldn't run it on a system with 32GB of RAM and an RTX 2070S with 8GB of VRAM, I'll probably try with another system soon~~ [EDIT: I actually got the default version (mixtral:instruct) running with 32GB of RAM and 8GB of VRAM (RTX 2070S).] That same system also runs CodeLlama-34B fine.
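For a rough sense of why that fits, here is a back-of-the-envelope sketch of the memory needed for the weights alone. It assumes Mixtral 8x7B has about 46.7 billion parameters in total and that the default tag is a roughly 4-bit quantization; both figures are assumptions for illustration, and real quantized files carry some overhead on top (plus the KV cache at runtime).

```python
def estimate_weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GB (10**9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumed total parameter count for Mixtral 8x7B (not stated in the thread).
MIXTRAL_PARAMS = 46.7e9

for label, bits in [("fp16", 16), ("8-bit", 8), ("4-bit", 4)]:
    gb = estimate_weight_memory_gb(MIXTRAL_PARAMS, bits)
    print(f"{label:>5}: ~{gb:.1f} GB")
```

Under those assumptions the 4-bit weights come to roughly 23 GB, which is within the combined 32GB of RAM and 8GB of VRAM, while fp16 would not be.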

So far I'm happy with Mistral 7B. It's extremely fast on my RTX 2070S, and it's not really slow when running in CPU mode on an AMD Ryzen 7. Its speed is okayish (~1 token/sec) when I try it in CPU mode on an old ThinkPad T480 with an 8th-gen i5 CPU.
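If you want to measure tokens/sec yourself, here is a minimal sketch against ollama's local REST API. It assumes the server is listening on the default `http://localhost:11434` and that the model is pulled under the name `mistral`; the prompt and the helper names are made up for illustration.

```python
import json
import urllib.request

# Assumed default local endpoint of the ollama server.
OLLAMA_URL = "http://localhost:11434/api/generate"


def tokens_per_second(result: dict) -> float:
    """Generation speed from eval_count and eval_duration (reported in nanoseconds)."""
    return result["eval_count"] / (result["eval_duration"] / 1e9)


def generate(model: str, prompt: str) -> dict:
    """Send one non-streaming generation request and return the parsed JSON reply."""
    payload = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode()
    request = urllib.request.Request(
        OLLAMA_URL, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def main() -> None:
    # Needs a running ollama server with the model already pulled.
    result = generate("mistral", "Explain what an APU is in one sentence.")
    print(result["response"])
    print(f"{tokens_per_second(result):.1f} tokens/sec")
```

Call `main()` with the server running to get a number to compare across machines. `ollama run mistral --verbose` prints similar timing stats without any code.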

[–] akrot 2 points 1 year ago

I have a Ryzen APU, so I was curious. I tried fiddling with it yesterday and managed to up the "VRAM" to 16GB. But installing xformers and flash-attention for LLM support on iGPUs is not officially supported, and I couldn't get anything past PyTorch installed. It's a step further for sure, but it still needs lots of work.
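For anyone at the same stage, a quick sanity check of what the installed PyTorch build actually sees might look like the sketch below. ROCm builds of PyTorch reuse the `torch.cuda` API, so the same calls apply. The function name is made up for illustration, and whether an iGPU shows up at all depends on the ROCm setup.

```python
def describe_torch_backend() -> str:
    """Report whether PyTorch is a ROCm or CUDA build and which device it sees."""
    try:
        import torch
    except ImportError:
        return "PyTorch is not installed"

    # torch.version.hip is set on ROCm builds and None otherwise.
    hip_version = getattr(torch.version, "hip", None)

    if not torch.cuda.is_available():
        return "PyTorch is installed but sees no GPU (CPU only)"

    name = torch.cuda.get_device_name(0)
    if hip_version:
        return f"ROCm/HIP {hip_version} build, device: {name}"
    return f"CUDA {torch.version.cuda} build, device: {name}"


print(describe_torch_backend())
```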