this post was submitted on 15 Dec 2023
10 points (91.7% liked)
Free Open-Source Artificial Intelligence
2896 readers
1 users here now
Welcome to Free Open-Source Artificial Intelligence!
We are a community dedicated to forwarding the availability and access to:
Free Open Source Artificial Intelligence (F.O.S.A.I.)
More AI Communities
LLM Leaderboards
Developer Resources
GitHub Projects
FOSAI Time Capsule
- The Internet is Healing
- General Resources
- FOSAI Welcome Message
- FOSAI Crash Course
- FOSAI Nexus Resource Hub
- FOSAI LLM Guide
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Hehe. I've recently spent $5 on OpenRouter and tried a few models from 7B to 70b and even one hundred and something billion parameters. They definitely get more intelligent. But I have determined that I'm okay within the 7B to 33B range. At least for my use-case. I've tested creative storywriting and dialogue in a near-future setting where AI and androids permeate human society. I wasn't that impressed. The larger models still did some of the same mistakes and struggled with spacial positions of the characters and the random pacing of the plot points didn't really get better.
This wasn't a scientific test whatsoever, I just took random available models, some were fine-tuned for similar purposes, some not and I just clicked my way through the list. So your mileage may vary here. Perhaps they're much better with factual knowledge or reasoning. I read a few comments from people who like for example chatting with the Llama(2) base model at 65b/70b parameters and say this is way better than the 13b fine-tunes I usually use.
And I also wasn't that impressed with OpenRouter. It makes it easy and has some 'magic' to add the correct prompt formatting with all the different instruct formats. But I still had it entangle itself in repetition loops or play stupid until I went ahead and disabled the automatic settings. And once again tried to find the optimal prompt format and settings.
So I'm back to KoboldCpp. I'm familiar with it's UI and all the settings. I think the CUDA toolkit within the Debian Linux repository is somewhat alright. I've deleted it because it takes up too much space and my old GPU with 2GB of VRAM is useless anyways. We cerainly all had our 'fun' with the proprietary NVidia stuff.