this post was submitted on 02 Jul 2023
79 points (94.4% liked)

top 12 comments
[–] [email protected] 15 points 1 year ago* (last edited 1 year ago) (2 children)

Every time I see a post like this I ask the same thing, and I have yet to receive an answer.

Why should I care?

There are so many open source language models, all with different strengths and weaknesses. There are tools to run them on any OS with all kinds of different hardware requirements.

This has been the case since before ChatGPT came out, and the field has only exploded since then.

GPT4All is just one recent model, but in recent weeks it keeps making headlines as "run ChatGPT at home."

What does it do to stand out? Why would I use this and not one of the Vicuna or LLaMA models?

Hugging Face has a leaderboard for open-source large language models:

https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard

If you're interested in running this tech at home, familiarize yourself with multiple models, because they will all behave differently depending on your hardware and your needs.
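
For anyone who wants to compare a few leaderboard models locally, here is a minimal sketch using the gpt4all Python bindings. The model filenames below are placeholders; swap in whichever models you actually download.

```python
# pip install gpt4all
from gpt4all import GPT4All

# Placeholder filenames: use the models you have downloaded or want to try.
models = ["model-a.bin", "model-b.bin"]
prompt = "Explain what self-hosting means in one sentence."

for name in models:
    model = GPT4All(name)  # downloads the model if it isn't already cached locally
    reply = model.generate(prompt, max_tokens=128)
    print(f"--- {name} ---\n{reply}\n")
```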

[–] [email protected] 4 points 1 year ago

It's a collection of models you can download. It acts as a simple GUI entry point into the LLM world. Great for testing different things.

[–] [email protected] 4 points 1 year ago (1 children)

Anybody know of a good guide for hosting this with a GPU? Every guide seems to talk about running it on the CPU, when one would expect the opposite. I haven't been able to use my RTX 3060 for this so far.

[–] eu8 1 points 1 year ago

Take my answer with a grain of salt, but I'm pretty sure that if you have a GPU you can just run the same models and they should run more efficiently for you. The main difference is that you can also run some of the larger models.
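
One common way to get GPU offload with these quantized models is llama-cpp-python rather than the GPT4All app itself; a rough sketch, assuming a CUDA-enabled build and a quantized llama.cpp-format model file you already have (the filename and layer count are placeholders):

```python
# pip install llama-cpp-python  (built with CUDA/cuBLAS support)
from llama_cpp import Llama

llm = Llama(
    model_path="model.bin",  # placeholder: path to your quantized model
    n_gpu_layers=32,         # how many layers to offload to the GPU; tune for your VRAM
    n_ctx=2048,              # context window size
)

out = llm("Q: What is self-hosting? A:", max_tokens=128, stop=["Q:"])
print(out["choices"][0]["text"])
```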

[–] Protegee9850 2 points 1 year ago

It's a shame it still only seems to be at the level of davinci-003. I'm super interested in this, but that's just not good enough for most of the things I use GPT-3/4 for today...

[–] [email protected] 1 points 1 year ago

I tried that. GPT4All is a hog; you'll need at least 16 GB of RAM.

[–] [email protected] -2 points 1 year ago (2 children)

I loved this; my only disappointment is that you can't use it as a server that others can connect to and use through the chat interface.

[–] [email protected] 4 points 1 year ago* (last edited 1 year ago) (1 children)

Use this web UI (it's like the Stable Diffusion web UI, but for LLMs):

https://github.com/oobabooga/text-generation-webui

I am pretty sure it has a server option.

Here is a list of the models it likely supports, including GPT4All: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard

The best one I tried is Wizard Vicuna 13B, running on an RTX 2070.
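
If the server option is what you're after, something like this should work once text-generation-webui is running with its API enabled (for example via the --api flag). The endpoint path and payload below follow the project's older blocking API and have changed between versions, so treat them as a guess and check the api-examples in the repo for your version:

```python
# Assumes text-generation-webui was started with its API enabled and is
# listening on the default port; endpoint and payload are assumptions based
# on the older blocking API.
import requests

payload = {
    "prompt": "Write a haiku about self-hosting.",
    "max_new_tokens": 64,
}

resp = requests.post("http://localhost:5000/api/v1/generate", json=payload, timeout=60)
resp.raise_for_status()
print(resp.json()["results"][0]["text"])
```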

[–] [email protected] 1 points 1 year ago

oh hey this is super useful, thanks! :D

[–] [email protected] 1 points 1 year ago

It does have an API server, so you should be able to do just that. I haven't tried it, though.
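
If that API server is enabled in the GPT4All app's settings, a small client sketch might look like this, assuming it exposes an OpenAI-style endpoint on its default local port (the port and model name here are assumptions; check the app's settings panel for the actual values):

```python
# Assumes GPT4All's local API server is enabled and speaks an
# OpenAI-compatible chat completions API; port and model name are assumptions.
import requests

payload = {
    "model": "gpt4all",  # placeholder: use the model name shown in the app
    "messages": [{"role": "user", "content": "Summarize what GPT4All is."}],
    "max_tokens": 128,
}

resp = requests.post("http://localhost:4891/v1/chat/completions", json=payload, timeout=60)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```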
