this post was submitted on 16 Oct 2023

Llama 2 / WizardLM Megathread (self.fosai)

submitted 1 year ago* (last edited 1 year ago) by Blaed to c/fosai

Llama 2 & WizardLM Megathread

Starting another model megathread to aggregate resources for any newcomers.

It's been a while since I've had a chance to chat with some of these models, so let me know some of your favorites in the comments below.

There are many to choose from - sharing your experience could help someone else decide which to download for their use-case.

Thread Models:

Llama 2 - MetaAI
WizardLM - WizardLM

Quantized Base Llama-2 Chat Models

Unquantized Models

`Llama-2-7b-Chat`

GPTQ

Llama-2-7b-Chat-GPTQ

GGUF

Llama-2-7b-Chat-GGUF

AWQ

Llama-2-7b-Chat-AWQ

`Llama-2-13B-chat`

GPTQ

Llama-2-13B-chat-GPTQ

GGUF

Llama-2-13B-chat-GGUF

AWQ

Llama-2-13B-chat-AWQ

`Llama-2-70B-chat`

GPTQ

Llama-2-70B-chat-GPTQ

GGUF

Llama-2-70B-chat-GGUF

AWQ

Llama-2-70B-chat-AWQ
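
If you're trying to decide between the unquantized and quantized downloads above, a rough size estimate can help. The snippet below is only a back-of-the-envelope sketch: it multiplies the nominal parameter count from each model name by an assumed number of bits per weight. The 4-bit setting is my assumption for a typical GPTQ/AWQ/GGUF build, not something stated on the model cards, and real files will differ because of quantization scales, mixed-precision layers, and metadata. It also ignores the memory needed for the context (KV cache) at runtime.

```python
# Rough weight-only size estimate: parameters * bits per weight / 8 bytes.
# Parameter counts are the nominal sizes taken from the model names (7B, 13B, 70B).
# The 4-bit figure is an assumed, typical quantization setting; actual file
# sizes vary with the quantization scheme and are usually somewhat larger.

NOMINAL_PARAMS = {
    "Llama-2-7b-Chat": 7e9,
    "Llama-2-13B-chat": 13e9,
    "Llama-2-70B-chat": 70e9,
}


def weight_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Return approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    for name, n_params in NOMINAL_PARAMS.items():
        fp16 = weight_size_gb(n_params, 16)  # unquantized half precision
        q4 = weight_size_gb(n_params, 4)     # assumed 4-bit quantization
        print(f"{name}: ~{fp16:.1f} GB at 16-bit, ~{q4:.1f} GB at 4-bit")
```

Treat the numbers as a lower bound on what you need to load the weights, then check the actual file sizes listed on each model page before downloading.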

Quantized WizardLM Models

Unquantized Models

`WizardLM-7B-V1.0+`

GPTQ

GGUF

AWQ

WizardLM-7B-V1.0-Uncensored-AWQ

`WizardLM-13B-V1.0+`

GPTQ

GGUF

AWQ

`WizardLM-30B-V1.0+`

GPTQ

GGUF

AWQ

Llama 2 Resources

Llama 2 is a large language model developed by Meta and is the successor to LLaMA 1. Llama 2 is available for free for research and commercial use through providers like AWS, Hugging Face, and others. Llama 2 pretrained models are trained on 2 trillion tokens and have double the context length of LLaMA 1 (4,096 tokens versus 2,048). Its fine-tuned models have been trained on over 1 million human annotations.

Llama 2 Benchmarks

Llama 2 shows strong improvements over prior LLMs across diverse NLP benchmarks, especially as model size increases. On broad language tests like MMLU and AGIEval, Llama-2-70B scores 68.9% and 54.2% respectively, far above MPT-7B, Falcon-7B, and even the 65B LLaMA 1 model.

Llama 2 Tutorials

Tutorials by James Briggs (also linked above) are quick, hands-on ways for you to experiment with Llama 2 workflows. See also a poor man's guide to fine-tuning Llama 2. Check out Replicate if you want to host Llama 2 with an easy-to-use API.

Did I miss any models? What are some of your favorites? Which family/foundation/fine-tuning should we cover next?
