Learn Machine Learning

530 readers

1 users here now

Welcome! This is a place for people to learn more about machine learning techniques, discuss applications and ask questions.

Example questions:

"Should I use a deep neural network for my audio classification task?"
"I'm working with a small dataset, what can I do to make my model generalize well?"
"Is there a library available that implements function X in language Y?"
"I want to learn more about the math behind machine learning technique A, where should I start?"

Please do:

Be kind to new people
Post guides and tutorials that you find helpful
Link to open/free sources instead of paywalled when possible

Please don't:

Post news articles / memes (there are other machine learning/AI communities for this)

Other communities in this area:

Similar subreddits: r/MLquestions, r/askmachinelearning, r/learnmachinelearning

founded 2 years ago

MODERATORS

[email protected]

OpenChat_8192 - The first model to beat 100% of ChatGPT-3.5 (lemmy.intai.tech)

submitted 2 years ago by [email protected] to c/[email protected]

6 comments fedilink hide all child comments

cross-posted from: https://lemmy.intai.tech/post/40699

Models

opnechat

openchat_8192

opencoderplus

Datasets

openchat_sharegpt4_dataset

Repos

openchat

Related Papers

LIMA Less is More For Alignment

ORCA

Credit:

Tweet

Archive:

@Yampeleg The first model to beat 100% of ChatGPT-3.5 Available on Huggingface

🔥 OpenChat_8192

🔥 105.7% of ChatGPT (Vicuna GPT-4 Benchmark)

Less than a month ago the world witnessed as ORCA [1] became the first model to ever outpace ChatGPT on Vicuna's benchmark.

Today, the race to replicate these results open-source comes to an end.

Minutes ago OpenChat scored 105.7% of ChatGPT.

But wait! There is more!

Not only OpenChat beated Vicuna's benchmark, it did so pulling off a LIMA [2] move!

Training was done using 6K GPT-4 conversations out of the ~90K ShareGPT conversations.

The model comes in three versions: the basic OpenChat model, OpenChat-8192 and OpenCoderPlus (Code generation: 102.5% ChatGPT)

This is a significant achievement considering that it's the first (released) open-source model to surpass the Vicuna benchmark. 🎉🎉

OpenChat: https://huggingface.co/openchat/openchat

OpenChat_8192: https://huggingface.co/openchat/openchat_8192 (best chat)

OpenCoderPlus: https://huggingface.co/openchat/opencoderplus (best coder)

Dataset: https://huggingface.co/datasets/openchat/openchat_sharegpt4_dataset

Code: https://github.com/imoneoi/openchat

Congratulations to the authors!!

[1] - Orca: The first model to cross 100% of ChatGPT: https://arxiv.org/pdf/2306.02707.pdf [2] - LIMA: Less Is More for Alignment - TL;DR: Using small number of VERY high quality samples (1000 in the paper) can be as powerful as much larger datasets: https://arxiv.org/pdf/2305.11206

no comments (yet)

sorted by: hot top controversial new old

there doesn't seem to be anything here