this post was submitted on 11 Jun 2023
Machine Learning
I second that. Being able to test medium-sized models locally can make debugging much easier.
I have a 3070 with 8GB of VRAM, which can train e.g. a GPT-2 with a batch size of 1 in full precision.
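For a rough sense of why that fits, here is a back-of-the-envelope sketch of the static memory that full-precision training needs. It rests on assumptions that are mine, not stated above: an Adam-style optimizer keeping two FP32 state tensors per parameter, and approximate, commonly cited GPT-2 parameter counts. The names `GPT2_PARAMS` and `static_training_memory_gib` are made up for illustration. Activations and framework overhead are left out, so real usage will be higher.

```python
# Rough estimate of *static* training memory: weights + gradients + optimizer state.
# Assumptions (not from the comment above):
#   - FP32 everywhere (4 bytes per value)
#   - Adam-style optimizer with two state tensors per parameter
#   - approximate GPT-2 parameter counts
# Activations, CUDA context and framework overhead are ignored.

# Approximate parameter counts per GPT-2 variant (assumed, rounded).
GPT2_PARAMS = {
    "small": 124e6,
    "medium": 355e6,
    "large": 774e6,
    "xl": 1558e6,
}


def static_training_memory_gib(n_params, bytes_per_value=4, optimizer_states=2):
    """Return GiB needed for weights, gradients and optimizer state."""
    # One copy for the weights, one for the gradients, plus the optimizer tensors.
    copies = 2 + optimizer_states
    return n_params * bytes_per_value * copies / 2**30


if __name__ == "__main__":
    for name, n_params in GPT2_PARAMS.items():
        print(f"GPT-2 {name}: ~{static_training_memory_gib(n_params):.1f} GiB before activations")
```

Under these assumptions the smaller variants stay below 8 GiB before activations are counted and the larger ones do not, which would also explain why the batch size has to stay at 1.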