this post was submitted on 09 Jun 2024
5 points (72.7% liked)
Machine Learning
1793 readers
11 users here now
founded 4 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
You can generate synthetic data matching the distribution your transformer learned. You can use this dataset to train another model. As of now, that's about it.
Yep, this is called model distillation.