My Path
Pricing
About
Feedback
← All topics
Training
Knowledge Distillation
Training small models to mimic large ones, compressing capability into a deployable size
77 views
Mark as read