My Path
Pricing
About
Feedback
← All topics
Training
Gradient Descent & Optimizers
The algorithms that adjust billions of parameters to minimize loss, from SGD to AdamW
16 views
Mark as read