Mini-Batch Stochastic Gradient Descent

A gradient descent algorithm that uses mini-batches. In other words, mini-batch stochastic gradient descent estimates the gradient based on a small subset of the training data. Regular stochastic gradient descent uses a mini-batch of size 1.

Real-world uses

Created for this library

1.
An ML team uses mini-batch SGD with momentum as the default optimizer for production training pipelines.
2.
A research team uses mini-batch SGD with a cosine learning rate schedule as the baseline optimizer for ResNet-style training.
3.
An ML platform team standardizes on tuned mini-batch SGD recipes per model family so engineers can focus on data and features.

Back to glossary

Mini-Batch Stochastic Gradient Descent

Real-world uses

Related terms

Loading…

Mini-Batch Stochastic Gradient Descent

Real-world uses

Related terms