Agentic AI Library

Curated Open Source Library

Training and Fine-Tuning

Optimizer

A specific implementation of the gradient descent algorithm. Popular optimizers include:

AdaGrad, which stands for ADAptive GRADient descent.

Adam, whose name derives from ADAptive Moment estimation.
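To make the difference between these optimizers concrete, here is a minimal sketch of the textbook update rules for plain gradient descent, AdaGrad, and Adam, applied to a toy objective f(x) = x**2. The function names, the starting point, and the learning rates are illustrative assumptions chosen for this example, not part of the definition above or of any particular library.

```python
import math

def grad(x):
    # Gradient of the toy objective f(x) = x**2 (an assumption for this sketch).
    return 2.0 * x

def sgd_step(x, g, lr=0.1):
    # Plain gradient descent: a fixed step along the negative gradient.
    return x - lr * g

def adagrad_step(x, g, state, lr=0.5, eps=1e-8):
    # AdaGrad: accumulate squared gradients, so the effective step
    # size shrinks as more gradient signal has been seen.
    state["sum_sq"] += g * g
    return x - lr * g / (math.sqrt(state["sum_sq"]) + eps)

def adam_step(x, g, state, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8):
    # Adam: exponential moving averages of the gradient (first moment)
    # and the squared gradient (second moment), with bias correction.
    state["t"] += 1
    state["m"] = beta1 * state["m"] + (1 - beta1) * g
    state["v"] = beta2 * state["v"] + (1 - beta2) * g * g
    m_hat = state["m"] / (1 - beta1 ** state["t"])
    v_hat = state["v"] / (1 - beta2 ** state["t"])
    return x - lr * m_hat / (math.sqrt(v_hat) + eps)

def minimize(step_fn, x0=5.0, steps=200, state=None):
    # Run a fixed number of update steps from the starting point x0.
    x = x0
    for _ in range(steps):
        g = grad(x)
        x = step_fn(x, g) if state is None else step_fn(x, g, state)
    return x

x_sgd = minimize(sgd_step)
x_adagrad = minimize(adagrad_step, state={"sum_sq": 0.0})
x_adam = minimize(adam_step, state={"t": 0, "m": 0.0, "v": 0.0})
```

All three runs move x from 5.0 toward the minimum at 0. The only thing that changes between them is how the raw gradient is turned into a step, which is the sense in which each optimizer is a specific implementation of gradient descent.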

Real-world uses

Created for this library

1. An ML team uses Adam as the default optimizer for production training pipelines because it is robust across hyperparameter ranges.
2. A research team experiments with Adafactor as the optimizer for large language model training to save optimizer-state memory.
3. An ML platform team standardizes optimizer choice per model family so engineers can focus on data and features.
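The memory saving mentioned in the second use can be sketched with simple arithmetic. Adam keeps two state values per parameter (first and second moment estimates), while Adafactor's factored second moment keeps one statistic per row and one per column of a weight matrix. The 4096 x 4096 matrix shape and the helper names below are hypothetical, and the sketch ignores any first-moment state, which depends on how Adafactor is configured.

```python
def adam_state_elements(rows, cols):
    # Adam: first-moment and second-moment estimates, one of each per parameter.
    return 2 * rows * cols

def factored_second_moment_elements(rows, cols):
    # Factored second moment: one statistic per row plus one per column.
    return rows + cols

# Hypothetical weight matrix shape, chosen only for illustration.
rows, cols = 4096, 4096
adam_count = adam_state_elements(rows, cols)
factored_count = factored_second_moment_elements(rows, cols)
```

For this shape, the factored second-moment state grows with rows + cols rather than rows * cols, which is where the saving comes from on large matrices.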

Related terms

AdaGrad, Gradient Descent

Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License

Agentic AI Library · Open Source · Last Reviewed 2026-06-07

All Rights Reserved @2026 Georgi Naydenov