Glossary term
Glossary term
Agentic Systems
A family of algorithms that learn an optimal policy, whose goal is to maximize return when interacting with an environment. For example, the ultimate reward of most games is victory. Reinforcement learning systems can become expert at playing complex games by evaluating sequences of previous game moves that ultimately led to wins and sequences that ultimately led to losses.
Created for this library
A logistics startup uses reinforcement learning to train dispatching policies that outperform hand-tuned heuristics in simulation.
A trading research team uses reinforcement learning to learn execution strategies that minimize slippage on large orders.
An energy company uses reinforcement learning to optimize HVAC control across office buildings while respecting comfort constraints.
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License