Q-Learning

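In reinforcement learning, a model-free algorithm that lets an agent learn the optimal Q-function of a Markov decision process, the model of the agent's environment, by repeatedly applying an update derived from the Bellman equation to the transitions it observes. The Q-function gives the expected long-run return of taking each action in each state.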
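
As a rough illustration, the standard tabular update is Q(s, a) ← Q(s, a) + α [r + γ max over a' of Q(s', a') − Q(s, a)], where α is a learning rate and γ a discount factor. The sketch below applies it to a made-up five-state chain. The environment, the function name q_learning, and all parameter values are assumptions chosen for illustration, not part of this glossary or any particular library.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
               epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain (illustrative only).

    States are 0..n_states-1. Action 0 moves left (staying put at state 0),
    action 1 moves right. Entering the last state pays reward 1 and ends
    the episode; every other step pays 0.
    """
    rng = random.Random(seed)
    goal = n_states - 1
    # One row per state, one column per action. The goal row is never
    # updated, so it stays at 0 and contributes nothing to the target.
    q = [[0.0, 0.0] for _ in range(n_states)]

    for _ in range(episodes):
        s = 0
        while s != goal:
            # Epsilon-greedy behaviour policy with random tie-breaking.
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                best = max(q[s])
                a = rng.choice([act for act in (0, 1) if q[s][act] == best])

            # Deterministic chain dynamics (an assumption of this sketch).
            s_next = max(s - 1, 0) if a == 0 else s + 1
            r = 1.0 if s_next == goal else 0.0

            # Q-learning update: move Q(s, a) toward the Bellman target
            # r + gamma * max_a' Q(s', a').
            target = r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
    return q
```

Calling q = q_learning() and taking the higher-valued action in each row of q yields the greedy policy. On this chain the optimal policy is to move right in every non-terminal state.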

Real-world uses

Created for this library

1. A logistics RL team uses Q-learning to train a dispatching policy that outperforms a hand-tuned heuristic in simulation.
2. A trading research team uses Q-learning to learn execution strategies in a market simulator before live trials.
3. An ad-bidding team uses Q-learning to learn the expected long-run revenue of each bid amount in real-time auctions.
