Glossary term
Glossary term
Agentic Systems
In reinforcement learning, a DQN technique used to reduce temporal correlations in training data. The agent stores state transitions in a replay buffer, and then samples transitions from the replay buffer to create training data.
Created for this library
A robotics RL team uses experience replay to break correlations in training data when training a manipulation policy on a small physical fleet.
An ad-bidding RL team uses experience replay to reuse logged interactions efficiently during off-policy training.
A trading research team uses experience replay to reuse historical market trajectories when training its execution policy.
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License