Target Network

In Deep Q-learning, a neural network that is a stable approximation of the main neural network, where the main neural network implements either a Q-function or a policy. Then, you can train the main network on the Q-values predicted by the target network. Therefore, you prevent the feedback loop that occurs when the main network trains on Q-values predicted by itself. By avoiding this feedback, training stability increases.

Real-world uses

Created for this library

1.
A research team uses a target network in its DQN training to stabilize learning over many updates.
2.
A robotics RL team uses a target network in its DQN-based manipulation policy to keep value updates stable.
3.
An energy company uses a target network in its DQN-based HVAC agent to prevent oscillations in value estimates during training.

Back to glossary

Target Network

Real-world uses

Related terms

Loading…

Target Network

Real-world uses

Related terms