Glossary term
Infrastructure and Serving
A TensorFlow object responsible for saving model checkpoints.
Created for this library
An ML platform team uses a checkpoint Saver to write model weights every 1,000 steps so training can resume from the latest checkpoint after a failure.
A research team uses a Saver to write checkpoints and adds its own selection logic to keep the three with the best validation metric for later promotion.
An ML engineer uses a Saver to archive model checkpoints across versions for backward-compatibility testing.
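The first two scenarios above (saving every 1,000 steps and keeping the best three checkpoints) can be sketched as a small retention policy. This is a minimal illustration in plain Python, not TensorFlow's API: the names `CheckpointRecord` and `TopKCheckpointTracker`, the `ckpt-<step>` path format, and the metric values are all hypothetical, and the sketch assumes a higher validation metric is better. It only decides when to save and which files to keep; the actual write would be delegated to a Saver.

```python
import heapq
from dataclasses import dataclass, field


@dataclass(order=True)
class CheckpointRecord:
    # Records are ordered by (metric, step); the path is excluded from comparison.
    metric: float  # validation metric; higher is assumed to be better
    step: int
    path: str = field(compare=False)


class TopKCheckpointTracker:
    """Decides when to save and which checkpoints to retain (hypothetical helper).

    It does not write or delete files itself.
    """

    def __init__(self, k=3, save_every=1000):
        self.k = k
        self.save_every = save_every
        self._heap = []  # min-heap: the worst retained checkpoint is at index 0

    def should_save(self, step):
        # Save on every multiple of save_every, skipping step 0.
        return step > 0 and step % self.save_every == 0

    def register(self, step, metric, path):
        """Record a newly written checkpoint and return paths that can be deleted."""
        heapq.heappush(self._heap, CheckpointRecord(metric, step, path))
        evicted = []
        while len(self._heap) > self.k:
            evicted.append(heapq.heappop(self._heap).path)
        return evicted

    def best(self):
        # Retained checkpoints, best metric first.
        return sorted(self._heap, reverse=True)


# Made-up validation metrics, keyed by training step.
metrics = {1000: 0.71, 2000: 0.78, 3000: 0.74, 4000: 0.80, 5000: 0.69}

tracker = TopKCheckpointTracker(k=3, save_every=1000)
deleted = []
for step in range(1, 5001):
    if tracker.should_save(step):
        path = f"ckpt-{step}"  # a real loop would call the Saver here
        deleted += tracker.register(step, metrics[step], path)

kept = [record.path for record in tracker.best()]
print(kept)     # ['ckpt-4000', 'ckpt-2000', 'ckpt-3000']
print(deleted)  # ['ckpt-1000', 'ckpt-5000']
```

A min-heap keeps the weakest retained checkpoint at the front, so each new checkpoint costs one push and at most one pop. When two checkpoints have the same metric, the one from the earlier step is evicted first.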
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License