Glossary term
Infrastructure and Serving
A TensorFlow programming environment in which the program first constructs a graph and then executes all or part of that graph. Graph execution is the default execution mode in TensorFlow 1.x.
Contrast with eager execution.
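The two phases in this definition (construct a graph first, then execute all or part of it) can be sketched in plain Python. This is a toy illustration only and does not use TensorFlow's API; the names Node, constant, add, mul, and run are hypothetical helpers invented for this example.

```python
# Toy illustration of deferred (graph-style) execution.
# Plain Python only; this is NOT TensorFlow's API, and every name here is hypothetical.

class Node:
    """One operation in a deferred computation graph."""
    def __init__(self, op, inputs=(), value=None):
        self.op = op                  # "const", "add", or "mul"
        self.inputs = tuple(inputs)   # upstream nodes this op depends on
        self.value = value            # payload, used by "const" nodes only

def constant(value):
    return Node("const", value=value)

def add(a, b):
    return Node("add", (a, b))

def mul(a, b):
    return Node("mul", (a, b))

def run(node, trace=None):
    """Execute only the subgraph that `node` depends on."""
    if trace is not None:
        trace.append(node.op)         # record which ops actually ran
    if node.op == "const":
        return node.value
    args = [run(i, trace) for i in node.inputs]
    if node.op == "add":
        return args[0] + args[1]
    if node.op == "mul":
        return args[0] * args[1]
    raise ValueError(f"unknown op: {node.op}")

# Phase 1: construct the graph. No arithmetic happens here.
x = constant(3)
y = constant(4)
total = add(x, y)
scaled = mul(total, constant(10))

# Phase 2: execute all or part of the graph.
partial_result = run(total)    # evaluates only the `add` subgraph
full_result = run(scaled)      # evaluates the whole graph
```

In an eager style, each line of arithmetic would return a value as soon as it ran. Here, building total and scaled only records structure, and run decides how much of that structure to evaluate.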
Created for this library
An ML platform team uses graph execution for production model serving because a graph that is built once can be optimized as a whole and typically runs faster than operation-by-operation eager calls.
A speech recognition team converts trained models to a graph execution form before deployment to maximize throughput on accelerator chips.
An ML platform team profiles models in graph execution mode to identify operators that can be fused, which lowers latency in production.
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License