Glossary term
Glossary term
Infrastructure and Serving
An operation in a TensorFlow graph.
Created for this library
An ML platform team profiles per-node compute time in a TensorFlow graph to find bottlenecks during inference.
A research engineer applies operator fusion across nodes in a TensorFlow graph to reduce serving latency for production models.
An ML platform team uses graph-level optimizations across nodes in TensorFlow to lower compute cost for production models.
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License