Glossary term
Glossary term
Infrastructure and Serving
The algorithm by which variables are divided across parameter servers.
Created for this library
A research engineer chooses a partitioning strategy that shards attention layers across devices while keeping embeddings replicated.
An ML platform team configures a partitioning strategy across a 2D device mesh to combine model and data parallelism for very large training jobs.
An ML platform team encodes its standard partitioning strategy as reusable configuration so research teams reuse a known-good setup.
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License