Pipelining

A form of model parallelism in which a model's processing is divided into consecutive stages and each stage is executed on a different device. While a stage is processing one batch, the preceding stage can work on the next batch.

Real-world uses

Created for this library

1.
An ML platform team uses pipelining across stages of training so different micro-batches overlap on different parts of the model.
2.
A research lab uses pipelining when training very large language models so different layers process different micro-batches simultaneously.
3.
An ML engineer uses pipelining to keep accelerators busy while data loading and preprocessing happen on the host.

Back to glossary

Pipelining

Real-world uses

Related terms

Loading…

Pipelining

Real-world uses

Related terms