Replica

A copy (or part of) of a training set or model, typically stored on another machine. For example, a system could use the following strategy for implementing data parallelism:

Place replicas of an existing model on multiple machines.

Send different subsets of the training set to each replica.

Aggregate the parameter updates.

A replica can also refer to another copy of an inference server. Increasing the number of replicas increases the number of requests that the system can serve simultaneously but also increases serving costs.

Real-world uses

Created for this library

1.
An ML platform team uses multiple model replicas behind a load balancer to scale inference throughput for a high-traffic feature.
2.
A research team trains with synchronous replicas across nodes so gradients are averaged before each update.
3.
An ML platform team provisions extra replicas during peak hours so production inference latency stays within SLA.

Back to glossary

Replica

Real-world uses

Related terms

Loading…

Replica

Real-world uses

Related terms