Online Inference

Generating predictions on demand. For example, suppose an app passes input to a model and issues a request for a prediction. A system using online inference responds to the request by running the model at that moment and returning the prediction to the app.
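The request-and-response flow described above can be sketched in a few lines of Python. This is only an illustrative sketch: the `LinearModel` class, its weights, the `handle_request` function, and the request format are all hypothetical stand-ins rather than part of any particular serving framework. The point is that the model runs at the moment the request arrives.

```python
import math
from dataclasses import dataclass


@dataclass
class LinearModel:
    """Hypothetical stand-in for a trained model (logistic regression)."""
    weights: list[float]
    bias: float

    def predict(self, features: list[float]) -> float:
        # Weighted sum of the inputs, squashed to a probability in (0, 1).
        z = self.bias + sum(w * x for w, x in zip(self.weights, features))
        return 1.0 / (1.0 + math.exp(-z))


# Loaded once at startup; the weights here are made up for illustration.
MODEL = LinearModel(weights=[0.8, -0.5], bias=0.1)


def handle_request(request: dict) -> dict:
    """Online inference: run the model when the request arrives."""
    features = request["features"]
    score = MODEL.predict(features)  # computed on demand, not looked up
    return {"prediction": score}


# The app passes input and receives a prediction in the response.
response = handle_request({"features": [1.0, 2.0]})
print(response)
```

In a real system, `handle_request` would typically sit behind an HTTP or RPC endpoint, and the time spent in `predict` would count directly toward the latency the app sees.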

Contrast with offline inference.

See Production ML systems: Static versus dynamic inference in Machine Learning Crash Course for more information.

Real-world uses

Created for this library

1. A SaaS company runs online inference for its in-product assistant so every keystroke or click receives a fresh prediction.
2. A fraud team runs online inference at authorization time so transactions are scored within milliseconds.
3. A search team runs online inference per query so each result list reflects the latest user context and freshest model.
