Offline Inference

The process of a model generating a batch of predictions and then caching (saving) those predictions. Apps can then access the inferred prediction from the cache rather than rerunning the model.

For example, consider a model that generates local weather forecasts (predictions) once every four hours. After each model run, the system caches all the local weather forecasts. Weather apps retrieve the forecasts from the cache.

Offline inference is also called static inference.

Contrast with online inference. See Production ML systems: Static versus dynamic inference in Machine Learning Crash Course for more information.

Examples

1.
For example, consider a model that generates local weather forecasts (predictions) once every four hours. After each model run, the system caches all the local weather forecasts. Weather apps retrieve the forecasts from the cache.
2.
Offline inference is also called static inference.
3.
Contrast with online inference. See Production ML systems: Static versus dynamic inference in Machine Learning Crash Course for more information.

Real-world uses

Created for this library

1.
A retail recommendation team runs offline inference nightly to precompute the next-day homepage carousel for every active user.
2.
An insurance company runs offline inference each quarter to re-score the entire book of business with the latest risk model.
3.
A subscription business runs offline inference weekly to refresh churn-risk scores feeding the retention call list.

Back to glossary

The process of a model generating a batch of predictions and then caching (saving) those predictions. Apps can then access the inferred prediction from the cache rather than rerunning the model.

Offline inference is also called static inference.

Contrast with online inference. See Production ML systems: Static versus dynamic inference in Machine Learning Crash Course for more information.

Examples

1.
For example, consider a model that generates local weather forecasts (predictions) once every four hours. After each model run, the system caches all the local weather forecasts. Weather apps retrieve the forecasts from the cache.
2.
Offline inference is also called static inference.
3.
Contrast with online inference. See Production ML systems: Static versus dynamic inference in Machine Learning Crash Course for more information.

Real-world uses

Created for this library

1.
A retail recommendation team runs offline inference nightly to precompute the next-day homepage carousel for every active user.
2.
An insurance company runs offline inference each quarter to re-score the entire book of business with the latest risk model.
3.
A subscription business runs offline inference weekly to refresh churn-risk scores feeding the retention call list.

Back to glossary

Offline Inference

Examples

Real-world uses

Related terms

Loading…

Offline Inference

Examples

Real-world uses

Related terms