Test Set

A subset of the dataset reserved for testing a trained model.

Traditionally, you divide examples in the dataset into the following three distinct subsets:

a test set

Each example in a dataset should belong to only one of the preceding subsets. For instance, a single example shouldn't belong to both the training set and the test set.

The training set and validation set are both closely tied to training a model. Because the test set is only indirectly associated with training, test loss is a less biased, higher quality metric than training loss or validation loss.

See Datasets: Dividing the original dataset in Machine Learning Crash Course for more information.

Real-world uses

Created for this library

1.
An ML platform team requires a frozen test set per model release so model quality is comparable across versions.
2.
A medical AI team curates a test set with edge cases that clinicians find most informative for safety review.
3.
A research team holds out a clean test set per benchmark so model performance numbers stay comparable across runs.

Back to glossary

A subset of the dataset reserved for testing a trained model.

Traditionally, you divide examples in the dataset into the following three distinct subsets:

a training set

a validation set

a test set

Each example in a dataset should belong to only one of the preceding subsets. For instance, a single example shouldn't belong to both the training set and the test set.

See Datasets: Dividing the original dataset in Machine Learning Crash Course for more information.

Real-world uses

Created for this library

1.
An ML platform team requires a frozen test set per model release so model quality is comparable across versions.
2.
A medical AI team curates a test set with edge cases that clinicians find most informative for safety review.
3.
A research team holds out a clean test set per benchmark so model performance numbers stay comparable across runs.

Back to glossary

Test Set

Real-world uses

Related terms

Loading…

Test Set

Real-world uses

Related terms