AI Dictionary

Test Set

Definition

A set of data used to provide an unbiased evaluation of a final model fit on the training dataset.

Deep Dive

In machine learning, a test set is a distinct subset of a larger dataset, used solely to provide an unbiased evaluation of a model's final performance after it has been fully trained and optimized. Its primary purpose is to assess the model's generalization capability – how well it performs on new, unseen data, simulating real-world scenarios. Crucially, the test set is never used during the model's training phase or for hyperparameter tuning, preventing data leakage and ensuring an honest and objective measure of the model's efficacy.

Examples & Use Cases

1Evaluating an image classification model's accuracy on 10,000 never-before-seen images of cats and dogs
2Assessing a fraud detection algorithm's precision and recall on a month's worth of new transaction data
3Measuring a natural language processing model's ability to correctly classify sentiment in new customer reviews

Related Terms

Training DataValidation SetOverfitting