AI Dictionary
Test Set
Definition
A set of data used to provide an unbiased evaluation of a final model fit on the training dataset.
Deep Dive
In machine learning, a test set is a distinct subset of a larger dataset, held out and used solely to provide an unbiased evaluation of a model's final performance after it has been fully trained and optimized. Its primary purpose is to assess the model's generalization capability: how well it performs on new, unseen data of the kind it will meet in real-world use. Crucially, the test set is never used during training or hyperparameter tuning; tuning is done against a separate validation set. If test data influences any modeling decision, information leaks into the model (data leakage) and the measured performance becomes optimistically biased rather than an honest, objective estimate of the model's efficacy.
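The separation described above can be sketched in a few lines of plain Python. This is a minimal illustration, not a prescribed method: the function name `split_dataset`, the 70/15/15 proportions, and the fixed seed are all assumptions chosen for the example.

```python
import random


def split_dataset(examples, val_fraction=0.15, test_fraction=0.15, seed=0):
    """Shuffle once, then carve out disjoint train, validation, and test subsets.

    The fractions and seed are illustrative assumptions, not recommended values.
    """
    if not 0 <= val_fraction + test_fraction < 1:
        raise ValueError("validation and test fractions must sum to less than 1")

    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)  # seeded so the split is reproducible

    n_test = round(len(shuffled) * test_fraction)
    n_val = round(len(shuffled) * val_fraction)

    test = shuffled[:n_test]                 # touched only once, for the final evaluation
    val = shuffled[n_test:n_test + n_val]    # used for hyperparameter tuning
    train = shuffled[n_test + n_val:]        # used to fit the model
    return train, val, test


# Hypothetical dataset of 100 example IDs
train, val, test = split_dataset(range(100))
print(len(train), len(val), len(test))
```

Because the three subsets are slices of one shuffled list, no example can appear in more than one of them, which is the property that keeps the test-set score honest.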
Examples & Use Cases
- Evaluating an image classification model's accuracy on 10,000 never-before-seen images of cats and dogs
- Assessing a fraud detection algorithm's precision and recall on a month's worth of new transaction data
- Measuring a natural language processing model's ability to correctly classify sentiment in new customer reviews
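As a rough sketch of the fraud detection case, precision and recall can be computed from a model's predictions on the test set as shown here. The labels and predictions are made-up values for illustration (1 = fraud, 0 = legitimate), and `precision_recall` is a hypothetical helper, not part of any particular library.

```python
def precision_recall(y_true, y_pred, positive=1):
    """Compute precision and recall for the positive class from paired labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)  # correctly flagged
    fp = sum(1 for t, p in pairs if t != positive and p == positive)  # false alarms
    fn = sum(1 for t, p in pairs if t == positive and p != positive)  # missed positives

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Made-up test-set labels and model predictions (1 = fraud, 0 = legitimate)
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

precision, recall = precision_recall(y_true, y_pred)
print(precision, recall)  # 0.75 0.75
```

Here three of the four flagged transactions are truly fraudulent (precision 0.75) and three of the four fraudulent transactions are caught (recall 0.75).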
Related Terms
Training Data, Validation Set, Overfitting