AI Dictionary
Data Augmentation
Definition
Techniques used to increase the amount of data by adding slightly modified copies of already existing data.
Deep Dive
Data Augmentation refers to a suite of techniques used to artificially increase the amount of data by creating slightly modified copies of already existing data points. This strategy is particularly vital in machine learning, especially when working with limited datasets, as it helps to prevent overfitting and improve the generalization capability of models. By introducing variations into the training data, augmentation encourages the model to learn more robust features and become less sensitive to minor perturbations, ultimately leading to better performance on unseen data.
Examples & Use Cases
- 1Flipping and rotating images of cats and dogs to create new training examples for an image classifier
- 2Adding random background noise to speech audio samples to make a voice recognition model more robust
- 3Replacing words with their synonyms or paraphrasing sentences in text data to enhance a sentiment analysis model
Related Terms
OverfittingMachine LearningDataset