AI Dictionary
Labeling
Definition
Identifying raw data (images, text files, videos, etc.) and adding one or more meaningful and informative labels.
Deep Dive
Labeling, in the context of machine learning, is the fundamental process of identifying raw data (such as images, text files, audio recordings, or videos) and adding one or more meaningful, informative tags or annotations to it. This process creates the "ground truth" that machine learning models, particularly those based on supervised learning, use to learn patterns and make accurate predictions. Without properly labeled data, many AI systems would be unable to learn to perform their intended tasks.
Examples & Use Cases
- 1Drawing bounding boxes around objects (e.g., cars, pedestrians) in images for autonomous driving datasets
- 2Transcribing spoken words in audio files into text for speech recognition models
- 3Categorizing customer support emails by issue type (e.g., "billing," "technical support") for a natural language processing model
Related Terms
Data AnnotationSupervised LearningGround Truth