Benchmark Dataset
Data & Datasets · Intermediate
Definition
A standardized dataset used to evaluate and compare AI model performance across the research community. Examples include ImageNet (computer vision), GLUE/SuperGLUE (NLP), MMLU (general knowledge), and HumanEval (code). Because every model is scored on the same data, benchmark results are a primary signal of model capability.
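To make the idea concrete, here is a minimal sketch of scoring two models on the same fixed set of examples. Everything in it is hypothetical: the four-item `BENCHMARK`, the lookup-table stand-ins `model_a` and `model_b`, and the choice of exact-match accuracy as the metric are illustrative assumptions, not part of any real benchmark.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical toy benchmark: a fixed list of (question, reference answer) pairs.
BENCHMARK: List[Tuple[str, str]] = [
    ("2 + 2", "4"),
    ("capital of France", "Paris"),
    ("opposite of hot", "cold"),
    ("3 * 3", "9"),
]


def model_a(question: str) -> str:
    """Hypothetical stand-in for a model: a lookup table with one wrong answer."""
    answers = {
        "2 + 2": "4",
        "capital of France": "Paris",
        "opposite of hot": "cold",
        "3 * 3": "6",
    }
    return answers.get(question, "")


def model_b(question: str) -> str:
    """Hypothetical stand-in for a model: a lookup table with two wrong answers."""
    answers = {
        "2 + 2": "4",
        "capital of France": "Lyon",
        "opposite of hot": "warm",
        "3 * 3": "9",
    }
    return answers.get(question, "")


def accuracy(model: Callable[[str], str], benchmark: List[Tuple[str, str]]) -> float:
    """Fraction of benchmark items where the model output exactly matches the reference."""
    correct = sum(1 for question, ref in benchmark if model(question).strip() == ref)
    return correct / len(benchmark)


def leaderboard(
    models: Dict[str, Callable[[str], str]], benchmark: List[Tuple[str, str]]
) -> List[Tuple[str, float]]:
    """Score every model on the same data and rank from best to worst."""
    scores = [(name, accuracy(fn, benchmark)) for name, fn in models.items()]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for name, score in leaderboard({"model_a": model_a, "model_b": model_b}, BENCHMARK):
        print(f"{name}: {score:.2f}")
```

The scores are comparable only because both models see identical items and are judged by the same rule. Real benchmarks define their own metrics; exact-match accuracy is used here only to keep the sketch short.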
Tags
#evaluation #comparison #research #MMLU #ImageNet
Maxx Stacks Editorial
Reviewed by enterprise AI practitioners