MMLU
Massive Multitask Language Understanding
Evaluation · Advanced
Definition
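A benchmark that tests LLM knowledge across 57 subjects, spanning STEM, the humanities, the social sciences, and professional fields such as law and medicine, with questions ranging from elementary to advanced professional difficulty. Each question is multiple-choice with four options, and results are reported as accuracy. It is one of the most widely cited benchmarks for evaluating the general knowledge and reasoning capabilities of LLMs.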
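Because MMLU questions are four-option multiple choice, a score comes down to counting correct answer letters. The sketch below shows one way to compute per-subject accuracy plus two overall averages. The function name `mmlu_style_accuracy`, the dict keys `subject` and `answer`, and the toy items are all assumptions made for illustration; they are not the format of any particular evaluation harness, and the toy answers are not real benchmark data.

```python
from collections import defaultdict


def mmlu_style_accuracy(items, predictions):
    """Return (per_subject, macro_avg, micro_avg) accuracy.

    items: list of dicts with hypothetical keys "subject" and "answer" ("A".."D").
    predictions: list of predicted answer letters, aligned with items.
    """
    if not items:
        raise ValueError("items must not be empty")
    if len(items) != len(predictions):
        raise ValueError("items and predictions must have the same length")

    correct = defaultdict(int)
    total = defaultdict(int)
    for item, pred in zip(items, predictions):
        total[item["subject"]] += 1
        if pred == item["answer"]:
            correct[item["subject"]] += 1

    # Accuracy within each subject.
    per_subject = {s: correct[s] / total[s] for s in total}
    # Macro average: every subject counts equally.
    macro = sum(per_subject.values()) / len(per_subject)
    # Micro average: every question counts equally.
    micro = sum(correct.values()) / sum(total.values())
    return per_subject, macro, micro


# Toy data for illustration only.
items = [
    {"subject": "college_physics", "answer": "B"},
    {"subject": "college_physics", "answer": "D"},
    {"subject": "professional_law", "answer": "A"},
    {"subject": "professional_law", "answer": "C"},
    {"subject": "professional_law", "answer": "C"},
    {"subject": "professional_law", "answer": "B"},
]
predictions = ["B", "A", "A", "C", "C", "D"]

per_subject, macro, micro = mmlu_style_accuracy(items, predictions)
print(per_subject, macro, micro)
```

The two averages can differ when subjects have different numbers of questions, so when comparing published scores it is worth checking which aggregation a given harness uses.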
Tags
#benchmark #LLM #knowledge #reasoning #testing
Maxx Stacks Editorial
Reviewed by enterprise AI practitioners