Maxx StacksUniversityWikiMMLU
Evaluation

MMLU

Massive Multitask Language Understanding
Evaluation· Advanced

Definition

A comprehensive benchmark testing LLM knowledge across 57 subjects — from STEM and humanities to law and medicine — at various difficulty levels. One of the most widely cited benchmarks for evaluating LLM general knowledge and reasoning capabilities.

Tags

#benchmark#LLM#knowledge#reasoning#testing
MS
Maxx Stacks Editorial
Reviewed by enterprise AI practitioners
Maxx University

Keep learning. Keep building.

250+ terms. 5 learning paths. AI maturity assessment. Jargon translator. All free, always.

    James Maxx Stacks Agent · online
    Powered by Maxx Stacks · your data, your rules