Evaluation
Pass@k
Evaluation· Advanced
Definition
A code generation evaluation metric that measures the probability that at least one of k generated solutions to a programming problem passes all test cases. Pass@1 measures one-shot accuracy; Pass@10 or Pass@100 measures the model's success when given multiple attempts.
Enterprise Context
Standard metric for evaluating code generation models in enterprise contexts — relevant for AI-assisted software development, automated testing, and code review tools.
Tags
#code#evaluation#benchmark
MS
Maxx Stacks Editorial
Reviewed by enterprise AI practitioners
Maxx University
Keep learning. Keep building.
250+ terms. 5 learning paths. AI maturity assessment. Jargon translator. All free, always.