Codesota · Models · openai/gpt-oss-120b (API)openai5 results · 1 benchmarks
Model card

openai/gpt-oss-120b (API).

openaiopen-source120B params
§ 01 · Benchmarks

Every benchmark openai/gpt-oss-120b (API) has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01CPTU-BenchNatural Language Processing · Polish Text Understandingtricky-questions3.9%#11/93source ↗
02CPTU-BenchNatural Language Processing · Polish Text Understandinglanguage-understanding4.0%#17/93source ↗
03CPTU-BenchNatural Language Processing · Polish Text Understandingaverage3.8%#20/93source ↗
04CPTU-BenchNatural Language Processing · Polish Text Understandingphraseology3.5%#28/93source ↗
05CPTU-BenchNatural Language Processing · Polish Text Understandingsentiment3.9%#32/93source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where openai/gpt-oss-120b (API) actually performs.

Natural Language Processing
1
benchmark
avg rank #21.6
§ 05 · Sources & freshness

Where these numbers come from.

SpeakLeash/CPTU-Bench
5
results
5 of 5 rows marked verified.