Codesota · Papers2026-02-19
Paper

Arcee Trinity Large Technical Report

arXiv ↗Code ↗
§ 01 · Benchmark results

7 results reproduced from this paper.

View:
Sorted instantly in-page
Results
7
SOTA rows
0
Models
2
Datasets
0
#ModelVendorBenchmarkValueSOTADateSource
01Trinity Large Base (5-shot)HellaSwag90.1%source ↗
02Trinity Large PreviewArcee AIMMLU87.2%source ↗
03Trinity Large Base (5-shot)WinoGrande80.8%source ↗
04Trinity Large PreviewArcee AIMMLU-Pro75.3%source ↗
05Trinity Large PreviewArcee AIGPQA Diamond63.3%source ↗
06Trinity Large PreviewArcee AIAIME 202524.4%source ↗
07Trinity Large PreviewArcee AISimpleQA23.9%source ↗
Benchmark trail
§ 02 · Models

2 models from this paper.

evaluates
Trinity Large Preview
Arcee AI
evaluates
Trinity Large Base (5-shot)
Read next

Three places to go from here.

Index
All papers
All tracked papers in the registry, with benchmark result, model, and leaderboard linkage where available.
Replacement
Papers with Code is dead — alternatives
What replaced PWC for each use case: LLMs, OCR, speech, vision, robotics.
Top hub
LLM benchmarks
Every frontier LLM benchmark, scored.