Codesota · Benchmark · MTEBHome/Leaderboards/MTEB
HuggingFace, cohere.ai, et al.

MTEB.

Massive Text Embedding Benchmark — 56+ tasks across 8 categories (retrieval, semantic textual similarity, classification, clustering, reranking, pair classification, summarization, bitext mining), covering 112 languages. The default index for evaluating text embedding models.

Paper Leaderboard
§ 01 · SOTA history

Year over year.

§ 02 · Leaderboard

Results by metric.

Found a wrong score or missing run?
Use row edits to send a sourced correction into moderation.
Add / edit result Report issue

Mteb Score

Mteb Score is the reported evaluation metric for MTEB. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for Mteb Scoreverifiedpapervendorcommunityunverified
RankModelTrustScoreYearLinksFix
01QZhou-Embeddingunverified75.972025Paper ↗Source ↗Looks wrong?
02Qwen3-Embedding-8Bunverified75.232025Paper ↗Code ↗Source ↗Looks wrong?
03Jasper-Token-Compression-600Munverified74.752025Paper ↗Code ↗Source ↗Looks wrong?
04Qwen3-Embedding-4Bunverified74.612025Paper ↗Code ↗Source ↗Looks wrong?
05LGAI-Embedding-Previewunverified74.122025Paper ↗Source ↗Looks wrong?
06F2LLM-4Bunverified73.672025Paper ↗Code ↗Source ↗Looks wrong?
07gemini-embedding-001unverified73.32025Paper ↗Source ↗Looks wrong?
08F2LLM-v2-14Bunverified73.082026Paper ↗Source ↗Looks wrong?
09F2LLM-v2-8Bunverified72.862026Paper ↗Source ↗Looks wrong?
10F2LLM-v2-4Bunverified72.412026Paper ↗Source ↗Looks wrong?
11F2LLM-1.7Bunverified72.012025Paper ↗Code ↗Source ↗Looks wrong?
12jina-embeddings-v5-omni-smallunverified71.782026Paper ↗Source ↗Looks wrong?
13jina-embeddings-v5-text-smallunverified71.782026Paper ↗Source ↗Looks wrong?
14F2LLM-v2-1.7Bunverified71.632026Paper ↗Source ↗Looks wrong?
15jasper_en_vision_language_v1unverified71.412024Paper ↗Code ↗Source ↗Looks wrong?
16KaLM-embedding-multilingual-mini-instruct-v2.5unverified71.292025Paper ↗Code ↗Source ↗Looks wrong?
17jina-embeddings-v5-omni-nanounverified71.112026Paper ↗Source ↗Looks wrong?
18jina-embeddings-v5-text-nanounverified71.112026Paper ↗Source ↗Looks wrong?
19GTE-Qwen2-7B-instructunverified70.722023Paper ↗Source ↗Looks wrong?
20Qwen3-Embedding-0.6Bunverified70.472025Paper ↗Code ↗Source ↗Looks wrong?
21ICT-TIME-and-Querit-embedding-v1unverified70.122026Paper ↗Source ↗Looks wrong?
22F2LLM-0.6Bunverified70.032025Paper ↗Code ↗Source ↗Looks wrong?
23F2LLM-v2-0.6Bunverified69.972026Paper ↗Source ↗Looks wrong?
24NV-Embed-v2unverified69.812024Paper ↗Source ↗Looks wrong?
25Linq-Embed-Mistralunverified69.82024Paper ↗Looks wrong?
26embeddinggemma-300munverified69.672025Paper ↗Source ↗Looks wrong?
27stella_en_1.5B_v5unverified69.432024Paper ↗Code ↗Source ↗Looks wrong?
28stella_en_400M_v5unverified69.392024Paper ↗Code ↗Source ↗Looks wrong?
29SFR-Embedding-Mistralunverified69.312024Paper ↗Looks wrong?
30F2LLM-v2-330Munverified68.862026Paper ↗Source ↗Looks wrong?
31NV-Embed-v1unverified68.322024Paper ↗Source ↗Looks wrong?
32E5-Mistral-7B-instructunverified67.972023Paper ↗Code ↗Source ↗Looks wrong?
33gte-Qwen2-1.5B-instructunverified67.22023Paper ↗Source ↗Looks wrong?
34GritLM-7Bunverified67.072024Paper ↗Code ↗Source ↗Looks wrong?
35UAE-Large-V1unverified66.42023Paper ↗Code ↗Source ↗Looks wrong?
36mxbai-embed-large-v1unverified66.262023Paper ↗Code ↗Source ↗Looks wrong?
37GIST-large-Embedding-v0unverified66.252024Paper ↗Code ↗Source ↗Looks wrong?
38GritLM-8x7Bunverified66.162024Paper ↗Code ↗Source ↗Looks wrong?

Avg Score

Avg Score is the reported evaluation metric for MTEB. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for Avg Scoreverifiedpapervendorcommunityunverified
RankModelTrustScoreYearLinksFix
01NV-Embed-v2
NV-Embed-v2 ranked #1 on MTEB English (56 tasks) as of Aug 2024. Score from official HF model card and paper.
verified72.312024Paper ↗Looks wrong?
02GTE-Qwen2-7B-instruct
GTE-Qwen2-7B-instruct MTEB English average. Ranked #1 English, #1 Chinese as of Jun 2024.
verified72.052024Source ↗Looks wrong?
03voyage-3-large
voyage-3-large MTEB average. Voyage AI blog Jan 2025 reports top ranking; score from MTEB leaderboard snapshot.
verified70.322025Source ↗Looks wrong?
04E5-Mistral-7B-instruct
E5-Mistral-7B MTEB average (56 tasks) from the original paper Table 1.
verified66.632024Source ↗Looks wrong?
05jina-embeddings-v3
jina-embeddings-v3 MTEB English average from paper Table 3.
verified65.182024Paper ↗Looks wrong?
06text-embedding-3-large
OpenAI text-embedding-3-large MTEB score reported in launch blog and confirmed by multiple benchmark comparisons.
verified64.62024Source ↗Looks wrong?
§ 04 · Submit a result

Add to the leaderboard.

← Back to Leaderboards