SPADE-Bench: Evaluating Spontaneous Strategic Deception in Agents via Plan-Action Divergence

arXiv:2606.02380Submitted Jun 2, 20260 benchmark results

Yuyan Bu, Haowei Li, Qirui Zheng, Bowen Dong, Kaiyue Yang, Jiaming Ji, Yingshui Tan, Wenxin Li, Yaodong Yang, Juntao Dai

View PDF ↗arXiv page ↗Edit

Abstract

First benchmark to isolate agent deception (plan-action divergence under pressure) from hallucination; reveals that deception is a genuine and pressing issue in tool-use contexts.

Tasks

edit

• Agentic AI

Results

No benchmark results recorded yet.

submit

Benchmark results referencing this paper haven't been added to the registry yet. If you have a reproduction, submit it →

CodeSOTA extraction

Benchmark evidence

edit

SPADE-Bench: Leakage rate and H-score across models (extract from main results).

Add or update benchmark results

Logged-in editor · benchmark trail

Three places to go from here.

Index

All papers

All tracked papers in the registry, with benchmark result, model, and leaderboard linkage where available.

Replacement

Papers with Code is dead — alternatives

What replaced PWC for each use case: LLMs, OCR, speech, vision, robotics.

Top hub

Agentic AI

Every benchmark in Agentic AI.