Cliff Tokens: Identifying Single-Token Failure Triggers in LLM Mathematical Reasoning

본문 미리보기

arXiv:2606.25524v1 Announce Type: new Abstract: Large language models (LLMs) reach high accuracy in mathematical reasoning, but individual traces on the same problem diverge; some arrive at the correct answer while others fail. Prior work analyzes failure at the step, chunk, or sentence level, or at tokens where failure has already occurred. Neither identifies the precise token that triggers the shift toward failure. We introduce the cliff token, a token where the token-wise potential drops sig

Cliff Tokens: Identifying Single-Token Failure Triggers in LLM Mathematical Reasoning

본문 미리보기

관련 글

The Hitchhiker's Guide to Agentic AI: From Foundations to Systems

Project Auto-World: Towards Automated Benchmarking of Neural Relational Reasoners

Diagnosing and Mitigating Compounding Failures in Agentic Persuasion via Taxonomic Strategy Retrieval

Do vision-language models search like humans? Reasoning tokens as a reaction-time analog in classic visual-search paradigms