When Does Learning to Stop Help? A Cost-Aware Study of Early Exits in Reasoning Models

본문 미리보기

arXiv:2606.30852v1 Announce Type: new Abstract: Reasoning models spend different amounts of useful computation across instances, but it remains unclear when a learned stopping rule improves over simple confidence or convergence thresholds. We study this question with LearnStop, a hidden-state-free checkpoint stopper for reasoning language models. At fixed budget checkpoints, LearnStop probes a short answer from the current reasoning prefix and predicts prefix correctness from online features su

When Does Learning to Stop Help? A Cost-Aware Study of Early Exits in Reasoning Models

본문 미리보기

관련 글

When Regulation Has Memory: Hysteresis and Control Burden in Artificial Agency

DDIAgents: Mechanism-Conditioned Context Flow for Drug-Drug Interaction Prediction

Beyond Compilation: Evaluating Faithful Natural-Language-to-Lean Statement Formalization

A Three-Phase Foundation Model for Tax-Aware Personalized Portfolio Management