AI · July 1, 2026
When Does Learning to Stop Help? A Cost-Aware Study of Early Exits in Reasoning Models
Source: arXiv cs.AI
Preview
arXiv:2606.30852v1 Announce Type: new
Abstract: Reasoning models spend different amounts of useful computation across instances, but it remains unclear when a learned stopping rule improves over simple confidence or convergence thresholds. We study this question with LearnStop, a hidden-state-free checkpoint stopper for reasoning language models. At fixed budget checkpoints, LearnStop probes a short answer from the current reasoning prefix and predicts prefix correctness from online features su
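
The checkpoint-stopping setup the abstract describes can be illustrated with a minimal sketch. This is not the paper's implementation: `run_with_stopper`, `convergence_rule`, `learned_rule`, and the `predict_correct` callable are hypothetical names introduced here, and the preview does not say which online features LearnStop uses, so the learned predictor is left as an opaque function. The sketch only contrasts a simple convergence threshold (stop once the probed answer repeats) with a rule that stops once a predicted correctness score crosses a threshold.

```python
from typing import Callable, List, Optional, Sequence, Tuple

# A stop rule sees the answers probed so far and decides whether to stop.
StopRule = Callable[[List[str]], bool]


def run_with_stopper(
    probe_answer: Callable[[int], str],
    should_stop: StopRule,
    checkpoints: Sequence[int],
) -> Tuple[Optional[str], int]:
    """Probe a short answer at each budget checkpoint; stop when the rule fires.

    Returns (final answer, budget spent). If the rule never fires, the
    full budget (the last checkpoint) is spent.
    """
    answers: List[str] = []
    for budget in checkpoints:
        answers.append(probe_answer(budget))  # hypothetical probe of the prefix
        if should_stop(answers):
            return answers[-1], budget
    last_answer = answers[-1] if answers else None
    last_budget = checkpoints[-1] if checkpoints else 0
    return last_answer, last_budget


def convergence_rule(patience: int = 2) -> StopRule:
    """Baseline: stop once the last `patience` probed answers are identical."""
    def rule(answers: List[str]) -> bool:
        return len(answers) >= patience and len(set(answers[-patience:])) == 1
    return rule


def learned_rule(
    predict_correct: Callable[[List[str]], float],
    threshold: float = 0.8,
) -> StopRule:
    """Learned-style rule: stop once predicted correctness reaches `threshold`.

    `predict_correct` stands in for a trained predictor (an assumption here).
    """
    def rule(answers: List[str]) -> bool:
        return predict_correct(answers) >= threshold
    return rule


# Toy usage: the probed answer changes once, then stabilizes.
toy_answers = {256: "7", 512: "12", 1024: "12", 2048: "12"}
checkpoints = sorted(toy_answers)
answer, spent = run_with_stopper(toy_answers.__getitem__, convergence_rule(2), checkpoints)
print(answer, spent)
```

Keeping the stop rule as a plain callable means the same loop can compare thresholds and learned predictors under an identical checkpoint schedule, which matches the comparison the abstract poses.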