Thinking Past the Answer: Evaluating Harmful Overthinking in Large Reasoning Models
본문 미리보기
arXiv:2606.02835v1 Announce Type: new Abstract: Large Reasoning Models (LRMs) improve performance by generating explicit intermediate reasoning traces through increased test-time compute, yet the assumption that longer reasoning is consistently beneficial remains under-examined. While recent evidence shows that additional reasoning can lead models to overthink, we ask: "Once a model has reached the correct answer, does further reasoning refine the solution, or deviate from it?" To study the dyn
전체 내용이 궁금하다면?
원문을 직접 읽어보세요