Internalizing the Future: A Unified Agentic Training Paradigm for World Model Planning

본문 미리보기

arXiv:2606.27483v1 Announce Type: new Abstract: Large language model (LLM) agents have demonstrated strong capability in sequential decision-making, yet they remains fundamentally reactive in long-horizon tasks. Unlike humans who employ "what-if" reasoning to evaluate potential plans before commitment, standard agents lack an internal world model to simulate future outcomes. Therefore, we propose to internalize future-aware planning by training a single autoregressive model to verbalize both a

Internalizing the Future: A Unified Agentic Training Paradigm for World Model Planning

본문 미리보기

관련 글

AI-Model Network: Concept, Current State and Future

When Does Personality Composition Matter for Multi-Agent LLM Teams?

Odyssey: Constructing Verifiable Local Truth-Preserving Foundation Models

DysLexLens: A Low-Resource LLM Framework for Analysing Dyslexic Learners Insights from Online Forums