Skill-Guided Continuation Distillation for GUI Agents

Body preview

arXiv:2606.18890v1 Announce Type: new

Abstract: Improving GUI agents typically relies on behavior cloning on expert trajectories. However, as the current policy deviates from the expert policy, it inevitably encounters policy-induced off-trajectory states during closed-loop execution, i.e., states that fall outside the expert trajectories. Since expert trajectories provide no demonstrations for these unseen states, such states receive no effective supervision, leaving the policy unable to selec
