Entity

COMAP: Co-Evolving World Models and Agent Policies for LLM Agents

Equipping language agents with world models enables them to anticipate environment dynamics and evaluate candidate actions before execution. However, existing textual world models are typically fixed after training, preventing them from adapting to the on-policy state-action distributions induced by an evolving agent. Meanwhile, agent-improvement methods often rely on external rewards or verifiers, limiting their applicability in realistic interactive environments. In this paper, we propose COMA

Paper · arXiv

cs.AI

Authors: Youwei Liu, Jian Wang, Hanlin Wang, Wenjie Li
Published: 2026-06-01
Categories: cs.AIcs.CL

Abstract ↗

via arXiv · 2606.02372