Entity

Do Proactive Agents Really Need an LLM to Decide When to Wake and What to Anchor?

Proactive agents read user activity as text and call an LLM on every event to decide whether to act. But user activity is not natively text: it is a structured event stream of (actor, verb, object, timestamp) tuples that the operating system already maintains in graph form. Rendering the structure as text and asking an LLM to recover it is a round-trip the system never had to take. We treat the always-on signal as graph updates rather than text and use a small temporal-graph-learning (TGL) model

Paper · arXiv

cs.CL

Authors: Xiaoze Liu, Ruowang Zhang, Amir H. Abdi, Michel Galley, Zhikai Chen + 3 more
Published: 2026-05-28
Categories: cs.CLcs.AIcs.HC

Abstract ↗

via arXiv · 2605.30152