Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories

The past few decades have witnessed significant advances in the design of machine learning algorithms, from early studies on task-specific shallow models to more general deep Large Language Models (LLMs). Despite showing promising results in tasks that require instant prediction or in-context learning, existing models lack the ability to continually learn and effectively transfer their temporal in-context knowledge to their long-term parameters. Inspired by the human learning process, we introduce a

Paper · arXiv

cs.LG

Authors: Ali Behrouz, Farnoosh Hashemi, Vahab Mirrokni
Published: 2026-06-02
Categories: cs.LG, cs.AI

via arXiv · 2606.03979