Where Should Knowledge Enter? A Layered Framework for Knowledge Infusion in Multimodal Iterative Generative Mo

Multimodal generative models produce fluent outputs but remain unreliable when generation must respect structured, domain-specific, or safety-critical knowledge. Existing methods incorporate knowledge through mechanisms such as prompt augmentation, guidance, latent editing, or fine-tuning, yet they are typically categorized by technique rather than by the component of the generative process they modify. We argue that knowledge infusion in iterative generative models is fundamentally an interventi

Paper · arXiv

cs.AI

Authors: Renjith Prasad, Chathurangi Shyalika, Anushka Pawar, Amit Sheth
Published: 2026-06-04

via arXiv · 2606.06356