Entity

Efficient ASR Training with Conversations that Never Happened

Conversational ASR for lower-resource languages and niche domains is limited by the scarcity of domain-matched multi-speaker training data. We propose an augmentation pipeline that generates scenario-level dialogues with participant metadata, maps speaker attributes to TTS voice profiles, and assembles synthesized utterances into speaker-aware simulated conversations. We evaluated five LLM families under single-generator, fixed-budget mixture, and scale-up settings using the same FastConformer-L

Paper · arXiv

cs.CL

Authors: Máté Gedeon, Péter Mihajlik
Published: 2026-06-02
Categories: cs.CLcs.AIcs.SDeess.AS

Abstract ↗

via arXiv · 2606.03957