Skip to content
From Layers to Submodules: Rethinking Granularity in Replacement-Based LLM Compression · Vinony