Entity

From Latent Space to Training Data: Explainable Specialization in Minimal MLPs

We here study whether training biases can make hidden neurons specialize in minimal one-hidden-layer MLPs, and whether such specialization improves prototype-based reconstruction of the training dataset from the learned weights. We consider Gaussianactivation MLPs of width equal to dataset size and compare three structural losses that respectively encourage coverage of the training samples, separation between neuron-induced prototypes, and low overlap of hidden responses, against the standard fi

Paper · arXiv

cs.LG

Authors: Enrique Alba, Ezequiel Lopez-Rubio
Published: 2026-05-25
Categories: cs.LGcs.AI

Abstract ↗

via arXiv · 2605.25939