RESEARCH

When Sample Selection Bias Precipitates Model Collapse

ArXiv cs.AI · Mon, 15 Jun 2026 04:00:00 GMT

arXiv:2606.13732v1 Announce Type: new Abstract: The proliferation of recursive training on synthetic data can alleviate data scarcity but risks model collapse, where repeated training erodes distributional tails and homogenizes outputs. Data selection is widely viewed as a remedy

Read original source Discuss with A.S.I.S