
On Distinguishing Capability Elicitation from Capability Creation in Post-Training: A Free-Energy Perspective

ArXiv cs.AI · Tue, 12 May 2026 04:00:00 GMT

arXiv:2605.08368v1

Abstract: Debates about large language model post-training often treat supervised fine-tuning (SFT) as imitation and reinforcement learning (RL) as discovery. But this distinction is too coarse. What matters is whether a training procedure in
