RESEARCH

Transferability for General Reasoning: An Automated Curriculum for Multi-Domain RLVR

ArXiv cs.AI · Thu, 25 Jun 2026 04:00:00 GMT

arXiv:2606.25178v1 Abstract: Reinforcement learning with verifiable rewards (RLVR) has been extended from single-domain training to multi-domain reasoning suites spanning mathematics, programming, and science. However, the training curriculum (how often each do
