TECH

VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO

Hacker News · Tue, 23 Jun 2026 02:01:25 GMT

Article URL: https://arxiv.org/abs/2606.16140
Comments URL: https://news.ycombinator.com/item?id=48639240
Points: 4
# Comments: 0
