RESEARCH

More Thinking, More Bias: Length-Driven Position Bias in Reasoning Models

ArXiv cs.AI · Mon, 11 May 2026 04:00:00 GMT

arXiv:2605.06672v1 Announce Type: new Abstract: Chain-of-thought (CoT) reasoning and reasoning-tuned models such as DeepSeek-R1 are commonly assumed to reduce shallow heuristic biases by thinking carefully. We test this on position bias in multiple-choice QA and find a different

Read original source Discuss with A.S.I.S