VAMPS: Visual-Assisted Mathematical Problem Solving Benchmark

ArXiv cs.AI · Thu, 04 Jun 2026 04:00:00 GMT

arXiv:2606.04244v1 · Announce Type: new

Abstract: Multimodal large language models are increasingly capable of complex reasoning, yet their performance often degrades when they must externalize a problem through a tool and then reason over the tool's output, specifically when they
