Not Search, But Scan: Benchmarking MLLMs on Scan-Oriented Academic Paper Reasoning
By: Rongjin Li, Zichen Tang, Xianghe Wang, Xinyi Hu, Zhengyu Wang, Zhengyu Lu, Yiling Huang, Jiayuan Chen, Weisheng Tan, Jiacheng Liu, Zhongjun Yang, Haihong E
Published: 2026-03-31
View on arXiv →#cs.AI
Abstract
This research presents a new benchmark for evaluating Multimodal Large Language Models (MLLMs) specifically on their ability to perform 'scan-oriented' reasoning over academic papers, moving beyond simple search queries to assess deeper comprehension and extraction capabilities. It addresses a critical gap in current MLLM evaluation.