TSRBench: A Comprehensive Multi-task Multi-modal Time Series Reasoning Benchmark for Generalist Models

By: Fangxu Yu, Xingang Guo, Lingzhi Yuan, Haoqiang Kang, Hongyu Zhao, Lianhui Qin, Furong Huang, Bin Hu, Tianyi Zhou

Published: 2026-01-27

View on arXiv →
#cs.AI

Abstract

This paper introduces TSRBench, a comprehensive benchmark designed for multi-task and multi-modal time series reasoning. It aims to evaluate and advance generalist AI models in their ability to understand and process complex temporal data from diverse sources, promoting progress in generalized AI applications.

FEEDBACK

Projects

No projects yet

TSRBench: A Comprehensive Multi-task Multi-modal Time Series Reasoning Benchmark for Generalist Models | ArXiv Intelligence