MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

By: Hanzhang Zhou, Xu Zhang, Panrong Tong, Jianan Zhang, Liangyu Chen, Quyu Kong, Chenglin Cai, Chen Liu, Yue Wang, Jingren Zhou, Steven Hoi

Published: 2025-12-26

View on arXiv →
#cs.AI

Abstract

This paper introduces MAI-UI, a family of foundation GUI agents designed for real-world deployment. It integrates agent-user interaction, external tool use via MCP, and a native device-cloud collaboration system, establishing new state-of-the-art performance in GUI grounding and mobile navigation benchmarks.

FEEDBACK

Projects

No projects yet

MAI-UI Technical Report: Real-World Centric Foundation GUI Agents | ArXiv Intelligence