Mind the Gap Between Spatial Reasoning and Acting! Step-by-Step Evaluation of Agents With Spatial-Gym
By: Lars Benedikt Kaesberg, Tianyu Yang, Niklas Bauer, Terry Ruas, Jan Philip Wahle, Bela Gipp
Published: 2026-04-13
View on arXiv →#cs.AI
Abstract
This paper introduces Spatial-Gym, a Gymnasium environment that isolates spatial constraint reasoning by testing pathfinding in 2D-grid puzzles as a sequential decision task with optional backtracking. It evaluates AI models in a step-by-step manner, revealing a significant human-model gap in spatial reasoning, and suggesting that current models struggle with global planning when forced into sequential actions. Spatial-Gym provides a framework for diagnosing limitations and improving spatial reasoning through reinforcement learning.