Agentic Test-Time Scaling for WebAgents

By: Nicholas Lee, Lutfi Eren Erdogan, Chris Joseph John, Surya Krishnapillai, Michael W. Mahoney, Kurt Keutzer, Amir Gholami

Published: 2026-02-13

View on arXiv →
#cs.AI

Abstract

This work presents CATTS, a simple technique for dynamically allocating compute for multi-step agents, especially web agents. It empirically studies inference-time scaling for web agents, addressing the challenge of compounding errors in long-horizon, multi-step tasks and proposes solutions for improved reliability and performance in real-world web interaction.

FEEDBACK

Projects

No projects yet

Agentic Test-Time Scaling for WebAgents | ArXiv Intelligence