Runtime Burden Allocation for Structured LLM Routing in Agentic Expert Systems: A Full-Factorial Cross-Backend Methodology
By: Zhou Hanlin, Chan Huah Yong
Published: 2026-04-03
Category: cs.AI
Abstract
Structured LLM routing is often treated as a prompt-engineering problem. This paper argues that it is more fundamentally a systems-level burden-allocation problem: as LLMs become core control components in agentic AI systems, routing designs must balance correctness, latency, and implementation cost under real deployment constraints.