Runtime Burden Allocation for Structured LLM Routing in Agentic Expert Systems: A Full-Factorial Cross-Backend Methodology

By: Zhou Hanlin, Chan Huah Yong

Published: 2026-04-03

View on arXiv →
#cs.AI

Abstract

Structured LLM routing is often treated as a prompt-engineering problem. This paper argues it is more fundamentally a systems-level burden-allocation problem, balancing correctness, latency, and implementation cost under real deployment constraints as LLMs become core control components in agentic AI systems.

FEEDBACK

Projects

No projects yet

Runtime Burden Allocation for Structured LLM Routing in Agentic Expert Systems: A Full-Factorial Cross-Backend Methodology | ArXiv Intelligence