AdaFuse: Adaptive Ensemble Decoding with Test-Time Scaling for LLMs
By: Chengming Cui, Tianxin Wei, Ziyi Chen, Ruizhong Qiu, Zhichen Zeng, Zhining Liu, Xuying Ning, Duo Zhou, Jingrui He
Published: 2026-01-12
View on arXiv →#cs.AI
Abstract
This paper proposes AdaFuse, an adaptive ensemble decoding method with test-time scaling for large language models (LLMs). This approach aims to enhance the performance of LLMs by combining outputs from multiple models dynamically during inference, optimizing for accuracy and efficiency in various applications.