AdaFuse: Adaptive Ensemble Decoding with Test-Time Scaling for LLMs

By: Chengming Cui, Tianxin Wei, Ziyi Chen, Ruizhong Qiu, Zhichen Zeng, Zhining Liu, Xuying Ning, Duo Zhou, Jingrui He

Published: 2026-01-12

View on arXiv →
#cs.AI

Abstract

This paper proposes AdaFuse, an adaptive ensemble decoding method with test-time scaling for large language models (LLMs). This approach aims to enhance the performance of LLMs by combining outputs from multiple models dynamically during inference, optimizing for accuracy and efficiency in various applications.

FEEDBACK

Projects

No projects yet

AdaFuse: Adaptive Ensemble Decoding with Test-Time Scaling for LLMs | ArXiv Intelligence