Health-SCORE: Towards Scalable Rubrics for Improving Health-LLMs
By: Zhichao Yang, Sepehr Janghorbani, Dongxu Zhang, Jun Han, Qian Qian, Andrew Ressler II, Gregory D. Lyng, Sanjit Singh Batra, Robert E. Tillman
Published: 2026-01-27
View on arXiv →#cs.AI
Abstract
This research focuses on developing scalable rubrics to enhance the quality and reliability of Large Language Models (LLMs) specifically tailored for healthcare applications. The goal is to improve their real-world utility, safety, and trustworthiness in clinical and health-related settings.