Health-SCORE: Towards Scalable Rubrics for Improving Health-LLMs

By: Zhichao Yang, Sepehr Janghorbani, Dongxu Zhang, Jun Han, Qian Qian, Andrew Ressler II, Gregory D. Lyng, Sanjit Singh Batra, Robert E. Tillman

Published: 2026-01-27

View on arXiv →

#cs.AI

Abstract

This research focuses on developing scalable rubrics to enhance the quality and reliability of Large Language Models (LLMs) specifically tailored for healthcare applications. The goal is to improve their real-world utility, safety, and trustworthiness in clinical and health-related settings.

FEEDBACK

Projects

No projects yet