LLMs as Clinical Research Assistants: Secure and Accurate Extraction from Unstructured EHR Narratives
By: Mitchell A. Klusty, Elizabeth C. Solie, Caroline N. Leach, W. Vaiden Logan, Lynnet E. Richey, John C. Gensel, David P. Szczykutowicz, Bryan C. McLellan, Emily B. Collier, Samuel E. Armstrong, V. K. Cody Bumgardner
Published: 2025-12-17
View on arXiv →#cs.AI
Abstract
This paper presents a secure, modular framework that leverages locally deployed large language models (LLMs) to automate structured feature extraction from unstructured electronic health record (EHR) narratives. The system integrates retrieval augmented generation (RAG) and structured response methods to reduce the burden of manual chart review, increase consistency in data capture, and accelerate clinical research, achieving high accuracy across multiple medical characteristics.