A Real-World Evaluation of LLM Medication Safety Reviews in NHS Primary Care
By: Oliver Normand, Esther Borsi, Mitch Fruin, Lauren E Walker, Jamie Heagerty, Chris C. Holmes, Anthony J Avery, Iain E Buchan, Harry Coppock
Published: 2025-12-24
View on arXiv →Abstract
Large Language Models (LLMs) show promise for medication safety in healthcare. This paper presents a real-world evaluation of an LLM-powered system for medication safety reviews in NHS Primary Care, identifying potential errors, drug-drug interactions, and adverse reactions from patient records. A retrospective study on anonymized NHS patient data revealed the LLM system achieved 100% sensitivity in detecting critical safety issues, but only correctly identified all issues and interventions in 46.9% of patients. Failure analysis indicated that contextual reasoning, rather than lack of medication knowledge, was the dominant failure mechanism, highlighting shortcomings that need addressing before safe clinical deployment.