Enhancing Radiology Report Generation and Visual Grounding using Reinforcement Learning

By: Benjamin Gundersen, Nicolas Deperrois, Samuel Ruiperez-Campillo, Thomas M. Sutter, Julia E. Vogt, Michael Moor

Published: 2025-12-11

View on arXiv →
#cs.AI

Abstract

This research explores enhancing radiology report generation and visual grounding in medical imaging by applying reinforcement learning (RL) to vision-language models (VLMs). It investigates how RL, combined with task-specific feedback, can improve the quality of CXR interpretation beyond supervised fine-tuning, leading to state-of-the-art performance in generating accurate and clinically aligned reports.

FEEDBACK

Projects

No projects yet

Enhancing Radiology Report Generation and Visual Grounding using Reinforcement Learning | ArXiv Intelligence