X-Gen: Enhancing Radiology Report Generation via LLM-Driven Data Augmentation and Decoupled Training

Wang, Chaohan
Chen, Qi
To, Minh-Son
Kutaiba, Numan
Yoo, Jae-Gon
Xie, Yutong
Wu, Qi
Department
Computer Vision
Type
Conference proceeding
Abstract
The scarcity and limited accessibility of medical data significantly challenge deep learning applications in medical AI. Radiology report generation (RRG), a key medical AI research area, could greatly improve computer-aided diagnosis through automated X-ray image interpretation. However, obtaining paired X-ray images and reports is labor-intensive and restricted by strict regulations. Large language models (LLMs), such as GPT-4, provide a promising alternative by enabling cost-effective text data augmentation and report rewriting in varied styles. We rigorously assess augmented data's clinical accuracy and stylistic similarity to radiologist-authored reports through expert evaluations. Interestingly, augmented data enhances RRG model performance, yet performance declines when augmented data surpasses original data volume due to style distribution shifts. To mitigate this, we propose integrating a conditional variational autoencoder (cVAE) into the RRG model to separate medical semantics from writing styles during training, enabling better handling of augmented data's distribution shift. Our proposed method, X-Gen, combines data augmentation with decoupled training. Tested on two public chest X-ray datasets and a private abdomen X-ray dataset, X-Gen significantly improves the performance of baseline models, showcasing its effectiveness and versatility in X-ray report generation.
Citation
C. Wang, Q. Chen, M.-S. To, N. Kutaiba, J.-G. Yoo, Y. Xie, Q. Wu, "X-Gen: Enhancing Radiology Report Generation via LLM-Driven Data Augmentation and Decoupled Training," 2025, pp. 1-8.
Source
2025 International Conference on Digital Image Computing: Techniques and Applications (DICTA)
Conference
2025 International Conference on Digital Image Computing: Techniques and Applications (DICTA)
Keywords
46 Information and Computing Sciences, 4605 Data Management and Data Science
Publisher
IEEE