Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/114218
| Title: | Radiology report generation using automatic keyword adaptation, frequency-based multi-label classification and text-to-text large language models | Authors: | He, Z Wong, ANN Yoo, JS |
Issue Date: | Sep-2025 | Source: | Computers in biology and medicine, Sept 2025, v. 196, 110625 | Abstract: | Background: Radiology reports are essential in medical imaging, providing critical insights for diagnosis, treatment, and patient management by bridging the gap between radiologists and referring physicians. However, the manual generation of radiology reports is time-consuming and labor-intensive, leading to inefficiencies and delays in clinical workflows, particularly as case volumes increase. Although deep learning approaches have shown promise in automating radiology report generation, existing methods, particularly those based on the encoder–decoder framework, suffer from significant limitations. These include a lack of explainability due to black-box features generated by encoder and limited adaptability to diverse clinical settings. Methods: In this study, we address these challenges by proposing a novel deep learning framework for radiology report generation that enhances explainability, accuracy, and adaptability. Our approach replaces traditional black-box features in computer vision with transparent keyword lists, improving the interpretability of the feature extraction process. To generate these keyword lists, we apply a multi-label classification technique, which is further enhanced by an automatic keyword adaptation mechanism. This adaptation dynamically configures the multi-label classification to better adapt specific clinical environments, reducing the reliance on manually curated reference keyword lists and improving model adaptability across diverse datasets. We also introduce a frequency-based multi-label classification strategy to address the issue of keyword imbalance, ensuring that rare but clinically significant terms are accurately identified. Finally, we leverage a pre-trained text-to-text large language model (LLM) to generate human-like, clinically relevant radiology reports from the extracted keyword lists, ensuring linguistic quality and clinical coherence. Results: We evaluate our method using two public datasets, IU-XRay and MIMIC-CXR, demonstrating superior performance over state-of-the-art methods. Our framework not only improves the accuracy and reliability of radiology report generation but also enhances the explainability of the process, fostering greater trust and adoption of AI-driven solutions in clinical practice. Comprehensive ablation studies confirm the robustness and effectiveness of each component, highlighting the significant contributions of our framework to advancing automated radiology reporting. Conclusion: In conclusion, we developed a novel deep-learning based radiology report generation method for preparing high-quality and explainable radiology report for chest X-ray images using the multi-label classification and a text-to-text large language model. Our method could address the lack of explainability in the current workflow and provide a clear and flexible automated pipeline to reduce the workload of radiologists and support the further applications related to Human–AI interactive communications. Graphical abstract: [Figure not available: see fulltext.] |
Keywords: | Automatic keyword adaptation Frequency-based multi-label classification Large language model Radiology report generation |
Publisher: | Elsevier Ltd | Journal: | Computers in biology and medicine | ISSN: | 0010-4825 | EISSN: | 1879-0534 | DOI: | 10.1016/j.compbiomed.2025.110625 | Rights: | © 2025 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). The following publication He, Z., Wong, A. N. N., & Yoo, J. S. (2025). Radiology report generation using automatic keyword adaptation, frequency-based multi-label classification and text-to-text large language models. Computers in Biology and Medicine, 196, 110625 is available at https://doi.org/10.1016/j.compbiomed.2025.110625. |
| Appears in Collections: | Journal/Magazine Article |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| 1-s2.0-S001048252500976X-main.pdf | 9.34 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



