Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/111644
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Health Technology and Informaticsen_US
dc.creatorChen, Zen_US
dc.creatorChambara, Nen_US
dc.creatorWu, Cen_US
dc.creatorLo, Xen_US
dc.creatorLiu, SYWen_US
dc.creatorGunda, STen_US
dc.creatorHan, Xen_US
dc.creatorQu, Jen_US
dc.creatorChen, Fen_US
dc.creatorYing, TCMen_US
dc.date.accessioned2025-03-04T06:43:38Z-
dc.date.available2025-03-04T06:43:38Z-
dc.identifier.issn1355-008Xen_US
dc.identifier.urihttp://hdl.handle.net/10397/111644-
dc.language.isoenen_US
dc.publisherSpringeren_US
dc.rights© The Author(s) 2024.en_US
dc.rightsThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.en_US
dc.rightsThe following publication Chen, Z., Chambara, N., Wu, C., Lo, X., Liu, S. Y. W., Gunda, S. T., ... & Ying, M. T. C. (2024). Assessing the feasibility of ChatGPT-4o and Claude 3-Opus in thyroid nodule classification based on ultrasound images. Endocrine, 1-9 is available at https://10.1007/s12020-024-04066-x.en_US
dc.subjectArtificial intelligenceen_US
dc.subjectDiagnostic accuracyen_US
dc.subjectLarge language modelen_US
dc.subjectThyroid canceren_US
dc.subjectUltrasounden_US
dc.titleAssessing the feasibility of ChatGPT-4o and Claude 3-Opus in thyroid nodule classification based on ultrasound imagesen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage1041en_US
dc.identifier.epage1049en_US
dc.identifier.volume87en_US
dc.identifier.issue3en_US
dc.identifier.doi10.1007/s12020-024-04066-xen_US
dcterms.abstractPurpose: Large language models (LLMs) are pivotal in artificial intelligence, demonstrating advanced capabilities in natural language understanding and multimodal interactions, with significant potential in medical applications. This study explores the feasibility and efficacy of LLMs, specifically ChatGPT-4o and Claude 3-Opus, in classifying thyroid nodules using ultrasound images.en_US
dcterms.abstractMethods: This study included 112 patients with a total of 116 thyroid nodules, comprising 75 benign and 41 malignant cases. Ultrasound images of these nodules were analyzed using ChatGPT-4o and Claude 3-Opus to diagnose the benign or malignant nature of the nodules. An independent evaluation by a junior radiologist was also conducted. Diagnostic performance was assessed using Cohen’s Kappa and receiver operating characteristic (ROC) curve analysis, referencing pathological diagnoses.en_US
dcterms.abstractResults: ChatGPT-4o demonstrated poor agreement with pathological results (Kappa = 0.116), while Claude 3-Opus showed even lower agreement (Kappa = 0.034). The junior radiologist exhibited moderate agreement (Kappa = 0.450). ChatGPT-4o achieved an area under the ROC curve (AUC) of 57.0% (95% CI: 48.6–65.5%), slightly outperforming Claude 3-Opus (AUC of 52.0%, 95% CI: 43.2–60.9%). In contrast, the junior radiologist achieved a significantly higher AUC of 72.4% (95% CI: 63.7–81.1%). The unnecessary biopsy rates were 41.4% for ChatGPT-4o, 43.1% for Claude 3-Opus, and 12.1% for the junior radiologist.en_US
dcterms.abstractConclusion: While LLMs such as ChatGPT-4o and Claude 3-Opus show promise for future applications in medical imaging, their current use in clinical diagnostics should be approached cautiously due to their limited accuracy.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationEndocrine, 2025, v. 87, no. 3. p. 1041-1049en_US
dcterms.isPartOfEndocrineen_US
dcterms.issued2025-
dc.identifier.scopus2-s2.0-85206669157-
dc.identifier.pmid39394537-
dc.identifier.eissn1559-0100en_US
dc.description.validate202503 bchyen_US
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_TA-
dc.description.fundingSourceOthersen_US
dc.description.fundingTextHong Kong Polytechnic Universityen_US
dc.description.pubStatusPublisheden_US
dc.description.TASpringer Nature (2024)en_US
dc.description.oaCategoryTAen_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
s12020-024-04066-x.pdf2.04 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

20
Citations as of Apr 14, 2025

Downloads

6
Citations as of Apr 14, 2025

SCOPUSTM   
Citations

21
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

16
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.