Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/110294
DC Field | Value | Language |
---|---|---|
dc.contributor | Department of Rehabilitation Sciences | - |
dc.creator | Liu, JQJ | - |
dc.creator | Hui, KTK | - |
dc.creator | Al, Zoubi, F | - |
dc.creator | Zhou, ZZX | - |
dc.creator | Samartzis, D | - |
dc.creator | Yu, CCH | - |
dc.creator | Chang, JR | - |
dc.creator | Wong, AYL | - |
dc.date.accessioned | 2024-12-03T03:09:16Z | - |
dc.date.available | 2024-12-03T03:09:16Z | - |
dc.identifier.uri | http://hdl.handle.net/10397/110294 | - |
dc.language.iso | en | en_US |
dc.publisher | BioMed Central Ltd. | en_US |
dc.rights | © The Author(s) 2024. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. | en_US |
dc.rights | The following publication Liu, J.Q.J., Hui, K.T.K., Al Zoubi, F. et al. The great detectives: humans versus AI detectors in catching large language model-generated medical writing. Int J Educ Integr 20, 8 (2024) is available at https://doi.org/10.1007/s40979-024-00155-6. | en_US |
dc.subject | Academic integrity | en_US |
dc.subject | Accuracy | en_US |
dc.subject | AI content detectors | en_US |
dc.subject | Artificial intelligence | en_US |
dc.subject | ChatGPT | en_US |
dc.subject | Generative AI | en_US |
dc.subject | Paraphrasing tools | en_US |
dc.subject | Peer review | en_US |
dc.subject | Perplexity scores | en_US |
dc.subject | Scientific rigour | en_US |
dc.title | The great detectives : humans versus AI detectors in catching large language model-generated medical writing | en_US |
dc.type | Journal/Magazine Article | en_US |
dc.identifier.volume | 20 | - |
dc.identifier.doi | 10.1007/s40979-024-00155-6 | - |
dcterms.abstract | Background: The application of artificial intelligence (AI) in academic writing has raised concerns regarding accuracy, ethics, and scientific rigour. Some AI content detectors may not accurately identify AI-generated texts, especially those that have undergone paraphrasing. Therefore, there is a pressing need for efficacious approaches or guidelines to govern AI usage in specific disciplines. | - |
dcterms.abstract | Objective: Our study aims to compare the accuracy of mainstream AI content detectors and human reviewers in detecting AI-generated rehabilitation-related articles with or without paraphrasing. | - |
dcterms.abstract | Study design: This cross-sectional study purposively chose 50 rehabilitation-related articles from four peer-reviewed journals, and then fabricated another 50 articles using ChatGPT. Specifically, ChatGPT was used to generate the introduction, discussion, and conclusion sections based on the original titles, methods, and results. Wordtune was then used to rephrase the ChatGPT-generated articles. Six common AI content detectors (Originality.ai, Turnitin, ZeroGPT, GPTZero, Content at Scale, and GPT-2 Output Detector) were employed to identify AI content for the original, ChatGPT-generated and AI-rephrased articles. Four human reviewers (two student reviewers and two professorial reviewers) were recruited to differentiate between the original articles and AI-rephrased articles, which were expected to be more difficult to detect. They were instructed to give reasons for their judgements. | - |
dcterms.abstract | Results: Originality.ai correctly detected 100% of ChatGPT-generated and AI-rephrased texts. ZeroGPT accurately detected 96% of ChatGPT-generated and 88% of AI-rephrased articles. The areas under the receiver operating characteristic curve (AUROC) of ZeroGPT were 0.98 for identifying human-written and AI articles. Turnitin showed a 0% misclassification rate for human-written articles, although it only identified 30% of AI-rephrased articles. Professorial reviewers accurately discriminated at least 96% of AI-rephrased articles, but they misclassified 12% of human-written articles as AI-generated. On average, students only identified 76% of AI-rephrased articles. Reviewers identified AI-rephrased articles based on ‘incoherent content’ (34.36%), followed by ‘grammatical errors’ (20.26%), and ‘insufficient evidence’ (16.15%). | - |
dcterms.abstract | Conclusions and relevance: This study directly compared the accuracy of advanced AI detectors and human reviewers in detecting AI-generated medical writing after paraphrasing. Our findings demonstrate that specific detectors and experienced reviewers can accurately identify articles generated by Large Language Models, even after paraphrasing. The rationale employed by our reviewers in their assessments can inform future evaluation strategies for monitoring AI usage in medical education or publications. AI content detectors may be incorporated as an additional screening tool in the peer-review process of academic journals. | - |
dcterms.accessRights | open access | en_US |
dcterms.bibliographicCitation | International journal for educational integrity, 2024, v. 20, 8 | - |
dcterms.isPartOf | International journal for educational integrity | - |
dcterms.issued | 2024 | - |
dc.identifier.scopus | 2-s2.0-85193720260 | - |
dc.identifier.eissn | 1833-2595 | - |
dc.identifier.artn | 8 | - |
dc.description.validate | 202412 bcch | - |
dc.description.oa | Version of Record | en_US |
dc.identifier.FolderNumber | OA_Scopus/WOS | en_US |
dc.description.fundingSource | Others | en_US |
dc.description.fundingText | GP Batteries Industrial Safety Trust Fund | en_US |
dc.description.pubStatus | Published | en_US |
dc.description.oaCategory | CC | en_US |
Appears in Collections: | Journal/Magazine Article |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
s40979-024-00155-6.pdf | 1.75 MB | Adobe PDF | View/Open |
Page views
27
Citations as of Apr 14, 2025
Downloads
4
Citations as of Apr 14, 2025
SCOPUSTM
Citations
16
Citations as of Jun 19, 2025

Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.