• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用人类和人工智能生成的科学写作来评估人工智能内容检测器的性能。

Performance of Artificial Intelligence Content Detectors Using Human and Artificial Intelligence-Generated Scientific Writing.

机构信息

Department of Surgery, Division of Surgical Oncology, Medical College of Wisconsin, Milwaukee, WI, USA.

Department of Surgery, Division of Trauma, Critical Care, and Acute Care Surgery, Duke University, Durham, NC, USA.

出版信息

Ann Surg Oncol. 2024 Oct;31(10):6387-6393. doi: 10.1245/s10434-024-15549-6. Epub 2024 Jun 22.

DOI:10.1245/s10434-024-15549-6
PMID:38909113
Abstract

BACKGROUND

Few studies have examined the performance of artificial intelligence (AI) content detection in scientific writing. This study evaluates the performance of publicly available AI content detectors when applied to both human-written and AI-generated scientific articles.

METHODS

Articles published in Annals of Surgical Oncology (ASO) during the year 2022, as well as AI-generated articles using OpenAI's ChatGPT, were analyzed by three AI content detectors to assess the probability of AI-generated content. Full manuscripts and their individual sections were evaluated. Group comparisons and trend analyses were conducted by using ANOVA and linear regression. Classification performance was determined using area under the curve (AUC).

RESULTS

A total of 449 original articles met inclusion criteria and were evaluated to determine the likelihood of being generated by AI. Each detector also evaluated 47 AI-generated articles by using titles from ASO articles. Human-written articles had an average probability of being AI-generated of 9.4% with significant differences between the detectors. Only two (0.4%) human-written manuscripts were detected as having a 0% probability of being AI-generated by all three detectors. Completely AI-generated articles were evaluated to have a higher average probability of being AI-generated (43.5%) with a range from 12.0 to 99.9%.

CONCLUSIONS

This study demonstrates differences in the performance of various AI content detectors with the potential to label human-written articles as AI-generated. Any effort toward implementing AI detectors must include a strategy for continuous evaluation and validation as AI models and detectors rapidly evolve.

摘要

背景

很少有研究检查人工智能 (AI) 内容检测在科学写作中的性能。本研究评估了三种公开可用的 AI 内容检测器在应用于人类撰写和 AI 生成的科学文章时的性能。

方法

分析了 2022 年发表在《外科肿瘤学年鉴》(Annals of Surgical Oncology,ASO)上的文章以及使用 OpenAI 的 ChatGPT 生成的 AI 文章,以评估 AI 生成内容的可能性。评估了全文及其各个部分。使用 ANOVA 和线性回归进行组间比较和趋势分析。使用曲线下面积 (AUC) 确定分类性能。

结果

共有 449 篇原始文章符合纳入标准,并评估了它们被 AI 生成的可能性。每个检测器还使用来自 ASO 文章的标题评估了 47 篇 AI 生成的文章。人类撰写的文章被 AI 生成的平均概率为 9.4%,各检测器之间存在显著差异。只有两篇(0.4%)人类撰写的手稿被所有三个检测器检测为 AI 生成的概率为 0%。完全 AI 生成的文章被评估为具有更高的 AI 生成平均概率(43.5%),范围为 12.0%至 99.9%。

结论

本研究表明,各种 AI 内容检测器的性能存在差异,有可能将人类撰写的文章标记为 AI 生成。任何实施 AI 检测器的努力都必须包括一项策略,以随着 AI 模型和检测器的快速发展,不断评估和验证。

相似文献

1
Performance of Artificial Intelligence Content Detectors Using Human and Artificial Intelligence-Generated Scientific Writing.使用人类和人工智能生成的科学写作来评估人工智能内容检测器的性能。
Ann Surg Oncol. 2024 Oct;31(10):6387-6393. doi: 10.1245/s10434-024-15549-6. Epub 2024 Jun 22.
2
Characterizing the Increase in Artificial Intelligence Content Detection in Oncology Scientific Abstracts From 2021 to 2023.描述 2021 年至 2023 年肿瘤学科学摘要中人工智能内容检测的增加情况。
JCO Clin Cancer Inform. 2024 May;8:e2400077. doi: 10.1200/CCI.24.00077.
3
Human vs machine: identifying ChatGPT-generated abstracts in Gynecology and Urogynecology.人机之争:在妇科和泌尿外科学中识别 ChatGPT 生成的摘要。
Am J Obstet Gynecol. 2024 Aug;231(2):276.e1-276.e10. doi: 10.1016/j.ajog.2024.04.045. Epub 2024 May 6.
4
Detecting generative artificial intelligence in scientific articles: Evasion techniques and implications for scientific integrity.检测科学文章中的生成式人工智能:规避技术及其对科学诚信的影响。
Orthop Traumatol Surg Res. 2023 Dec;109(8):103706. doi: 10.1016/j.otsr.2023.103706. Epub 2023 Oct 12.
5
Assessing the Reproducibility of the Structured Abstracts Generated by ChatGPT and Bard Compared to Human-Written Abstracts in the Field of Spine Surgery: Comparative Analysis.评估 ChatGPT 和 Bard 生成的结构化摘要与脊柱外科领域人类撰写的摘要在可重复性方面的比较:对比分析。
J Med Internet Res. 2024 Jun 26;26:e52001. doi: 10.2196/52001.
6
The Use of Artificial Intelligence in Writing Scientific Review Articles.人工智能在撰写科学综述文章中的应用。
Curr Osteoporos Rep. 2024 Feb;22(1):115-121. doi: 10.1007/s11914-023-00852-0. Epub 2024 Jan 16.
7
What is the rate of text generated by artificial intelligence over a year of publication in Orthopedics & Traumatology: Surgery & Research? Analysis of 425 articles before versus after the launch of ChatGPT in November 2022.在《矫形外科与创伤学:手术与研究》杂志上发表的人工智能文本在一年时间内的生成率是多少?分析 2022 年 11 月 ChatGPT 发布前后的 425 篇文章。
Orthop Traumatol Surg Res. 2023 Dec;109(8):103694. doi: 10.1016/j.otsr.2023.103694. Epub 2023 Sep 29.
8
Application of artificial intelligence chatbots, including ChatGPT, in education, scholarly work, programming, and content generation and its prospects: a narrative review.人工智能聊天机器人(包括 ChatGPT)在教育、学术工作、编程、内容生成等领域的应用及其前景:叙述性综述。
J Educ Eval Health Prof. 2023;20:38. doi: 10.3352/jeehp.2023.20.38. Epub 2023 Dec 27.
9
Artificial Intelligence Can Generate Fraudulent but Authentic-Looking Scientific Medical Articles: Pandora's Box Has Been Opened.人工智能可以生成虚假但看起来真实的科学医学文章:潘多拉的盒子已经被打开。
J Med Internet Res. 2023 May 31;25:e46924. doi: 10.2196/46924.
10
AI vs academia: Experimental study on AI text detectors' accuracy in behavioral health academic writing.人工智能与学术界:关于人工智能文本检测器在行为健康学术写作中准确性的实验研究
Account Res. 2024 Mar 22:1-17. doi: 10.1080/08989621.2024.2331757.

引用本文的文献

1
ChatGPT in Academic Writing: A Scientometric Analysis of Literature Published Between 2022 and 2023.学术写作中的ChatGPT:对2022年至2023年发表文献的科学计量分析
J Empir Res Hum Res Ethics. 2025 Jul;20(3):131-148. doi: 10.1177/15562646251350203. Epub 2025 Jun 22.
2
How do oncology journals approach plagiarism? A website review.肿瘤学期刊如何处理抄袭问题?一项网站审查。
Res Integr Peer Rev. 2025 Mar 31;10(1):3. doi: 10.1186/s41073-025-00160-4.
3
AI detectors are poor western blot classifiers: a study of accuracy and predictive values.

本文引用的文献

1
Opportunities, Challenges, and Future Directions of Generative Artificial Intelligence in Medical Education: Scoping Review.生成式人工智能在医学教育中的机遇、挑战与未来方向:范围综述
JMIR Med Educ. 2023 Oct 20;9:e48785. doi: 10.2196/48785.
2
Why technical solutions for detecting AI-generated content in research and education are insufficient.为何用于检测研究与教育中人工智能生成内容的技术解决方案并不充分。
Patterns (N Y). 2023 Jul 14;4(7):100796. doi: 10.1016/j.patter.2023.100796.
3
Evaluation and Comparison of Ophthalmic Scientific Abstracts and References by Current Artificial Intelligence Chatbots.
人工智能检测工具在蛋白质印迹法分类方面表现不佳:准确性和预测价值研究
PeerJ. 2025 Feb 20;13:e18988. doi: 10.7717/peerj.18988. eCollection 2025.
4
Artificial Intelligence in Medical Writing: Addressing Untouched Threats.医学写作中的人工智能:应对未被触及的威胁。
JMA J. 2025 Jan 15;8(1):273-275. doi: 10.31662/jmaj.2024-0268. Epub 2024 Dec 6.
5
Letter to the Editor: The Fate of Individual Tone in the Age of AI Writing.致编辑的信:人工智能写作时代中个体风格的命运
Ann Surg Oncol. 2025 Apr;32(4):2419-2420. doi: 10.1245/s10434-024-16678-8. Epub 2024 Dec 9.
6
Do I Write Like Artificial Intelligence?我写得像人工智能吗?
Ann Surg Oncol. 2025 Apr;32(4):2423-2424. doi: 10.1245/s10434-024-16480-6. Epub 2024 Nov 13.
当前人工智能聊天机器人对眼科科学摘要和参考文献的评估与比较。
JAMA Ophthalmol. 2023 Sep 1;141(9):819-824. doi: 10.1001/jamaophthalmol.2023.3119.
4
Editorial: Generative artificial intelligence as a plagiarism problem.社论:生成式人工智能作为一种剽窃问题。
Biol Psychol. 2023 Jul;181:108621. doi: 10.1016/j.biopsycho.2023.108621. Epub 2023 Jun 24.
5
Exploring the Boundaries of Reality: Investigating the Phenomenon of Artificial Intelligence Hallucination in Scientific Writing Through ChatGPT References.探索现实的边界:通过ChatGPT参考文献研究科学写作中的人工智能幻觉现象。
Cureus. 2023 Apr 11;15(4):e37432. doi: 10.7759/cureus.37432. eCollection 2023 Apr.
6
The role of ChatGPT in scientific communication: writing better scientific review articles.ChatGPT在科学交流中的作用:撰写更优质的科学综述文章。
Am J Cancer Res. 2023 Apr 15;13(4):1148-1154. eCollection 2023.
7
Generative artificial intelligence: Can ChatGPT write a quality abstract?生成式人工智能:ChatGPT 能写出高质量的摘要吗?
Emerg Med Australas. 2023 Oct;35(5):809-811. doi: 10.1111/1742-6723.14233. Epub 2023 May 4.
8
Comparing scientific abstracts generated by ChatGPT to real abstracts with detectors and blinded human reviewers.使用检测器和不知情的人类评审员,将ChatGPT生成的科学摘要与真实摘要进行比较。
NPJ Digit Med. 2023 Apr 26;6(1):75. doi: 10.1038/s41746-023-00819-6.
9
From human writing to artificial intelligence generated text: examining the prospects and potential threats of ChatGPT in academic writing.从人类写作到人工智能生成的文本:审视ChatGPT在学术写作中的前景与潜在威胁。
Biol Sport. 2023 Apr;40(2):615-622. doi: 10.5114/biolsport.2023.125623. Epub 2023 Mar 15.
10
Journals take up arms against AI-written text.期刊杂志联手抵制 AI 生成文本。
Science. 2023 Feb 24;379(6634):740-741. doi: 10.1126/science.adh2762. Epub 2023 Feb 23.