Bheemireddy Samhita, Leslie Sarah E, Durden Jakob A, Burnet George, Aryanpour Zain, Fong Ashlyn, Higgins Madeline G, Greenseid Samantha, McLemore Lauren, Li Gande, Miles Randy, Taft Nancy, Tevis Sarah
Albany Medical College, Albany Medical Center, Albany, NY, USA.
Adult and Child Center for Outcomes Research and Delivery Science (ACCORDS), University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
Ann Surg Oncol. 2025 Jul 21. doi: 10.1245/s10434-025-17860-2.
BACKGROUND: Patients have immediate access to their diagnostic reports, but these reports exceed the recommended reading level for patient-facing materials. Generative artificial intelligence may be a tool for improving patient comprehension of health information. This study assessed the readability and accuracy of ChatGPT-simplified breast pathology reports.

METHODS: Ten de-identified patient breast pathology reports were simplified by ChatGPT-4.0 using three different prompts. Prompt 1 requested simplification, Prompt 2 added a 6th-grade-level specification, and Prompt 3 requested essential information only. The Flesch-Kincaid Reading Level (FKRL) and Flesch Reading Ease Score (FRES) were used to quantify readability and ease of reading, respectively. Five physicians used a four-point scale to assess factual correctness, relevancy, and fabrications to determine overall accuracy. Mean scores and standard deviations for FKRL, FRES, and accuracy were compared using analysis of variance (ANOVA) and t-tests.

RESULTS: Prompt 2 yielded a reduction in FKRL (p < 0.001) and an increase in FRES (p < 0.001), indicating improved readability and ease of reading. ChatGPT-simplified reports received an overall accuracy score of 3.59/4 (standard deviation [SD] ± 0.17). The scores by rubric category were 3.62 (SD ± 0.31) for factual correctness (4 = completely correct), 3.27 (SD ± 0.44) for relevancy (4 = completely relevant), and 3.89 (SD ± 0.11) for fabricated information (4 = no fabricated information).

CONCLUSIONS: When given a grade-level specification, ChatGPT simplified breast pathology reports to the reading level recommended for patient-facing materials while mostly maintaining accuracy. To minimize the risk of medically inaccurate and/or misleading information, ChatGPT-simplified reports should be reviewed before dissemination.
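For orientation, both readability indices used in the study are standard formulas computed from word, sentence, and syllable counts. A minimal Python sketch follows; the syllable counter is a naive vowel-group heuristic for illustration only (an assumption on our part — the study presumably used validated readability tooling, which it does not name):

```python
import re

def count_syllables(word: str) -> int:
    # Naive heuristic: count vowel groups, subtract a trailing silent "e".
    # Illustrative only; validated tools use dictionary-based counts.
    word = word.lower()
    n = len(re.findall(r"[aeiouy]+", word))
    if word.endswith("e") and n > 1:
        n -= 1
    return max(n, 1)

def readability(text: str) -> tuple[float, float]:
    """Return (FRES, FKRL) for a passage of English text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+", text)
    syllables = sum(count_syllables(w) for w in words)
    w, s = len(words), len(sentences)
    # Flesch Reading Ease Score: higher = easier to read.
    fres = 206.835 - 1.015 * (w / s) - 84.6 * (syllables / w)
    # Flesch-Kincaid grade level: approximate U.S. school grade.
    fkrl = 0.39 * (w / s) + 11.8 * (syllables / w) - 15.59
    return round(fres, 1), round(fkrl, 1)
```

A 6th-grade target, as specified in Prompt 2, corresponds roughly to FKRL ≤ 6 and FRES in the 80–90 range.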