用于提取肾活检病理诊断的自然语言处理模型的开发

Development of a Natural Language Processing Model for Extracting Kidney Biopsy Pathology Diagnoses.

作者信息

Bobart Shane A, Hsu Enshuo, Potter Thomas, Truong Luan, Waterman Amy, Jones Stephen, Shafi Tariq

机构信息

Division of Nephrology, Hypertension and Transplantation, Houston Methodist Hospital, Houston, TX.

Division of Nephrology and Hypertension, Mayo Clinic, Jacksonville, FL.

出版信息

Kidney Med. 2025 Jun 14;7(8):101047. doi: 10.1016/j.xkme.2025.101047. eCollection 2025 Aug.

DOI:10.1016/j.xkme.2025.101047

PMID:40746935

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12311501/

Abstract

RATIONALE & OBJECTIVE: Kidney biopsy reports are in a nonindexed text format, and the diagnosis requires labor-intensive manual abstraction. Natural language processing (NLP) has not been rigorously tested for kidney biopsy diagnosis extraction. Our objective was to develop an accurate model to extract the biopsy diagnosis from free-text reports.

STUDY DESIGN

Text classification using NLP.

SETTING & PARTICIPANTS: 2,666 patients with 3,042 native kidney biopsy reports in the Portable Document Format, from June 2016 to December 2023.

PREDICTOR

Kidney biopsy diagnosis.

OUTCOMES

The performance of the NLP algorithm for all and the 20 most common diagnoses based on precision, recall, F1 score, and area under the receiver operating curve (AUROC).

ANALYTICAL APPROACH

A domain expert manually abstracted the diagnosis, and a renal pathologist validated a random subset (n = 200). Structured Query Language server and Python processed reports into machine-readable free text. We used PubMed Bidirectional Encoder Representations from Transformers to develop our NLP algorithm. We randomly split the reports into training (80%; n = 2,434) and testing (20%; n = 608) sets to train the NLP system. We further divided the testing set into 20% validation and 80% fine-tuning sets.

RESULTS

The median age was 57 years, with 50% female, 29% African Americans, and 23% Hispanic participants. The 5 most frequent glomerular diagnoses were diabetic kidney disease (23.7%), focal segmental glomerulosclerosis (15.5%), lupus nephritis (9.7%), immunoglobulin A nephropathy (8.9), and membranous nephropathy (7.2%). The Cohen kappa coefficient for interrater reliability was 0.76. PubMed Bidirectional Encoder Representations from Transformers fine-tuned with a training set showed the average AUROC for NLP performance in the testing set of 0.95 across all diagnoses with an F1 score of 0.57. For the 20 most common diagnoses, the AUROC was 0.97 with an F1 score of 0.72. Limitations: Single centered; sample size and use limited to research purposes.

CONCLUSIONS

We demonstrate an accurate and scalable NLP system to extract the primary diagnosis from free-text kidney biopsy reports, which can facilitate epidemiologic studies and identify patients for clinical trial recruitment.

摘要

原理与目的

肾活检报告为非索引文本格式，诊断需要耗费大量人力进行人工提取。自然语言处理（NLP）在肾活检诊断提取方面尚未经过严格测试。我们的目标是开发一种准确的模型，从自由文本报告中提取活检诊断。

研究设计

使用NLP进行文本分类。

设置与参与者

2016年6月至2023年12月期间，2666例患者有3042份原生肾活检报告，格式为便携式文档格式。

预测因素

肾活检诊断。

结果

NLP算法基于精度、召回率、F1分数和受试者工作特征曲线下面积（AUROC）对所有诊断以及20种最常见诊断的性能。

分析方法

领域专家人工提取诊断，肾脏病理学家对随机子集（n = 200）进行验证。结构化查询语言服务器和Python将报告处理为机器可读的自由文本。我们使用来自Transformer的PubMed双向编码器表示来开发我们的NLP算法。我们将报告随机分为训练集（80%；n = 2434）和测试集（20%；n = 608）来训练NLP系统。我们进一步将测试集分为20%的验证集和80%的微调集。

结果

中位年龄为57岁，50%为女性，29%为非裔美国人，23%为西班牙裔参与者。5种最常见的肾小球诊断为糖尿病肾病（23.7%）、局灶节段性肾小球硬化（15.5%）、狼疮性肾炎（9.7%）、免疫球蛋白A肾病（8.9%）和膜性肾病（7.2%）。评分者间可靠性的Cohen kappa系数为0.76。使用训练集进行微调的来自Transformer的PubMed双向编码器表示显示，在测试集中，NLP性能的平均AUROC在所有诊断中为0.95，F1分数为0.57。对于20种最常见的诊断，AUROC为0.97，F1分数为0.72。局限性：单中心；样本量和用途限于研究目的。

结论

我们展示了一种准确且可扩展的NLP系统，可从自由文本肾活检报告中提取主要诊断，这有助于流行病学研究并识别适合临床试验招募的患者。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/94da/12311501/a47f03067b88/gr1.jpg

相似文献

Development of a Natural Language Processing Model for Extracting Kidney Biopsy Pathology Diagnoses.用于提取肾活检病理诊断的自然语言处理模型的开发

Kidney Med. 2025 Jun 14;7(8):101047. doi: 10.1016/j.xkme.2025.101047. eCollection 2025 Aug.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

PDF Entity Annotation Tool (PEAT).PDF实体注释工具（PEAT）。

J Open Source Softw. 2025 Apr 8;10(108):5336. doi: 10.21105/joss.05336.

Automated Transformation of Unstructured Cardiovascular Diagnostic Reports into Structured Datasets Using Sequentially Deployed Large Language Models.使用顺序部署的大语言模型将非结构化心血管诊断报告自动转换为结构化数据集

medRxiv. 2024 Oct 8:2024.10.08.24315035. doi: 10.1101/2024.10.08.24315035.

Language Models for Multilabel Document Classification of Surgical Concepts in Exploratory Laparotomy Operative Notes: Algorithm Development Study.用于探索性剖腹手术记录中手术概念多标签文档分类的语言模型：算法开发研究

JMIR Med Inform. 2025 Jul 9;13:e71176. doi: 10.2196/71176.

Variation within and between digital pathology and light microscopy for the diagnosis of histopathology slides: blinded crossover comparison study.数字病理学与光学显微镜检查在组织病理学切片诊断中的内部及相互间差异：双盲交叉对比研究

Health Technol Assess. 2025 Jul;29(30):1-75. doi: 10.3310/SPLK4325.

De Novo Natural Language Processing Algorithm Accurately Identifies Myxofibrosarcoma From Pathology Reports.全新自然语言处理算法可从病理报告中准确识别黏液纤维肉瘤。

Clin Orthop Relat Res. 2025 Jan 1;483(1):80-87. doi: 10.1097/CORR.0000000000003270. Epub 2024 Oct 2.

Use of deep learning-based NLP models for full-text data elements extraction for systematic literature review tasks.基于深度学习的自然语言处理模型在系统文献综述任务的全文数据元素提取中的应用。

Sci Rep. 2025 Jun 3;15(1):19379. doi: 10.1038/s41598-025-03979-5.

Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究

Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.

本文引用的文献

BERT-based natural language processing analysis of French CT reports: Application to the measurement of the positivity rate for pulmonary embolism.基于BERT的法语CT报告自然语言处理分析：在肺栓塞阳性率测量中的应用

Res Diagn Interv Imaging. 2023 Mar 27;6:100027. doi: 10.1016/j.redii.2023.100027. eCollection 2023 Jun.

Bidirectional Encoder Representations from Transformers in Radiology: A Systematic Review of Natural Language Processing Applications.基于 Transformer 的双向编码器表示在放射学中的应用：自然语言处理应用的系统评价。

J Am Coll Radiol. 2024 Jun;21(6):914-941. doi: 10.1016/j.jacr.2024.01.012. Epub 2024 Jan 30.

The Development of a Comprehensive Clinicopathologic Registry for Glomerular Diseases Using Natural Language Processing.利用自然语言处理技术开发肾小球疾病综合临床病理登记系统

Can J Kidney Health Dis. 2023 Jun 16;10:20543581231178963. doi: 10.1177/20543581231178963. eCollection 2023.

An accessible, efficient, and accurate natural language processing method for extracting diagnostic data from pathology reports.一种用于从病理报告中提取诊断数据的便捷、高效且准确的自然语言处理方法。

J Pathol Inform. 2022 Nov 8;13:100154. doi: 10.1016/j.jpi.2022.100154. eCollection 2022.

The Cleveland Clinic Kidney Biopsy Epidemiological Project.克利夫兰诊所肾脏活检流行病学项目。

Kidney360. 2022 Oct 18;3(12):2077-2085. doi: 10.34067/KID.0005882022. eCollection 2022 Dec 29.

Ketogenic-Diet Shake Containing -Associated Acute Interstitial Nephritis.含生酮饮食奶昔相关性急性间质性肾炎

Case Rep Nephrol Dial. 2022 Oct 27;12(3):219-225. doi: 10.1159/000526391. eCollection 2022 Sep-Dec.

Natural Language Processing in Diagnostic Texts from Nephropathology.肾脏病病理学诊断文本中的自然语言处理

Diagnostics (Basel). 2022 Jul 15;12(7):1726. doi: 10.3390/diagnostics12071726.

Acute glomerulonephritis.急性肾小球肾炎

Lancet. 2022 Apr 23;399(10335):1646-1663. doi: 10.1016/S0140-6736(22)00461-5.

Validation of deep learning natural language processing algorithm for keyword extraction from pathology reports in electronic health records.深度学习自然语言处理算法在电子病历中从病理报告中提取关键词的验证。

Sci Rep. 2020 Nov 20;10(1):20265. doi: 10.1038/s41598-020-77258-w.

Clinical Characteristics of and Risk Factors for Chronic Kidney Disease Among Adults and Children: An Analysis of the CURE-CKD Registry.成人和儿童慢性肾脏病的临床特征和危险因素：CURE-CKD 登记分析。

JAMA Netw Open. 2019 Dec 2;2(12):e1918169. doi: 10.1001/jamanetworkopen.2019.18169.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于提取肾活检病理诊断的自然语言处理模型的开发

Development of a Natural Language Processing Model for Extracting Kidney Biopsy Pathology Diagnoses.

作者信息

机构信息

出版信息

STUDY DESIGN

PREDICTOR

OUTCOMES

ANALYTICAL APPROACH

RESULTS

CONCLUSIONS

原理与目的

研究设计

设置与参与者

预测因素

结果

分析方法

结果

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献