
Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology.

Affiliations

Hematology and Cell Therapy Unit, IRCCS Istituto Tumori 'Giovanni Paolo II', Viale Orazio Flacco, 65, Bari, Italy.

Department of Electrical and Information Engineering, Politecnico of Bari, Bari, Italy.

Publication information

Sci Rep. 2021 Dec 10;11(1):23823. doi: 10.1038/s41598-021-03204-z.

DOI: 10.1038/s41598-021-03204-z
PMID: 34893665
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8664934/
Abstract

The unstructured nature of Real-World (RW) data from onco-hematological patients and the scarce accessibility to integrated systems restrain the use of RW information for research purposes. Natural Language Processing (NLP) might help in transposing unstructured reports into standardized electronic health records. We exploited NLP to develop an automated tool, named ARGO (Automatic Record Generator for Onco-hematology) to recognize information from pathology reports and populate electronic case report forms (eCRFs) pre-implemented by REDCap. ARGO was applied to hemo-lymphopathology reports of diffuse large B-cell, follicular, and mantle cell lymphomas, and assessed for accuracy (A), precision (P), recall (R) and F1-score (F) on internal (n = 239) and external (n = 93) report series. 326 (98.2%) reports were converted into corresponding eCRFs. Overall, ARGO showed high performance in capturing (1) identification report number (all metrics > 90%), (2) biopsy date (all metrics > 90% in both series), (3) specimen type (86.6% and 91.4% of A, 98.5% and 100.0% of P, 92.5% and 95.5% of F, and 87.2% and 91.4% of R for internal and external series, respectively), (4) diagnosis (100% of P with A, R and F of 90% in both series). We developed and validated a generalizable tool that generates structured eCRFs from real-life pathology reports.
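The reported figures can be cross-checked against each other. The sketch below is not part of ARGO; it assumes the standard definition of the F1-score as the harmonic mean of precision and recall, and the function name `f1_score` and the `reported` dictionary are illustrative only. It recomputes F for the specimen-type field from the P and R values given in the abstract, and the conversion rate from the two series sizes.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, both given as fractions in [0, 1]."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Specimen-type precision (P), recall (R) and F1 (F) as stated in the abstract.
reported = {
    "internal": {"P": 0.985, "R": 0.872, "F": 0.925},
    "external": {"P": 1.000, "R": 0.914, "F": 0.955},
}

for series, m in reported.items():
    f = f1_score(m["P"], m["R"])
    print(f"{series}: recomputed F1 = {f:.1%}, reported F1 = {m['F']:.1%}")

# Share of reports converted into eCRFs across both series.
total_reports = 239 + 93  # internal + external
converted = 326
conversion_rate = converted / total_reports
print(f"converted: {converted}/{total_reports} = {conversion_rate:.1%}")
```

Under these assumptions the recomputed values round to the same figures as those reported: 92.5% and 95.5% for F, and 98.2% for the share of converted reports.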

