闭环：自动识别扫描文档中异常的成像结果。

Closing the loop: automatically identifying abnormal imaging results in scanned documents.

机构信息

School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas, USA.

McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, Texas, USA.

出版信息

J Am Med Inform Assoc. 2022 Apr 13;29(5):831-840. doi: 10.1093/jamia/ocac007.

DOI:10.1093/jamia/ocac007

PMID:35146510

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9714594/

Abstract

OBJECTIVES

Scanned documents (SDs), while common in electronic health records and potentially rich in clinically relevant information, rarely fit well with clinician workflow. Here, we identify scanned imaging reports requiring follow-up with high recall and practically useful precision.

MATERIALS AND METHODS

We focused on identifying imaging findings for 3 common causes of malpractice claims: (1) potentially malignant breast (mammography) and (2) lung (chest computed tomography [CT]) lesions and (3) long-bone fracture (X-ray) reports. We train our ClinicalBERT-based pipeline on existing typed/dictated reports classified manually or using ICD-10 codes, evaluate using a test set of manually classified SDs, and compare against string-matching (baseline approach).

RESULTS

A total of 393 mammograms, 305 chest CT, and 683 bone X-ray reports were manually reviewed. The string-matching approach had an F1 of 0.667. For mammograms, chest CTs, and bone X-rays, respectively: models trained on manually classified training data and optimized for F1 reached an F1 of 0.900, 0.905, and 0.817, while separate models optimized for recall achieved a recall of 1.000 with precisions of 0.727, 0.518, and 0.275. Models trained on ICD-10-labelled data and optimized for F1 achieved F1 scores of 0.647, 0.830, and 0.643, while those optimized for recall achieved a recall of 1.0 with precisions of 0.407, 0.683, and 0.358.

DISCUSSION

Our pipeline can identify abnormal reports with potentially useful performance and so decrease the manual effort required to screen for abnormal findings that require follow-up.

CONCLUSION

It is possible to automatically identify clinically significant abnormalities in SDs with high recall and practically useful precision in a generalizable and minimally laborious way.

摘要

目的

扫描文档（SD）在电子健康记录中很常见，并且可能包含丰富的临床相关信息，但很少与临床医生的工作流程相匹配。在这里，我们确定了需要高召回率和实用精度进行随访的扫描成像报告。

材料和方法

我们专注于识别三种常见医疗事故索赔原因的成像结果：（1）潜在恶性乳腺（乳房 X 光片）和（2）肺（胸部 CT）病变以及（3）长骨骨折（X 光）报告。我们使用现有的手动分类或使用 ICD-10 代码分类的已键入/已口述报告来训练基于 ClinicalBERT 的管道，使用手动分类的 SD 测试集进行评估，并与字符串匹配（基线方法）进行比较。

结果

总共审查了 393 张乳房 X 光片、305 张胸部 CT 和 683 张骨 X 射线报告。字符串匹配方法的 F1 值为 0.667。对于乳房 X 光片、胸部 CT 和骨 X 射线，分别使用手动分类训练数据训练并针对 F1 进行优化的模型达到了 0.900、0.905 和 0.817 的 F1 值，而单独针对召回率进行优化的模型则达到了 1.000 的召回率和 0.727、0.518 和 0.275 的精度。使用 ICD-10 标记数据训练并针对 F1 进行优化的模型达到了 0.647、0.830 和 0.643 的 F1 值，而针对召回率进行优化的模型则达到了 1.0 的召回率和 0.407、0.683 和 0.358 的精度。

讨论

我们的管道可以识别具有潜在有用性能的异常报告，从而减少筛选需要随访的异常发现所需的人工工作。

结论

可以以可推广和最小化劳动的方式自动识别 SD 中的具有高召回率和实用精度的临床显著异常。

相似文献

Closing the loop: automatically identifying abnormal imaging results in scanned documents.

J Am Med Inform Assoc. 2022 Apr 13;29(5):831-840. doi: 10.1093/jamia/ocac007.

Identification of Long Bone Fractures in Radiology Reports Using Natural Language Processing to support Healthcare Quality Improvement.

Appl Clin Inform. 2016 Nov 9;7(4):1051-1068. doi: 10.4338/ACI-2016-08-RA-0129.

Facilitating clinical research through automation: Combining optical character recognition with natural language processing.

Clin Trials. 2022 Oct;19(5):504-511. doi: 10.1177/17407745221093621. Epub 2022 May 24.

Automatic classification of scanned electronic health record documents.

Int J Med Inform. 2020 Dec;144:104302. doi: 10.1016/j.ijmedinf.2020.104302. Epub 2020 Oct 17.

Automated Outcome Classification of Computed Tomography Imaging Reports for Pediatric Traumatic Brain Injury.

Acad Emerg Med. 2016 Feb;23(2):171-8. doi: 10.1111/acem.12859. Epub 2016 Jan 14.

Natural Language-based Machine Learning Models for the Annotation of Clinical Radiology Reports.

Radiology. 2018 May;287(2):570-580. doi: 10.1148/radiol.2018171093. Epub 2018 Jan 30.

Automated outcome classification of emergency department computed tomography imaging reports.

Acad Emerg Med. 2013 Aug;20(8):848-54. doi: 10.1111/acem.12174.

Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield.

AJR Am J Roentgenol. 2017 Apr;208(4):750-753. doi: 10.2214/AJR.16.16128. Epub 2017 Jan 31.

Use of Machine Learning to Identify Follow-Up Recommendations in Radiology Reports.

J Am Coll Radiol. 2019 Mar;16(3):336-343. doi: 10.1016/j.jacr.2018.10.020. Epub 2018 Dec 29.

Natural Language Processing for Identification of Incidental Pulmonary Nodules in Radiology Reports.

J Am Coll Radiol. 2019 Nov;16(11):1587-1594. doi: 10.1016/j.jacr.2019.04.026. Epub 2019 May 24.

引用本文的文献

Generalizable and automated classification of TNM stage from pathology reports with external validation.

Nat Commun. 2024 Oct 16;15(1):8916. doi: 10.1038/s41467-024-53190-9.

The incremental design of a machine learning framework for medical records processing.

J Am Med Inform Assoc. 2024 Oct 1;31(10):2236-2245. doi: 10.1093/jamia/ocae194.

Extracting laboratory test information from paper-based reports.

BMC Med Inform Decis Mak. 2023 Nov 6;23(1):251. doi: 10.1186/s12911-023-02346-6.

A scoping review of natural language processing of radiology reports in breast cancer.

Front Oncol. 2023 Apr 12;13:1160167. doi: 10.3389/fonc.2023.1160167. eCollection 2023.

本文引用的文献

Automatic classification of scanned electronic health record documents.

Int J Med Inform. 2020 Dec;144:104302. doi: 10.1016/j.ijmedinf.2020.104302. Epub 2020 Oct 17.

Array programming with NumPy.

Nature. 2020 Sep;585(7825):357-362. doi: 10.1038/s41586-020-2649-2. Epub 2020 Sep 16.

Follow Up of Incidental High-Risk Pulmonary Nodules on Computed Tomography Pulmonary Angiography at Care Transitions.

J Hosp Med. 2019 Jun 1;14(6):349-352. doi: 10.12788/jhm.3128. Epub 2019 Feb 20.

Access to Routinely Collected Clinical Data for Research: A Process Implemented at an Academic Medical Center.

Clin Transl Sci. 2019 May;12(3):231-235. doi: 10.1111/cts.12614. Epub 2019 Feb 12.

Adherence to Radiology Recommendations in a Clinical CT Lung Screening Program.

J Am Coll Radiol. 2018 Feb;15(2):282-286. doi: 10.1016/j.jacr.2017.10.014. Epub 2017 Dec 28.

Using Natural Language Processing of Free-Text Radiology Reports to Identify Type 1 Modic Endplate Changes.

J Digit Imaging. 2018 Feb;31(1):84-90. doi: 10.1007/s10278-017-0013-3.

Temporal bone radiology report classification using open source machine learning and natural langue processing libraries.

BMC Med Inform Decis Mak. 2016 Jun 6;16:65. doi: 10.1186/s12911-016-0306-3.

Automated Outcome Classification of Computed Tomography Imaging Reports for Pediatric Traumatic Brain Injury.

Acad Emerg Med. 2016 Feb;23(2):171-8. doi: 10.1111/acem.12859. Epub 2016 Jan 14.

Radiology Malpractice Claims in the United States From 2008 to 2012: Characteristics and Implications.

J Am Coll Radiol. 2016 Feb;13(2):124-30. doi: 10.1016/j.jacr.2015.07.013. Epub 2015 Oct 9.

Natural language processing of radiology reports for the detection of thromboembolic diseases and clinically relevant incidental findings.

BMC Bioinformatics. 2014 Aug 7;15(1):266. doi: 10.1186/1471-2105-15-266.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

闭环：自动识别扫描文档中异常的成像结果。

Closing the loop: automatically identifying abnormal imaging results in scanned documents.

机构信息

School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas, USA.

McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, Texas, USA.

出版信息

J Am Med Inform Assoc. 2022 Apr 13;29(5):831-840. doi: 10.1093/jamia/ocac007.

DOI:10.1093/jamia/ocac007

PMID:35146510

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9714594/

Abstract

OBJECTIVES

MATERIALS AND METHODS

RESULTS

DISCUSSION

Our pipeline can identify abnormal reports with potentially useful performance and so decrease the manual effort required to screen for abnormal findings that require follow-up.

CONCLUSION

It is possible to automatically identify clinically significant abnormalities in SDs with high recall and practically useful precision in a generalizable and minimally laborious way.

摘要

目的

材料和方法

结果

讨论

我们的管道可以识别具有潜在有用性能的异常报告，从而减少筛选需要随访的异常发现所需的人工工作。

结论

可以以可推广和最小化劳动的方式自动识别 SD 中的具有高召回率和实用精度的临床显著异常。

闭环：自动识别扫描文档中异常的成像结果。

Closing the loop: automatically identifying abnormal imaging results in scanned documents.

机构信息

出版信息

OBJECTIVES

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSION

目的

材料和方法

结果

讨论

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

闭环：自动识别扫描文档中异常的成像结果。

Closing the loop: automatically identifying abnormal imaging results in scanned documents.

机构信息

出版信息

OBJECTIVES

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSION

目的

材料和方法

结果

讨论

结论