• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

病理报告的文本搜索方法能否在大型管理数据库中准确识别直肠癌患者?

Can Text-Search Methods of Pathology Reports Accurately Identify Patients with Rectal Cancer in Large Administrative Databases?

作者信息

Musselman Reilly P, Rothwell Deanna, Auer Rebecca C, Moloo Husein, Boushey Robin P, van Walraven Carl

机构信息

Division of General Surgery, University of Ottawa, Ottawa, ON, Canada.

Department Epidemiology and Community Medicine, Ottawa Hospital Research Institute, Ottawa, ON, Canada.

出版信息

J Pathol Inform. 2018 May 2;9:18. doi: 10.4103/jpi.jpi_71_17. eCollection 2018.

DOI:10.4103/jpi.jpi_71_17
PMID:29862128
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5952547/
Abstract

BACKGROUND

The aim of this study is to derive and to validate a cohort of rectal cancer surgical patients within administrative datasets using text-search analysis of pathology reports.

MATERIALS AND METHODS

A text-search algorithm was developed and validated on pathology reports from 694 known rectal cancers, 1000 known colon cancers, and 1000 noncolorectal specimens. The algorithm was applied to all pathology reports available within the Ottawa Hospital Data Warehouse from 1996 to 2010. Identified pathology reports were validated as rectal cancer specimens through manual chart review. Sensitivity, specificity, and positive predictive value (PPV) of the text-search methodology were calculated.

RESULTS

In the derivation cohort of pathology reports ( = 2694), the text-search algorithm had a sensitivity and specificity of 100% and 98.6%, respectively. When this algorithm was applied to all pathology reports from 1996 to 2010 ( = 284,032), 5588 pathology reports were identified as consistent with rectal cancer. Medical record review determined that 4550 patients did not have rectal cancer, leaving a final cohort of 1038 rectal cancer patients. Sensitivity and specificity of the text-search algorithm were 100% and 98.4%, respectively. PPV of the algorithm was 18.6%.

CONCLUSIONS

Text-search methodology is a feasible way to identify all rectal cancer surgery patients through administrative datasets with high sensitivity and specificity. However, in the presence of a low pretest probability, text-search methods must be combined with a validation method, such as manual chart review, to be a viable approach.

摘要

背景

本研究的目的是通过对病理报告进行文本搜索分析,在管理数据集中推导并验证一组直肠癌手术患者。

材料与方法

开发了一种文本搜索算法,并在694例已知直肠癌、1000例已知结肠癌和1000例非结直肠标本的病理报告上进行验证。该算法应用于渥太华医院数据仓库1996年至2010年期间所有可用的病理报告。通过人工病历审查将识别出的病理报告验证为直肠癌标本。计算文本搜索方法的敏感性、特异性和阳性预测值(PPV)。

结果

在病理报告推导队列(n = 2694)中,文本搜索算法的敏感性和特异性分别为100%和98.6%。当该算法应用于1996年至2010年的所有病理报告(n = 284,032)时,5588份病理报告被识别为与直肠癌一致。病历审查确定4550例患者没有直肠癌,最终队列中有1038例直肠癌患者。文本搜索算法的敏感性和特异性分别为100%和98.4%。该算法的PPV为18.6%。

结论

文本搜索方法是通过管理数据集以高敏感性和特异性识别所有直肠癌手术患者的可行方法。然而,在预测试概率较低的情况下,文本搜索方法必须与验证方法(如人工病历审查)相结合,才能成为一种可行的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0a1/5952547/0fbe2c1fe204/JPI-9-18-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0a1/5952547/4a1719602d20/JPI-9-18-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0a1/5952547/0fbe2c1fe204/JPI-9-18-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0a1/5952547/4a1719602d20/JPI-9-18-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0a1/5952547/0fbe2c1fe204/JPI-9-18-g004.jpg

相似文献

1
Can Text-Search Methods of Pathology Reports Accurately Identify Patients with Rectal Cancer in Large Administrative Databases?病理报告的文本搜索方法能否在大型管理数据库中准确识别直肠癌患者?
J Pathol Inform. 2018 May 2;9:18. doi: 10.4103/jpi.jpi_71_17. eCollection 2018.
2
Validation of Case Finding Algorithms for Hepatocellular Cancer From Administrative Data and Electronic Health Records Using Natural Language Processing.使用自然语言处理技术从行政数据和电子健康记录中验证肝细胞癌病例发现算法
Med Care. 2016 Feb;54(2):e9-14. doi: 10.1097/MLR.0b013e3182a30373.
3
Do Diagnostic and Procedure Codes Within Population-Based, Administrative Datasets Accurately Identify Patients with Rectal Cancer?基于人群的行政数据集内的诊断和操作代码能否准确识别直肠癌患者?
J Gastrointest Surg. 2019 Feb;23(2):367-376. doi: 10.1007/s11605-018-4043-z. Epub 2018 Dec 3.
4
De Novo Natural Language Processing Algorithm Accurately Identifies Myxofibrosarcoma From Pathology Reports.全新自然语言处理算法可从病理报告中准确识别黏液纤维肉瘤。
Clin Orthop Relat Res. 2025 Jan 1;483(1):80-87. doi: 10.1097/CORR.0000000000003270. Epub 2024 Oct 2.
5
Applying a Text-Search Algorithm to Radiology Reports Can Find More Patients With Pulmonary Nodules Than Radiology Coding Alone.将文本搜索算法应用于放射学报告比单独使用放射学编码能发现更多患有肺结节的患者。
Fed Pract. 2020 May;37(Suppl 2):S32-S37.
6
Retrospective Derivation and Validation of an Automated Electronic Search Algorithm to Identify Post Operative Cardiovascular and Thromboembolic Complications.一种用于识别术后心血管和血栓栓塞并发症的自动化电子搜索算法的回顾性推导与验证
Appl Clin Inform. 2015 Sep 9;6(3):565-76. doi: 10.4338/ACI-2015-03-RA-0026. eCollection 2015.
7
Retrospective derivation and validation of a search algorithm to identify extubation failure in the intensive care unit.用于识别重症监护病房拔管失败的搜索算法的回顾性推导与验证
BMC Anesthesiol. 2014 May 23;14:41. doi: 10.1186/1471-2253-14-41. eCollection 2014.
8
Detecting professional interpreter use among patients with limited English proficiency: Derivation and validation study.在英语水平有限的患者中检测专业口译员的使用情况:推导与验证研究。
SAGE Open Med. 2022 May 17;10:20503121221098146. doi: 10.1177/20503121221098146. eCollection 2022.
9
Assembling and validating data from multiple sources to study care for Veterans with bladder cancer.整合并验证来自多个来源的数据,以研究对膀胱癌退伍军人的护理。
BMC Urol. 2017 Sep 6;17(1):78. doi: 10.1186/s12894-017-0271-x.
10
Chiari malformation Type I surgery in pediatric patients. Part 1: validation of an ICD-9-CM code search algorithm.小儿患者的Ⅰ型Chiari畸形手术。第1部分:ICD-9-CM编码搜索算法的验证。
J Neurosurg Pediatr. 2016 May;17(5):519-24. doi: 10.3171/2015.10.PEDS15370. Epub 2016 Jan 22.

引用本文的文献

1
Do Diagnostic and Procedure Codes Within Population-Based, Administrative Datasets Accurately Identify Patients with Rectal Cancer?基于人群的行政数据集内的诊断和操作代码能否准确识别直肠癌患者?
J Gastrointest Surg. 2019 Feb;23(2):367-376. doi: 10.1007/s11605-018-4043-z. Epub 2018 Dec 3.

本文引用的文献

1
Effect of Laparoscopic-Assisted Resection vs Open Resection on Pathological Outcomes in Rectal Cancer: The ALaCaRT Randomized Clinical Trial.腹腔镜辅助与开放手术切除直肠癌对病理结局的影响:ALA-CART 随机临床试验。
JAMA. 2015 Oct 6;314(13):1356-63. doi: 10.1001/jama.2015.12009.
2
Development of an electronic breast pathology database in a community health system.在社区卫生系统中开发电子乳腺病理数据库。
J Pathol Inform. 2014 Jul 30;5(1):26. doi: 10.4103/2153-3539.137730. eCollection 2014.
3
ICD-10 codes used to identify adverse drug events in administrative data: a systematic review.
用于在行政数据中识别药物不良事件的 ICD-10 代码:系统评价。
J Am Med Inform Assoc. 2014 May-Jun;21(3):547-57. doi: 10.1136/amiajnl-2013-002116. Epub 2013 Nov 12.
4
Identifying patients with ischemic heart disease in an electronic medical record.在电子病历中识别缺血性心脏病患者。
J Prim Care Community Health. 2011 Jan 1;2(1):49-53. doi: 10.1177/2150131910382251.
5
Laparoscopic versus open surgery for rectal cancer (COLOR II): short-term outcomes of a randomised, phase 3 trial.腹腔镜与开腹手术治疗直肠癌(COLOR II):一项随机、3 期临床试验的短期结果。
Lancet Oncol. 2013 Mar;14(3):210-8. doi: 10.1016/S1470-2045(13)70016-0. Epub 2013 Feb 6.
6
Automated identification of postoperative complications within an electronic medical record using natural language processing.利用自然语言处理技术在电子病历中自动识别术后并发症。
JAMA. 2011 Aug 24;306(8):848-55. doi: 10.1001/jama.2011.1204.
7
Administrative database research infrequently used validated diagnostic or procedural codes.行政数据库研究很少使用经过验证的诊断或程序代码。
J Clin Epidemiol. 2011 Oct;64(10):1054-9. doi: 10.1016/j.jclinepi.2011.01.001. Epub 2011 Apr 6.
8
Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.梅奥临床文本分析和知识提取系统(cTAKES):架构、组件评估和应用。
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):507-13. doi: 10.1136/jamia.2009.001560.
9
Incidence, follow-up, and outcomes of incidental abdominal aortic aneurysms.偶然发现的腹主动脉瘤的发病率、随访和结局。
J Vasc Surg. 2010 Aug;52(2):282-9.e1-2. doi: 10.1016/j.jvs.2010.03.006. Epub 2010 Jun 11.
10
The use of narrative text for injury surveillance research: a systematic review.利用叙事文本进行伤害监测研究:系统评价。
Accid Anal Prev. 2010 Mar;42(2):354-63. doi: 10.1016/j.aap.2009.09.020. Epub 2009 Oct 24.