• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Classification of cancer-related death certificates using machine learning.使用机器学习对癌症相关死亡证明进行分类。
Australas Med J. 2013 May 30;6(5):292-9. doi: 10.4066/AMJ.2013.1654. Print 2013.
2
Automatic ICD-10 classification of cancers from free-text death certificates.从自由文本死亡证明中对癌症进行ICD - 10自动分类。
Int J Med Inform. 2015 Nov;84(11):956-65. doi: 10.1016/j.ijmedinf.2015.08.004. Epub 2015 Aug 13.
3
Extracting cancer mortality statistics from death certificates: A hybrid machine learning and rule-based approach for common and rare cancers.从死亡证明中提取癌症死亡率统计数据:一种用于常见和罕见癌症的混合机器学习和基于规则的方法。
Artif Intell Med. 2018 Jul;89:1-9. doi: 10.1016/j.artmed.2018.04.011. Epub 2018 May 10.
4
Automatic classification of diseases from free-text death certificates for real-time surveillance.用于实时监测的基于自由文本死亡证明的疾病自动分类
BMC Med Inform Decis Mak. 2015 Jul 15;15:53. doi: 10.1186/s12911-015-0174-2.
5
Automatic classification of free-text medical causes from death certificates for reactive mortality surveillance in France.法国反应性死亡率监测中死亡证明的自由文本医疗原因自动分类。
Int J Med Inform. 2019 Nov;131:103915. doi: 10.1016/j.ijmedinf.2019.06.022. Epub 2019 Jul 6.
6
Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives.开发和评估 RapTAT:一种用于从医学叙述中映射短语概念的机器学习系统。
J Biomed Inform. 2014 Apr;48:54-65. doi: 10.1016/j.jbi.2013.11.008. Epub 2013 Dec 4.
7
Automatic Classification of Free-Text Radiology Reports to Identify Limb Fractures using Machine Learning and the SNOMED CT Ontology.利用机器学习和SNOMED CT本体对自由文本放射学报告进行自动分类以识别肢体骨折
AMIA Jt Summits Transl Sci Proc. 2013 Mar 18;2013:300-4. eCollection 2013.
8
Enhancing timeliness of drug overdose mortality surveillance: A machine learning approach.提高药物过量死亡率监测的及时性:一种机器学习方法。
PLoS One. 2019 Oct 16;14(10):e0223318. doi: 10.1371/journal.pone.0223318. eCollection 2019.
9
Use of the Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) for Processing Free Text in Health Care: Systematic Scoping Review.系统医学术语命名法(SNOMED CT)在医疗保健中处理自由文本的应用:系统范围综述。
J Med Internet Res. 2021 Jan 26;23(1):e24594. doi: 10.2196/24594.
10
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.

引用本文的文献

1
Clinical Characteristics of COVID-19 Patients and Application to an Artificial Intelligence System for Disease Surveillance.新型冠状病毒肺炎患者的临床特征及其在疾病监测人工智能系统中的应用
J Clin Med. 2022 Mar 5;11(5):1437. doi: 10.3390/jcm11051437.
2
Use of the Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) for Processing Free Text in Health Care: Systematic Scoping Review.系统医学术语命名法(SNOMED CT)在医疗保健中处理自由文本的应用:系统范围综述。
J Med Internet Res. 2021 Jan 26;23(1):e24594. doi: 10.2196/24594.
3
Enhancing timeliness of drug overdose mortality surveillance: A machine learning approach.提高药物过量死亡率监测的及时性:一种机器学习方法。
PLoS One. 2019 Oct 16;14(10):e0223318. doi: 10.1371/journal.pone.0223318. eCollection 2019.
4
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.
5
Automatic classification of diseases from free-text death certificates for real-time surveillance.用于实时监测的基于自由文本死亡证明的疾病自动分类
BMC Med Inform Decis Mak. 2015 Jul 15;15:53. doi: 10.1186/s12911-015-0174-2.
6
Automatic detection of tweets reporting cases of influenza like illnesses in Australia.自动检测澳大利亚报告流感样病例的推文。
Health Inf Sci Syst. 2015 Feb 24;3(Suppl 1 HISA Big Data in Biomedicine and Healthcare 2013 Con):S4. doi: 10.1186/2047-2501-3-S1-S4. eCollection 2015.
7
Artificial intelligence in health - the three big challenges.健康领域的人工智能——三大挑战。
Australas Med J. 2013 May 30;6(5):315-7. doi: 10.4066/AMJ.2013.1758. Print 2013.

本文引用的文献

1
The impact of OCR accuracy on automated cancer classification of pathology reports.光学字符识别(OCR)准确性对病理报告自动癌症分类的影响。
Stud Health Technol Inform. 2012;178:250-6.
2
Identification of pneumonia and influenza deaths using the Death Certificate Pipeline.利用死亡证明管道识别肺炎和流感死亡。
BMC Med Inform Decis Mak. 2012 May 8;12:37. doi: 10.1186/1472-6947-12-37.
3
Automatic extraction of cancer characteristics from free-text pathology reports for cancer notifications.从用于癌症通报的自由文本病理报告中自动提取癌症特征。
Stud Health Technol Inform. 2011;168:117-24.
4
Cancer survival in Australia, Canada, Denmark, Norway, Sweden, and the UK, 1995-2007 (the International Cancer Benchmarking Partnership): an analysis of population-based cancer registry data.澳大利亚、加拿大、丹麦、挪威、瑞典和英国的癌症生存状况,1995-2007 年(国际癌症基准合作):基于人群的癌症登记数据分析。
Lancet. 2011 Jan 8;377(9760):127-38. doi: 10.1016/S0140-6736(10)62231-3. Epub 2010 Dec 21.
5
Symbolic rule-based classification of lung cancer stages from free-text pathology reports.基于符号规则的肺癌分期的自由文本病理学报告分类。
J Am Med Inform Assoc. 2010 Jul-Aug;17(4):440-5. doi: 10.1136/jamia.2010.003707.
6
Evaluation of a generalizable approach to clinical information retrieval using the automated retrieval console (ARC).使用自动化检索控制台 (ARC) 评估一种通用的临床信息检索方法。
J Am Med Inform Assoc. 2010 Jul-Aug;17(4):375-82. doi: 10.1136/jamia.2009.001412.
7
SNOMED clinical terms: overview of the development process and project status.医学系统命名法临床术语:开发过程与项目状态概述
Proc AMIA Symp. 2001:662-6.

使用机器学习对癌症相关死亡证明进行分类。

Classification of cancer-related death certificates using machine learning.

作者信息

Butt Luke, Zuccon Guido, Nguyen Anthony, Bergheim Anton, Grayson Narelle

机构信息

The Australian e-Health Research Centre, Brisbane, Queensland, Australia;

出版信息

Australas Med J. 2013 May 30;6(5):292-9. doi: 10.4066/AMJ.2013.1654. Print 2013.

DOI:10.4066/AMJ.2013.1654
PMID:23745151
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3674421/
Abstract

BACKGROUND

Cancer monitoring and prevention relies on the critical aspect of timely notification of cancer cases. However, the abstraction and classification of cancer from the free-text of pathology reports and other relevant documents, such as death certificates, exist as complex and time-consuming activities.

AIMS

In this paper, approaches for the automatic detection of notifiable cancer cases as the cause of death from free-text death certificates supplied to Cancer Registries are investigated.

METHOD

A number of machine learning classifiers were studied. Features were extracted using natural language techniques and the Medtex toolkit. The numerous features encompassed stemmed words, bi-grams, and concepts from the SNOMED CT medical terminology. The baseline consisted of a keyword spotter using keywords extracted from the long description of ICD-10 cancer related codes.

RESULTS

Death certificates with notifiable cancer listed as the cause of death can be effectively identified with the methods studied in this paper. A Support Vector Machine (SVM) classifier achieved best performance with an overall Fmeasure of 0.9866 when evaluated on a set of 5,000 freetext death certificates using the token stem feature set. The SNOMED CT concept plus token stem feature set reached the lowest variance (0.0032) and false negative rate (0.0297) while achieving an F-measure of 0.9864. The SVM classifier accounts for the first 18 of the top 40 evaluated runs, and entails the most robust classifier with a variance of 0.001141, half the variance of the other classifiers.

CONCLUSION

The selection of features significantly produced the most influences on the performance of the classifiers, although the type of classifier employed also affects performance. In contrast, the feature weighting schema created a negligible effect on performance. Specifically, it is found that stemmed tokens with or without SNOMED CT concepts create the most effective feature when combined with an SVM classifier.

摘要

背景

癌症监测与预防依赖于及时通报癌症病例这一关键环节。然而,从病理报告及其他相关文件(如死亡证明)的自由文本中提取和分类癌症信息,是复杂且耗时的工作。

目的

本文研究了从提供给癌症登记处的自由文本死亡证明中自动检测应通报癌症病例作为死亡原因的方法。

方法

研究了多种机器学习分类器。使用自然语言技术和Medtex工具包提取特征。众多特征包括词干、双词搭配以及来自SNOMED CT医学术语的概念。基线由一个关键词识别器组成,该识别器使用从ICD - 10癌症相关代码的详细描述中提取的关键词。

结果

本文研究的方法能够有效识别将应通报癌症列为死亡原因的死亡证明。在一组5000份自由文本死亡证明上使用词干特征集进行评估时,支持向量机(SVM)分类器表现最佳,总体F值为0.9866。SNOMED CT概念加词干特征集的方差最低(0.0032),假阴性率最低(0.0297),同时F值达到0.9864。SVM分类器在40次评估运行中排名前18,是最稳健的分类器,方差为0.001141,是其他分类器方差的一半。

结论

特征的选择对分类器性能的影响最为显著,尽管所采用的分类器类型也会影响性能。相比之下,特征加权方案对性能的影响可忽略不计。具体而言,发现带有或不带有SNOMED CT概念的词干与SVM分类器结合时能产生最有效的特征。