Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks.

Affiliations

College of Computer Science and Technology, Huaqiao University, Xiamen, 361021, China.

Research Department, Zhiye software, Xiamen, 361021, China.

Publication information

BMC Bioinformatics. 2019 Feb 1;20(1):62. doi: 10.1186/s12859-019-2617-8.

DOI:10.1186/s12859-019-2617-8

PMID:30709336

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6359854/

Abstract

BACKGROUND

Benefiting from big data, powerful computation and new algorithmic techniques, we have been witnessing the renaissance of deep learning, particularly the combination of natural language processing (NLP) and deep neural networks. The advent of electronic medical records (EMRs) has not only changed the format of medical records but also helped users to obtain information faster. However, research that works directly with Chinese EMRs faces many challenges: the records are low in quality, huge in quantity, imbalanced, and semi-structured or unstructured, and the Chinese language is considerably denser than English. Therefore, effective word segmentation, word representation and model architecture are the core technologies in the literature on Chinese EMRs.

RESULTS

In this paper, we propose a deep learning framework for intelligent diagnosis from Chinese EMR data, which applies a convolutional neural network (CNN) to EMR classification. The novelty of this paper is threefold: (1) We construct a pediatric medical dictionary based on Chinese EMRs. (2) We use word2vec word embeddings to represent the semantic content of Chinese EMRs. (3) We construct a fine-tuned CNN model that performs pediatric diagnosis from Chinese EMR data. Our results on real-world pediatric Chinese EMRs show that the average accuracy and F1-score of the CNN models are up to 81%, which indicates the effectiveness of the CNN model for the classification of EMRs. In particular, a fine-tuned one-layer CNN performs best among all CNN, recurrent neural network (RNN) (long short-term memory, gated recurrent unit) and CNN-RNN models, with average accuracy and F1-score both up to 83%.
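Contribution (1) matters because a segmenter can only keep a clinical term whole if it knows the term. As a minimal sketch of that effect, the snippet below uses forward maximum matching, a simple dictionary-driven segmentation technique chosen here purely for illustration; the abstract does not say which segmenter the authors used. The function name `segment`, the toy dictionaries and the sample sentence are all hypothetical.

```python
def segment(text, dictionary):
    """Forward maximum matching (illustrative, not the authors' method).

    At each position, take the longest dictionary entry that matches;
    if nothing matches, emit a single character and move on.
    """
    max_len = max((len(word) for word in dictionary), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking down to one character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in dictionary:
                tokens.append(piece)
                i += size
                break
    return tokens


# Hypothetical toy vocabularies.
# 患儿 = child patient, 咳嗽 = cough, 发热 = fever, 支气管肺炎 = bronchopneumonia
general = {"患儿", "咳嗽", "发热"}
medical = general | {"支气管肺炎"}

text = "患儿咳嗽发热支气管肺炎"
print(segment(text, general))  # the disease name falls apart into single characters
print(segment(text, medical))  # the disease name survives as one token
```

With only the general vocabulary, the disease name is split into five single-character tokens that carry little meaning on their own; with the medical entry added, it reaches the embedding step as one word.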

CONCLUSION

The CNN framework, which comprises word segmentation, word embedding and model training, can serve as an intelligent auxiliary diagnosis tool for pediatricians. In particular, a fine-tuned one-layer CNN performs well, which suggests that word order does not appear to have a useful effect on the classification of our Chinese EMRs.
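To make the one-layer CNN concrete, here is a minimal pure-Python sketch of a single convolution filter followed by ReLU and max-over-time pooling, the usual building block of a one-layer text CNN. It is not the authors' model: the function name `conv_max_over_time`, the two-dimensional "embeddings" and the filter weights are hypothetical toy values. The sketch shows one relevant property of this architecture: the pooled feature records whether a short phrase occurs, not where it occurs in the record.

```python
def conv_max_over_time(embeddings, filt, bias=0.0):
    """One convolution filter + ReLU + max-over-time pooling.

    embeddings: list of word vectors, one per token.
    filt: list of weight vectors, one per position in the filter window.
    Returns a single pooled feature value for the whole sequence.
    """
    width = len(filt)
    best = 0.0  # ReLU floor: negative activations cannot win the max
    for start in range(len(embeddings) - width + 1):
        activation = bias
        for k in range(width):
            activation += sum(w * x for w, x in zip(filt[k], embeddings[start + k]))
        best = max(best, activation)
    return best


# Hypothetical 2-d embeddings for three token types.
fever, cough, other = [1.0, 0.0], [0.0, 1.0], [0.0, 0.0]

# A width-2 filter that responds most strongly to "fever" followed by "cough".
filt = [[1.0, 0.0], [0.0, 1.0]]

early = [fever, cough, other, other]    # phrase at the start
late = [other, other, fever, cough]     # same phrase at the end
swapped = [cough, fever, other, other]  # local order reversed

print(conv_max_over_time(early, filt))    # 2.0
print(conv_max_over_time(late, filt))     # 2.0
print(conv_max_over_time(swapped, filt))  # 1.0
```

Moving the phrase leaves the pooled feature unchanged, while reversing it inside the window lowers the response. A model of this kind therefore keeps only local word order within each filter window and discards position across the record, unlike an RNN, which processes the sequence step by step.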

Figure 1 (image): https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f51a/6359854/5dd245747425/12859_2019_2617_Fig1_HTML.jpg
