GHS-NET：一种用于多标签生物医学文本分类的通用混合浅层神经网络。

GHS-NET a generic hybridized shallow neural network for multi-label biomedical text classification.

机构信息

Intelligent Criminology Research Lab, National Center of Artificial Intelligence, Al-Khawarizmi Institute of Computer Science, UET, Lahore, Pakistan; German Research Center for Artificial Intelligence (DFKI), 67663 Kaiserslautern, Germany.

Intelligent Criminology Research Lab, National Center of Artificial Intelligence, Al-Khawarizmi Institute of Computer Science, UET, Lahore, Pakistan; Department of Computer Science, University of Engineering and Technology (UET), Lahore, Pakistan.

出版信息

J Biomed Inform. 2021 Apr;116:103699. doi: 10.1016/j.jbi.2021.103699. Epub 2021 Feb 15.

DOI:10.1016/j.jbi.2021.103699

PMID:33601013

Abstract

Exponential growth of biomedical literature and clinical data demands more robust yet precise computational methodologies to extract useful insights from biomedical literature and to perform accurate assignment of disease-specific codes. Such approaches can largely enhance the effectiveness of diverse biomedicine and bioinformatics applications. State-of-the-art computational biomedical text classification methodologies either solely leverage discrimintaive features extracted through convolution operations performed by deep convolutional neural network or contextual information extracted by recurrent neural network. However, none of the methodology takes advantage of both convolutional and recurrent neural networks. Further, existing methodologies lack to produce decent performance for the classification of different genre biomedical text such as biomedical literature or clinical notes. We, for the very first time, present a generic deep learning based hybrid multi-label classification methodology namely GHS-NET which can be utilized to accurately classify biomedical text of diverse genre. GHS-NET makes use of convolutional neural network to extract most discriminative features and bi-directional Long Short-Term Memory to acquire contextual information. GHS-NET effectiveness is evaluated for extreme multi-label biomedical literature classification and assignment of ICD-9 codes to clinical notes. For the task of extreme multi-label biomedical literature classification, performance comparison of GHS-Net and state-of-the-art deep learning based methodology reveals that GHS-Net marks the increment of 1%, 6%, and 1% for hallmarks of cancer dataset, 10%, 16%, and 11% for chemical exposure dataset in terms of precision, recall, and F1-score. For the task of clinical notes classification, GHS-Net outperforms previous best deep learning based methodology over Medical Information Mart for Intensive Care dataset (MIMIC-III) by the significant margin of 6%, 8% in terms of recall and F1-score. GHS-NET is available as a web service at and potentially can be used to accurately classify multi-variate disease and chemical exposure specific text.

摘要

生物医学文献和临床数据呈指数级增长，这就需要更强大、更精确的计算方法，以便从生物医学文献中提取有用的见解，并准确分配特定疾病的代码。这些方法可以大大提高各种生物医学和生物信息学应用的效果。最先进的计算生物医学文本分类方法要么仅利用通过深度卷积神经网络执行的卷积操作提取的判别特征，要么利用递归神经网络提取的上下文信息。然而，这些方法都没有利用卷积神经网络和递归神经网络。此外，现有的方法在对不同类型的生物医学文本（如生物医学文献或临床记录）进行分类时，性能不佳。我们首次提出了一种通用的深度学习混合多标签分类方法，即 GHS-NET，它可以用于准确地对不同类型的生物医学文本进行分类。GHS-NET 利用卷积神经网络提取最具判别力的特征，利用双向长短时记忆网络获取上下文信息。我们评估了 GHS-NET 在极端多标签生物医学文献分类和 ICD-9 代码分配到临床记录中的有效性。在极端多标签生物医学文献分类任务中，GHS-Net 与最先进的深度学习方法的性能比较表明，在癌症标志数据集方面，GHS-Net 的精度、召回率和 F1 得分分别提高了 1%、6%和 1%，在化学暴露数据集方面，分别提高了 10%、16%和 11%。在临床记录分类任务中，GHS-Net 在 MIMIC-III 数据集上的表现优于以前最好的基于深度学习的方法，在召回率和 F1 得分方面分别高出 6%和 8%。GHS-NET 可作为网络服务使用，并有可能用于准确分类多变量疾病和化学暴露特定文本。

相似文献

GHS-NET a generic hybridized shallow neural network for multi-label biomedical text classification.GHS-NET：一种用于多标签生物医学文本分类的通用混合浅层神经网络。

J Biomed Inform. 2021 Apr;116:103699. doi: 10.1016/j.jbi.2021.103699. Epub 2021 Feb 15.

ML-Net: multi-label classification of biomedical texts with deep neural networks.ML-Net：基于深度神经网络的生物医学文本多标签分类

J Am Med Inform Assoc. 2019 Nov 1;26(11):1279-1285. doi: 10.1093/jamia/ocz085.

A comparative study on deep learning models for text classification of unstructured medical notes with various levels of class imbalance.深度学习模型在不同类别不平衡程度的非结构化医疗记录文本分类中的对比研究。

BMC Med Res Methodol. 2022 Jul 2;22(1):181. doi: 10.1186/s12874-022-01665-y.

Medical code prediction via capsule networks and ICD knowledge.基于胶囊网络和 ICD 知识的医疗编码预测。

BMC Med Inform Decis Mak. 2021 Jul 30;21(Suppl 2):55. doi: 10.1186/s12911-021-01426-9.

Boosting ICD multi-label classification of health records with contextual embeddings and label-granularity.利用上下文嵌入和标签粒度增强 ICD 多标签健康记录分类。

Comput Methods Programs Biomed. 2020 May;188:105264. doi: 10.1016/j.cmpb.2019.105264. Epub 2019 Dec 10.

An empirical evaluation of deep learning for ICD-9 code assignment using MIMIC-III clinical notes.基于 MIMIC-III 临床记录的深度学习方法在 ICD-9 编码任务中的实证评估

Comput Methods Programs Biomed. 2019 Aug;177:141-153. doi: 10.1016/j.cmpb.2019.05.024. Epub 2019 May 25.

Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing.利用基于深度学习的自然语言处理技术从非结构化电子健康记录中分类社会健康决定因素。

J Biomed Inform. 2022 Mar;127:103984. doi: 10.1016/j.jbi.2021.103984. Epub 2022 Jan 7.

Automatic ICD-10-CM coding via Lambda-Scaled attention based deep learning model.基于 Lambda 缩放注意力的深度学习模型实现自动 ICD-10-CM 编码。

Methods. 2024 Feb;222:19-27. doi: 10.1016/j.ymeth.2023.11.017. Epub 2023 Dec 21.

GCDN-Net: Garbage classifier deep neural network for recyclable urban waste management.GCDN-Net：用于可回收城市废物管理的垃圾分类器深度神经网络。

Waste Manag. 2024 Feb 15;174:439-450. doi: 10.1016/j.wasman.2023.12.014. Epub 2023 Dec 19.

Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification.卷积神经网络 (CNN) 和循环神经网络 (RNN) 架构在放射学文本报告分类中的比较效果。

Artif Intell Med. 2019 Jun;97:79-88. doi: 10.1016/j.artmed.2018.11.004. Epub 2018 Nov 23.

引用本文的文献

[Medical text classification model integrating medical entity label semantics].整合医学实体标签语义的医学文本分类模型

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2025 Apr 25;42(2):326-333. doi: 10.7507/1001-5515.202408001.

Health Inf Sci Syst. 2023 Nov 16;11(1):54. doi: 10.1007/s13755-023-00254-7. eCollection 2023 Dec.

Consumers' Opinions towards Public Health Effects of Online Games: An Empirical Study Based on Social Media Comments in China.消费者对网络游戏公共卫生影响的看法：基于中国社交媒体评论的实证研究。

Int J Environ Res Public Health. 2022 Oct 6;19(19):12793. doi: 10.3390/ijerph191912793.

Automatic Film Label Acquisition Method Based on Improved Neural Networks Optimized by Mutation Ant Colony Algorithm.基于改进的神经网络和变异蚁群算法优化的自动胶片标签获取方法。

Comput Intell Neurosci. 2021 Oct 11;2021:7158051. doi: 10.1155/2021/7158051. eCollection 2021.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

GHS-NET：一种用于多标签生物医学文本分类的通用混合浅层神经网络。

GHS-NET a generic hybridized shallow neural network for multi-label biomedical text classification.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献