Suppr超能文献

通过宽深模型推进潜在死因推断

Advancing Underlying Cause of Death Inference Through Wide and Deep Model.

作者信息

Fang Xin, Huang Shaofen, Yin Yanrong, Chen Tiehui, Liao Zhijun, Zhong Wenling

机构信息

Department for Chronic and Noncommunicable Disease Control and Prevention, Fujian Provincial Center for Disease Control and Prevention, Fuzhou City, Fujian Province, China.

Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fujian Medical University, Fuzhou City, Fujian Province, China.

出版信息

China CDC Wkly. 2024 May 24;6(21):487-492. doi: 10.46234/ccdcw2024.094.

Abstract

INTRODUCTION

Accurately filling out death certificates is essential for death surveillance. However, manually determining the underlying cause of death is often imprecise. In this study, we investigate the Wide and Deep framework as a method to improve the accuracy and reliability of inferring the underlying cause of death.

METHODS

Death report data from national-level cause of death surveillance sites in Fujian Province from 2016 to 2022, involving 403,547 deaths, were analyzed. The Wide and Deep embedded with Convolutional Neural Networks (CNN) was developed. Model performance was assessed using weighted accuracy, weighted precision, weighted recall, and weighted area under the curve (AUC). A comparison was made with XGBoost, CNN, Gated Recurrent Unit (GRU), Transformer, and GRU with Attention.

RESULTS

The Wide and Deep achieved strong performance metrics on the test set: precision of 95.75%, recall of 92.08%, F1 Score of 93.78%, and an AUC of 95.99%. The model also displayed specific F1 Scores for different cause-of-death chain lengths: 97.13% for single causes, 95.08% for double causes, 91.24% for triple causes, and 79.50% for quadruple causes.

CONCLUSIONS

The Wide and Deep significantly enhances the ability to determine the root causes of death, providing a valuable tool for improving cause-of-death surveillance quality. Integrating artificial intelligence (AI) in this field is anticipated to streamline death registration and reporting procedures, thereby boosting the precision of public health data.

摘要

引言

准确填写死亡证明对于死亡监测至关重要。然而,手动确定根本死因往往不够精确。在本研究中,我们探讨了宽深框架作为一种提高推断根本死因准确性和可靠性的方法。

方法

分析了福建省2016年至2022年国家级死因监测点的403547例死亡报告数据。开发了嵌入卷积神经网络(CNN)的宽深模型。使用加权准确率、加权精确率、加权召回率和加权曲线下面积(AUC)评估模型性能。并与XGBoost、CNN、门控循环单元(GRU)、Transformer以及带注意力机制的GRU进行了比较。

结果

宽深模型在测试集上取得了优异的性能指标:精确率为95.75%,召回率为92.08%,F1分数为93.78%,AUC为95.99%。该模型还针对不同死因链长度显示了特定的F1分数:单一死因的F1分数为97.13%,双重死因的为95.08%,三重死因的为91.24%,四重死因的为79.50%。

结论

宽深模型显著提高了确定根本死因的能力,为提高死因监测质量提供了有价值的工具。预计将人工智能(AI)整合到该领域将简化死亡登记和报告程序,从而提高公共卫生数据的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5bd/11154107/548edbd20f8c/ccdcw-6-21-487-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验