Pita Ferreira Patrícia, Godinho Simões Diogo, Pinto de Carvalho Constança, Duarte Francisco, Fernandes Eugénia, Casaca Carvalho Pedro, Loff José Francisco, Soares Ana Paula, Albuquerque Maria João, Pinto-Leite Pedro, Peralta-Santos André
Direção de Serviços de Informação e Análise, Direção-Geral da Saúde, Lisbon, Portugal.
Unidade de Saúde Pública Zé Povinho, Agrupamento de Centros de Saúde do Oeste Norte, Administração Regional de Saúde de Lisboa e Vale do Tejo, Caldas da Rainha, Portugal.
JMIR AI. 2023 Nov 22;2:e40965. doi: 10.2196/40965.
In 2021, the European Union reported >270,000 excess deaths, including >16,000 in Portugal. The Portuguese Directorate-General of Health developed a deep neural network, AUTOCOD, which determines the primary causes of death by analyzing the free text of physicians' death certificates (DCs). Although AUTOCOD's performance has been established, it remains unclear whether its performance remains consistent over time, particularly during periods of excess mortality.
This study aims to assess the sensitivity and other performance metrics of AUTOCOD in classifying underlying causes of death compared with manual coding to identify specific causes of death during periods of excess mortality.
We included all DCs between 2016 and 2019. AUTOCOD's performance was evaluated by calculating various performance metrics, such as sensitivity, specificity, positive predictive value (PPV), and F-score, using a confusion matrix. This compared International Statistical Classification of Diseases and Health-Related Problems, 10th Revision (ICD-10), classifications of DCs by AUTOCOD with those by human coders at the Directorate-General of Health (gold standard). Subsequently, we compared periods without excess mortality with periods of excess, severe, and extreme excess mortality. We defined excess mortality as 2 consecutive days with a Z score above the 95% baseline limit, severe excess mortality as 2 consecutive days with a Z score >4 SDs, and extreme excess mortality as 2 consecutive days with a Z score >6 SDs. Finally, we repeated the analyses for the 3 most common ICD-10 chapters focusing on block-level classification.
We analyzed a large data set comprising 330,098 DCs classified by both human coders and AUTOCOD. AUTOCOD demonstrated high sensitivity (≥0.75) for 10 ICD-10 chapters examined, with values surpassing 0.90 for the more prevalent chapters (chapter II-"Neoplasms," chapter IX-"Diseases of the circulatory system," and chapter X-"Diseases of the respiratory system"), accounting for 67.69% (223,459/330,098) of all human-coded causes of death. No substantial differences were observed in these high-sensitivity values when comparing periods without excess mortality with periods of excess, severe, and extreme excess mortality. The same holds for specificity, which exceeded 0.96 for all chapters examined, and for PPV, which surpassed 0.75 in 9 chapters, including the more prevalent ones. When considering block classification within the 3 most common ICD-10 chapters, AUTOCOD maintained a high performance, demonstrating high sensitivity (≥0.75) for 13 ICD-10 blocks, high PPV for 9 blocks, and specificity of >0.98 in all blocks, with no significant differences between periods without excess mortality and those with excess mortality.
Our findings indicate that, during periods of excess and extreme excess mortality, AUTOCOD's performance remains unaffected by potential text quality degradation because of pressure on health services. Consequently, AUTOCOD can be dependably used for real-time cause-specific mortality surveillance even in extreme excess mortality situations.
2021年,欧盟报告了超过27万例超额死亡病例,其中葡萄牙有超过1.6万例。葡萄牙卫生总局开发了一种深度神经网络AUTOCOD,通过分析医生死亡证明(DC)的自由文本确定主要死因。尽管AUTOCOD的性能已经得到验证,但尚不清楚其性能是否随时间保持一致,特别是在超额死亡率期间。
本研究旨在评估AUTOCOD在与人工编码相比时对潜在死因进行分类的敏感性和其他性能指标,以识别超额死亡率期间的特定死因。
我们纳入了所有2016年至2019年期间的DC。通过使用混淆矩阵计算各种性能指标,如敏感性、特异性、阳性预测值(PPV)和F分数,来评估AUTOCOD的性能。这将AUTOCOD对DC的国际疾病和相关健康问题统计分类第10版(ICD-10)分类与卫生总局的人工编码员的分类(金标准)进行了比较。随后,我们将无超额死亡率时期与超额、严重和极端超额死亡率时期进行了比较。我们将超额死亡率定义为Z分数连续2天高于95%基线限值,严重超额死亡率定义为Z分数>4个标准差的连续2天,极端超额死亡率定义为Z分数>6个标准差的连续2天。最后,我们针对3个最常见的ICD-10章节重复分析,重点关注块级分类。
我们分析了一个由人工编码员和AUTOCOD分类的330,098份DC组成的大数据集。AUTOCOD在所检查的10个ICD-10章节中表现出高敏感性(≥0.75),对于更常见的章节(第二章 - “肿瘤”、第九章 - “循环系统疾病”和第十章 - “呼吸系统疾病”),其值超过0.90,占所有人工编码死因的67.69%(223,459/330,098)。在比较无超额死亡率时期与超额、严重和极端超额死亡率时期时,这些高敏感性值未观察到实质性差异。所有检查章节的特异性均超过0.96,PPV在9个章节中超过0.75,包括更常见的章节,情况也是如此。当考虑3个最常见的ICD-10章节内的块分类时,AUTOCOD保持了高性能,在13个ICD-10块中表现出高敏感性(≥0.75),9个块的PPV高,所有块的特异性>0.98,无超额死亡率时期与有超额死亡率时期之间无显著差异。
我们的研究结果表明,在超额和极端超额死亡率期间,AUTOCOD的性能不受卫生服务压力导致的潜在文本质量下降的影响。因此,即使在极端超额死亡率情况下,AUTOCOD也可可靠地用于实时特定病因死亡率监测。