Random forests for verbal autopsy analysis: multisite validation study using clinical diagnostic gold standards.

Affiliation

Institute for Health Metrics and Evaluation, University of Washington, 2301 Fifth Ave, Suite 600, Seattle, WA 98121, USA.

Publication information

Popul Health Metr. 2011 Aug 4;9:29. doi: 10.1186/1478-7954-9-29.

DOI:10.1186/1478-7954-9-29

PMID:21816105

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3160922/

Abstract

BACKGROUND

Computer-coded verbal autopsy (CCVA) is a promising alternative to the standard approach of physician-certified verbal autopsy (PCVA), because of its high speed, low cost, and reliability. This study introduces a new CCVA technique and validates its performance using defined clinical diagnostic criteria as a gold standard for a multisite sample of 12,542 verbal autopsies (VAs).

METHODS

The Random Forest (RF) Method from machine learning (ML) was adapted to predict cause of death by training random forests to distinguish between each pair of causes, and then combining the results through a novel ranking technique. We assessed quality of the new method at the individual level using chance-corrected concordance and at the population level using cause-specific mortality fraction (CSMF) accuracy as well as linear regression. We also compared the quality of RF to PCVA for all of these metrics. We performed this analysis separately for adult, child, and neonatal VAs. We also assessed the variation in performance with and without household recall of health care experience (HCE).
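The pairwise design described above can be illustrated with a minimal sketch. It trains one binary model per pair of causes and combines them by simple vote counting. Two things here are assumptions rather than the study's method: the stand-in learner `fit_centroid` is a nearest-centroid rule, not a random forest, and plain vote counting replaces the ranking technique, which the abstract does not specify. All function names and the toy symptom data are hypothetical.

```python
from collections import Counter
from itertools import combinations


def fit_centroid(records, labels):
    """Stand-in binary learner (NOT a random forest): nearest class centroid."""
    sums, counts = {}, Counter(labels)
    for x, y in zip(records, labels):
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
    centroids = {y: [s / counts[y] for s in acc] for y, acc in sums.items()}

    def predict(x):
        def dist(c):
            return sum((a - b) ** 2 for a, b in zip(x, c))
        # Ties are broken alphabetically by cause name.
        return min(sorted(centroids), key=lambda y: dist(centroids[y]))

    return predict


def train_pairwise(records, labels, fit_binary=fit_centroid):
    """Train one binary model for every unordered pair of causes."""
    models = {}
    for a, b in combinations(sorted(set(labels)), 2):
        pair = [(x, y) for x, y in zip(records, labels) if y in (a, b)]
        models[(a, b)] = fit_binary([x for x, _ in pair], [y for _, y in pair])
    return models


def cause_scores(models, record):
    """Count how many pairwise models vote for each cause."""
    scores = Counter()
    for predict in models.values():
        scores[predict(record)] += 1
    return scores


def predict_cause(models, record):
    """Assign the cause with the most pairwise votes (assumed combination rule)."""
    scores = cause_scores(models, record)
    return max(sorted(scores), key=lambda c: scores[c])


# Hypothetical toy data: three binary symptom indicators per death.
symptoms = [[1, 0, 0], [1, 0, 0], [0, 1, 0], [0, 1, 0], [0, 0, 1], [0, 0, 1]]
causes = ["stroke", "stroke", "malaria", "malaria", "injury", "injury"]
models = train_pairwise(symptoms, causes)
print(predict_cause(models, [1, 0, 0]))
```

With k causes this trains k(k-1)/2 models, so a real implementation would pass a random forest trainer as `fit_binary` and would likely need a more careful combination step than raw vote counts.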

RESULTS

For all metrics, for all settings, RF was as good as or better than PCVA, with the exception of a nonsignificantly lower CSMF accuracy for neonates with HCE information. With HCE, the chance-corrected concordance of RF was 3.4 percentage points higher for adults, 3.2 percentage points higher for children, and 1.6 percentage points higher for neonates. The CSMF accuracy was 0.097 higher for adults, 0.097 higher for children, and 0.007 lower for neonates. Without HCE, the chance-corrected concordance of RF was 8.1 percentage points higher than PCVA for adults, 10.2 percentage points higher for children, and 5.9 percentage points higher for neonates. The CSMF accuracy was higher for RF by 0.102 for adults, 0.131 for children, and 0.025 for neonates.
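The two metrics reported above can be computed along the following lines. The abstract does not state the formulas, so this sketch assumes the definitions commonly used in the verbal autopsy validation literature: chance-corrected concordance for a cause rescales sensitivity so that random assignment among N causes scores 0, and CSMF accuracy rescales the total absolute error in cause fractions by its worst possible value. Function names and the toy labels are hypothetical.

```python
from collections import Counter


def chance_corrected_concordance(true, pred, cause, n_causes):
    """(sensitivity - 1/N) / (1 - 1/N) for one cause; 0 means chance level."""
    assigned = [p for t, p in zip(true, pred) if t == cause]
    sensitivity = sum(p == cause for p in assigned) / len(assigned)
    return (sensitivity - 1 / n_causes) / (1 - 1 / n_causes)


def csmf(labels, causes):
    """Cause-specific mortality fractions: share of deaths per cause."""
    counts = Counter(labels)
    return {c: counts[c] / len(labels) for c in causes}


def csmf_accuracy(true, pred, causes):
    """1 - sum|true - pred| / (2 * (1 - min true fraction))."""
    t, p = csmf(true, causes), csmf(pred, causes)
    abs_error = sum(abs(t[c] - p[c]) for c in causes)
    return 1 - abs_error / (2 * (1 - min(t.values())))


# Hypothetical toy example with three causes and six deaths.
causes = ["a", "b", "c"]
true = ["a", "a", "b", "b", "c", "c"]
pred = ["a", "b", "b", "b", "c", "a"]
ccc_a = chance_corrected_concordance(true, pred, "a", len(causes))
acc = csmf_accuracy(true, pred, causes)
print(round(ccc_a, 3), round(acc, 3))
```

Under these definitions, chance-corrected concordance is an individual-level measure on a scale where 1 is perfect and 0 is chance, which is why differences are quoted in percentage points, while CSMF accuracy is a population-level measure between 0 and 1.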

CONCLUSIONS

We found that our RF Method outperformed the PCVA method in terms of chance-corrected concordance and CSMF accuracy for adult and child VA with and without HCE and for neonatal VA without HCE. It is also preferable to PCVA in terms of time and cost. Therefore, we recommend it as the technique of choice for analyzing past and current verbal autopsies.
