Murray Christopher J L, Lopez Alan D, Feehan Dennis M, Peter Shanon T, Yang Gonghuan
Institute for Health Metrics and Evaluation, University of Washington, Seattle, Washington, United States of America.
PLoS Med. 2007 Nov 20;4(11):e327. doi: 10.1371/journal.pmed.0040327.
Cause of death data are a critical input to formulating good public health policy. In the absence of reliable vital registration data, information collected after death from household members, called verbal autopsy (VA), is commonly used to study causes of death. VA data are usually analyzed by physician-coded verbal autopsy (PCVA). PCVA is expensive and its comparability across regions is questionable. Nearly all validation studies of PCVA have allowed physicians access to information collected from the household members' recall of medical records or contact with health services, thus exaggerating accuracy of PCVA in communities where few deaths had any interaction with the health system. In this study we develop and validate a statistical strategy for analyzing VA data that overcomes the limitations of PCVA.
We propose and validate a method that combines the advantages of methods proposed by King and Lu, and Byass, which we term the symptom pattern (SP) method. The SP method uses two sources of VA data. First, it requires a dataset for which we know the true cause of death, but which need not be representative of the population of interest; this dataset might come from deaths that occur in a hospital. The SP method can then be applied to a second VA sample that is representative of the population of interest. From the hospital data we compute the properties of each symptom; that is, the probability of responding yes to each symptom, given the true cause of death. These symptom properties allow us first to estimate the population-level cause-specific mortality fractions (CSMFs), and to then use the CSMFs as an input in assigning a cause of death to each individual VA response. Finally, we use our individual cause-of-death assignments to refine our population-level CSMF estimates. The results from applying our method to data collected in China are promising. At the population level, SP estimates the CSMFs with 16% average relative error and 0.7% average absolute error, while PCVA results in 27% average relative error and 1.1% average absolute error. At the individual level, SP assigns the correct cause of death in 83% of the cases, while PCVA does so for 69% of the cases. We also compare the results of SP and PCVA when both methods have restricted access to the information from the medical record recall section of the VA instrument. At the population level, without medical record recall, the SP method estimates the CSMFs with 14% average relative error and 0.6% average absolute error, while PCVA results in 70% average relative error and 3.2% average absolute error. For individual estimates without medical record recall, SP assigns the correct cause of death in 78% of cases, while PCVA does so for 38% of cases.
Our results from the data collected in China suggest that the SP method outperforms PCVA, both at the population and especially at the individual level. Further study is needed on additional VA datasets in order to continue validation of the method, and to understand how the symptom properties vary as a function of culture, language, and other factors. Our results also suggest that PCVA relies heavily on household recall of medical records and related information, limiting its applicability in low-resource settings. SP does not require that additional information to adequately estimate causes of death.
死因数据是制定良好公共卫生政策的关键输入信息。在缺乏可靠的人口动态登记数据的情况下,从家庭成员处收集的死后信息,即所谓的口头尸检(VA),通常用于研究死因。VA数据通常通过医生编码的口头尸检(PCVA)进行分析。PCVA成本高昂,且其在不同地区的可比性存疑。几乎所有PCVA的验证研究都允许医生获取从家庭成员对医疗记录的回忆或与卫生服务接触中收集的信息,因此夸大了PCVA在很少有死亡与卫生系统有任何互动的社区中的准确性。在本研究中,我们开发并验证了一种用于分析VA数据的统计策略,该策略克服了PCVA的局限性。
我们提出并验证了一种结合了King和Lu以及Byass所提出方法优点的方法,我们将其称为症状模式(SP)法。SP法使用两种VA数据源。首先,它需要一个我们知道真实死因的数据集,但该数据集不必代表感兴趣的人群;这个数据集可能来自医院发生的死亡病例。然后,SP法可应用于代表感兴趣人群的第二个VA样本。从医院数据中,我们计算每个症状的属性;也就是说,给定真实死因时对每个症状回答“是”的概率。这些症状属性首先使我们能够估计人群水平的特定病因死亡率(CSMF),然后将CSMF用作给每个个体VA回答分配死因的输入。最后,我们使用个体死因分配来完善我们的人群水平CSMF估计。将我们的方法应用于在中国收集的数据所得到的结果很有前景。在人群水平上,SP估计CSMF的平均相对误差为16%,平均绝对误差为0.7%,而PCVA的平均相对误差为27%,平均绝对误差为1.1%。在个体水平上,SP在83%的病例中正确分配了死因,而PCVA在69%的病例中做到了这一点。我们还比较了SP和PCVA在两种方法都限制获取VA工具中医疗记录回忆部分信息时的结果。在人群水平上,没有医疗记录回忆时,SP法估计CSMF的平均相对误差为14%,平均绝对误差为0.6%,而PCVA的平均相对误差为70%,平均绝对误差为3.2%。对于没有医疗记录回忆的个体估计,SP在78%的病例中正确分配了死因,而PCVA在38%的病例中做到了这一点。
我们从中国收集的数据得出的结果表明,SP法在人群水平尤其是个体水平上优于PCVA。需要对更多的VA数据集进行进一步研究,以继续验证该方法,并了解症状属性如何随文化、语言和其他因素而变化。我们的结果还表明,PCVA严重依赖家庭成员对医疗记录和相关信息的回忆,限制了其在资源匮乏环境中的适用性。SP不需要这些额外信息就能充分估计死因。