Fitting Birth-Death Processes to Panel Data with Applications to Bacterial DNA Fingerprinting.

Author information

Doss Charles R, Suchard Marc A, Holmes Ian, Kato-Maeda Midori, Minin Vladimir N

Affiliations

University of Washington, Seattle.

University of California, Los Angeles.

Publication information

Ann Appl Stat. 2013;7(4):2315-2335. doi: 10.1214/13-AOAS673.

Abstract

Continuous-time linear birth-death-immigration (BDI) processes are frequently used in ecology and epidemiology to model stochastic dynamics of the population of interest. In clinical settings, multiple birth-death processes can describe disease trajectories of individual patients, allowing for estimation of the effects of individual covariates on the birth and death rates of the process. Such estimation is usually accomplished by analyzing patient data collected at unevenly spaced time points, referred to as panel data in the biostatistics literature. Fitting linear BDI processes to panel data is a nontrivial optimization problem because birth and death rates can be functions of many parameters related to the covariates of interest. We propose a novel expectation-maximization (EM) algorithm for fitting linear BDI models with covariates to panel data. We derive a closed-form expression for the joint generating function of some of the BDI process statistics and use this generating function to reduce the E-step of the EM algorithm, as well as calculation of the Fisher information, to one-dimensional integration. This analytical technique yields a computationally efficient and robust optimization algorithm that we implemented in an open-source R package. We apply our method to DNA fingerprinting of Mycobacterium tuberculosis, the causative agent of tuberculosis, to study intrapatient time evolution of IS6110 copy number, a genetic marker frequently used during estimation of epidemiological clusters of Mycobacterium tuberculosis infections. Our analysis reveals previously undocumented differences in IS6110 birth-death rates among three major lineages of Mycobacterium tuberculosis, which has important implications for epidemiologists who use IS6110 for DNA fingerprinting of Mycobacterium tuberculosis.
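To make the setting concrete, the following is a minimal sketch of what panel data from a linear BDI process looks like. It assumes the standard linear BDI parameterization (in state x, births occur at rate x·λ + ν and deaths at rate x·μ) and simulates one trajectory with a Gillespie-style loop, then records the state only at a few unevenly spaced observation times. This is not the authors' EM algorithm or their R package; the function names `simulate_bdi` and `observe_panel` and all parameter values are hypothetical, chosen only for illustration.

```python
import bisect
import random


def simulate_bdi(x0, lam, mu, nu, t_end, rng):
    """Simulate one path of a linear BDI process on [0, t_end].

    Assumed rates in state x: birth = x * lam + nu, death = x * mu.
    Returns (jump_times, states); the path is piecewise constant and
    states[k] holds from jump_times[k] until the next jump.
    """
    t, x = 0.0, x0
    times, states = [0.0], [x0]
    while True:
        birth_rate = x * lam + nu
        death_rate = x * mu
        total = birth_rate + death_rate
        if total == 0.0:
            break  # absorbing state: no further events can occur
        t += rng.expovariate(total)  # exponential waiting time to next event
        if t > t_end:
            break
        # Choose a birth with probability birth_rate / total, else a death.
        if rng.random() < birth_rate / total:
            x += 1
        else:
            x -= 1
        times.append(t)
        states.append(x)
    return times, states


def observe_panel(times, states, obs_times):
    """Return the state at each observation time (panel data)."""
    return [states[bisect.bisect_right(times, s) - 1] for s in obs_times]


if __name__ == "__main__":
    rng = random.Random(2013)
    # Hypothetical rates and observation schedule, for illustration only.
    jump_times, path = simulate_bdi(x0=10, lam=0.02, mu=0.03, nu=0.01,
                                    t_end=24.0, rng=rng)
    print(observe_panel(jump_times, path, [0.0, 1.5, 7.0, 24.0]))
```

Only the panel values would be available to an analyst; the jump times and intermediate states are unobserved, which is why the abstract describes an EM algorithm whose E-step takes expectations of statistics of the hidden path.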
