Cappello Lorenzo, Palacios Julia A
Department of Statistics, Stanford University, Stanford, CA.
Department of Biomedical Data Science, Stanford Medicine, Stanford, CA.
J Comput Graph Stat. 2022;31(2):541-552. doi: 10.1080/10618600.2021.1987256. Epub 2021 Nov 29.
Longitudinal molecular data of rapidly evolving viruses and pathogens provide information about disease spread and complement traditional surveillance approaches based on case count data. The coalescent is used to model the genealogy that represents the sample ancestral relationships. The basic assumption is that coalescent events occur at a rate inversely proportional to the effective population size (), a time-varying measure of genetic diversity. When the sampling process (collection of samples over time) depends on (), the coalescent and the sampling processes can be jointly modeled to improve estimation of (). Failing to do so can lead to bias due to model misspecification. However, the way that the sampling process depends on the effective population size may vary over time. We introduce an approach where the sampling process is modeled as an inhomogeneous Poisson process with rate equal to the product of () and a time-varying coefficient, making minimal assumptions on their functional shapes via Markov random field priors. We provide efficient algorithms for inference, show the model performance vis-a-vis alternative methods in a simulation study, and apply our model to SARS-CoV-2 sequences from Los Angeles and Santa Clara counties. The methodology is implemented and available in the R package adapref. Supplementary files for this article are available online.
快速进化的病毒和病原体的纵向分子数据提供了有关疾病传播的信息,并补充了基于病例数数据的传统监测方法。合并过程用于对代表样本祖先关系的谱系进行建模。基本假设是合并事件发生的速率与有效种群大小()成反比,有效种群大小是遗传多样性的一种随时间变化的度量。当抽样过程(随时间收集样本)依赖于()时,可以对合并过程和抽样过程进行联合建模,以改进对()的估计。否则可能会由于模型设定错误而导致偏差。然而,抽样过程依赖于有效种群大小的方式可能会随时间变化。我们引入一种方法,将抽样过程建模为一个非齐次泊松过程,其速率等于()与一个随时间变化的系数的乘积,并通过马尔可夫随机场先验对它们的函数形状做出最小假设。我们提供了有效的推理算法,在模拟研究中展示了相对于其他方法的模型性能,并将我们的模型应用于洛杉矶县和圣克拉拉县的SARS-CoV-2序列。该方法已在R包adapref中实现并可用。本文的补充文件可在线获取。