Department of Statistical Science, Duke University, Durham, North Carolina, USA.
Department of Neurology, Duke University, Durham, North Carolina, USA.
Alzheimers Dement. 2021 Apr;17(4):561-573. doi: 10.1002/alz.12223. Epub 2021 Jan 21.
The study of Alzheimer's disease (AD) has revealed biological pathways with implications for disease neuropathology and pathophysiology. These pathway-level effects may also be mediated by individual characteristics or covariates such as age or sex. Evaluation of AD biological pathways in the context of interactions with these covariates is critical to the understanding of AD as well as the development of model systems used to study the disease.
Gene set enrichment methods are powerful tools used to interpret gene-level statistics at the level of biological pathways. We introduce a method for quantifying gene set enrichment using likelihood ratio-derived test statistics (gsLRT), which accounts for sample covariates like age and sex. We then use our method to test for age and sex interactions with protein expression levels in AD and to compare the pathway results between human and mouse species.
Our method, based on nested logistic regressions is competitive with the existing standard for gene set testing in the context of linear models and complex experimental design. The gene sets we identify as having a significant association with AD-both with and without additional covariate interactions-are validated by previous studies. Differences between gsLRT results on mouse and human datasets are observed.
Characterizing biological pathways involved in AD builds on the important work involving single gene drivers. Our gene set enrichment method finds pathways that are significantly related to AD while accounting for covariates that may be relevant to disease development. The method highlights commonalities and differences between human AD and mouse models, which may inform the development of higher fidelity models for the study of AD.
阿尔茨海默病(AD)的研究揭示了对疾病神经病理学和病理生理学有影响的生物学途径。这些途径水平的效应也可能由个体特征或协变量(如年龄或性别)介导。在考虑这些协变量与交互作用的情况下,评估 AD 生物学途径对于理解 AD 以及开发用于研究该疾病的模型系统至关重要。
基因集富集方法是一种强大的工具,用于在生物学途径水平上解释基因水平的统计数据。我们引入了一种使用似然比衍生的检验统计量(gsLRT)量化基因集富集的方法,该方法考虑了年龄和性别等样本协变量。然后,我们使用该方法测试 AD 中蛋白质表达水平与年龄和性别的相互作用,并比较人类和小鼠物种之间的途径结果。
我们的方法基于嵌套逻辑回归,在线性模型和复杂实验设计的背景下与现有的基因集检验标准具有竞争力。我们确定的与 AD 具有显著关联的基因集(包括和不包括额外的协变量相互作用)都得到了先前研究的验证。在小鼠和人类数据集上的 gsLRT 结果之间观察到差异。
AD 涉及的生物学途径的特征建立在涉及单个基因驱动因素的重要工作之上。我们的基因集富集方法在考虑可能与疾病发展相关的协变量的情况下,找到了与 AD 显著相关的途径。该方法突出了人类 AD 和小鼠模型之间的共性和差异,这可能为 AD 研究的更高保真度模型的开发提供信息。