Shi Yushu, Zhang Liangliang, Do Kim-Anh, Jenq Robert R, Peterson Christine B
Department of Population Health Sciences, Weill Cornell Medicine, 575 Lexington Avenue, New York, NY 10065, United States.
Department of Population and Quantitative Health Sciences, Case Western Reserve University, 2103 Cornell Road, Cleveland, OH 44106, United States.
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf326.
In microbiome analysis, researchers often seek to identify taxonomic features associated with an outcome of interest. However, microbiome features are intercorrelated and linked by phylogenetic relationships, making it challenging to assess the association between an individual feature and an outcome. This paper proposes a novel conditional association test, CAT, that can account for other features and phylogenetic relatedness when testing the association between a feature and an outcome. CAT adopts a permutation approach, measuring the importance of a feature in predicting the outcome by permuting operational taxonomic unit/amplicon sequence variant counts belonging to that feature from the data and quantifying how much the association with the outcome is weakened through the change in the coefficient of determination $R^{2}$. Compared with marginal association tests, it focuses on the added value of a feature in explaining outcome variation that is not captured by other features. By leveraging global tests including PERMANOVA and MiRKAT-based methods, CAT allows association testing for continuous, binary, categorical, count, survival, and correlated outcomes. We demonstrate through simulation studies that CAT can provide a direct quantification of feature importance that is distinct from that of marginal association tests, and illustrate CAT with applications to two real-world studies on the microbiome in melanoma patients: one examining the role of the microbiome in shaping immunotherapy response, and one investigating the association between the microbiome and survival outcomes. Our results illustrate the potential of CAT to inform the design of microbiome interventions aimed at improving clinical outcomes.
在微生物组分析中,研究人员常常试图识别与感兴趣的结果相关的分类特征。然而,微生物组特征相互关联,并通过系统发育关系相联系,这使得评估单个特征与结果之间的关联具有挑战性。本文提出了一种新颖的条件关联检验方法——CAT,它在检验一个特征与结果之间的关联时,能够考虑其他特征和系统发育相关性。CAT采用一种置换方法,通过对数据中属于该特征的操作分类单元/扩增子序列变体计数进行置换,来衡量该特征在预测结果中的重要性,并量化通过决定系数(R^{2})的变化,与结果的关联减弱了多少。与边际关联检验相比,它关注的是一个特征在解释其他特征未捕捉到的结果变异方面的附加值。通过利用包括PERMANOVA和基于MiRKAT的方法在内的全局检验,CAT允许对连续、二元、分类、计数、生存和相关结果进行关联检验。我们通过模拟研究表明,CAT能够提供与边际关联检验不同的特征重要性的直接量化,并通过将CAT应用于两项关于黑色素瘤患者微生物组的真实世界研究进行说明:一项研究微生物组在塑造免疫治疗反应中的作用,另一项研究微生物组与生存结果之间的关联。我们的结果说明了CAT在为旨在改善临床结果的微生物组干预设计提供信息方面的潜力。