Litman Aviya, Sauerwald Natalie, Snyder LeeAnne Green, Foss-Feig Jennifer, Park Christopher Y, Hao Yun, Dinstein Ilan, Theesfeld Chandra L, Troyanskaya Olga G
Quantitative and Computational Biology Program, Princeton University, NJ, USA.
Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA.
medRxiv. 2024 Aug 16:2024.08.15.24312078. doi: 10.1101/2024.08.15.24312078.
Unraveling the phenotypic and genetic complexity of autism is extremely challenging yet critical for understanding the biology, inheritance, trajectory, and clinical manifestations of the many forms of the condition. Here, we leveraged broad phenotypic data from a large cohort with matched genetics to characterize classes of autism and their patterns of core, associated, and co-occurring traits, ultimately demonstrating that phenotypic patterns are associated with distinct genetic and molecular programs. We used a generative mixture modeling approach to identify robust, clinically-relevant classes of autism which we validate and replicate in a large independent cohort. We link the phenotypic findings to distinct patterns of and inherited variation which emerge from the deconvolution of these genetic signals, and demonstrate that class-specific common variant scores strongly align with clinical outcomes. We further provide insights into the distinct biological pathways and processes disrupted by the sets of mutations in each class. Remarkably, we discover class-specific differences in the developmental timing of genes that are dysregulated, and these temporal patterns correspond to clinical milestone and outcome differences between the classes. These analyses embrace the phenotypic complexity of children with autism, unraveling genetic and molecular programs underlying their heterogeneity and suggesting specific biological dysregulation patterns and mechanistic hypotheses.
解析自闭症的表型和遗传复杂性极具挑战性,但对于理解该病症多种形式的生物学特性、遗传方式、发展轨迹及临床表现至关重要。在此,我们利用来自一个大型队列的广泛表型数据及匹配的遗传学数据,以刻画自闭症的类别及其核心、相关和共现特征的模式,最终证明表型模式与不同的遗传和分子程序相关。我们采用生成性混合建模方法来识别可靠的、与临床相关的自闭症类别,并在一个大型独立队列中进行验证和重复。我们将表型研究结果与从这些遗传信号解卷积中出现的独特模式和遗传变异相联系,并证明类别特异性常见变异分数与临床结果高度一致。我们进一步深入了解了每个类别中因突变集而 disrupted 的不同生物学途径和过程。值得注意的是,我们发现基因失调的发育时间存在类别特异性差异,且这些时间模式与类别之间的临床里程碑和结果差异相对应。这些分析涵盖了自闭症儿童的表型复杂性,揭示了其异质性背后的遗传和分子程序,并提出了特定的生物失调模式和机制假设。