The Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
Brain. 2023 Feb 13;146(2):519-533. doi: 10.1093/brain/awac381.
Neurodevelopmental disorders (NDDs), including severe paediatric epilepsy, autism and intellectual disabilities are heterogeneous conditions in which clinical genetic testing can often identify a pathogenic variant. For many of them, genetic therapies will be tested in this or the coming years in clinical trials. In contrast to first-generation symptomatic treatments, the new disease-modifying precision medicines require a genetic test-informed diagnosis before a patient can be enrolled in a clinical trial. However, even in 2022, most identified genetic variants in NDD genes are 'variants of uncertain significance'. To safely enrol patients in precision medicine clinical trials, it is important to increase our knowledge about which regions in NDD-associated proteins can 'tolerate' missense variants and which ones are 'essential' and will cause a NDD when mutated. In addition, knowledge about functionally indispensable regions in the 3D structure context of proteins can also provide insights into the molecular mechanisms of disease variants. We developed a novel consensus approach that overlays evolutionary, and population based genomic scores to identify 3D essential sites (Essential3D) on protein structures. After extensive benchmarking of AlphaFold predicted and experimentally solved protein structures, we generated the currently largest expert curated protein structure set for 242 NDDs and identified 14 377 Essential3D sites across 189 gene disorders associated proteins. We demonstrate that the consensus annotation of Essential3D sites improves prioritization of disease mutations over single annotations. The identified Essential3D sites were enriched for functional features such as intermembrane regions or active sites and discovered key inter-molecule interactions in protein complexes that were otherwise not annotated. Using the currently largest autism, developmental disorders, and epilepsies exome sequencing studies including >360 000 NDD patients and population controls, we found that missense variants at Essential3D sites are 8-fold enriched in patients. In summary, we developed a comprehensive protein structure set for 242 NDDs and identified 14 377 Essential3D sites in these. All data are available at https://es-ndd.broadinstitute.org for interactive visual inspection to enhance variant interpretation and development of mechanistic hypotheses for 242 NDDs genes. The provided resources will enhance clinical variant interpretation and in silico drug target development for NDD-associated genes and encoded proteins.
神经发育障碍(NDD),包括严重的儿科癫痫、自闭症和智力残疾,是异质性疾病,临床遗传学检测通常可以确定致病性变异。对于其中许多疾病,遗传疗法将在未来几年的临床试验中进行测试。与第一代对症治疗不同,新型疾病修饰精准药物在患者被纳入临床试验之前,需要基于基因检测的诊断。然而,即使在 2022 年,在 NDD 基因中发现的大多数遗传变异仍然是“意义未明的变异”。为了安全地将患者纳入精准医学临床试验,增加我们对 NDD 相关蛋白中哪些区域可以“耐受”错义变异,以及哪些区域是“必需的”并在突变时导致 NDD 的认识非常重要。此外,关于蛋白质 3D 结构背景中功能不可或缺区域的知识也可以提供对疾病变异分子机制的深入了解。我们开发了一种新的共识方法,该方法覆盖了进化和基于群体的基因组评分,以确定蛋白质结构上的 3D 必需区域(Essential3D)。在对 AlphaFold 预测的和实验解决的蛋白质结构进行广泛的基准测试之后,我们为 242 种 NDD 生成了目前最大的专家 curated 蛋白质结构集,并在 189 种与基因疾病相关的蛋白质中鉴定了 14377 个 Essential3D 位点。我们证明,essential3D 位点的共识注释可以提高对疾病突变的优先级,而不是单一注释。鉴定的 essential3D 位点富集了功能特征,如跨膜区域或活性位点,并发现了蛋白质复合物中否则未注释的关键分子间相互作用。使用目前最大的自闭症、发育障碍和癫痫外显子组测序研究,包括>360000 名 NDD 患者和群体对照,我们发现 essential3D 位点的错义变异在患者中富集了 8 倍。总之,我们为 242 种 NDD 开发了一个全面的蛋白质结构集,并在其中鉴定了 14377 个 essential3D 位点。所有数据都可在 https://es-ndd.broadinstitute.org 上获得,用于交互式可视化检查,以增强对 242 种 NDD 基因变异的解释,并为这些基因的机制假说的发展提供信息。提供的资源将增强对 NDD 相关基因和编码蛋白的临床变异解释和计算机药物靶点开发。