Corcuff Mélanie, Garibal Marc, Desvignes Jean-Pierre, Guien Céline, Grattepanche Coralie, Collod-Béroud Gwenaëlle, Ménoret Estelle, Salgado David, Béroud Christophe
Aix Marseille University, INSERM, MMG, Bioinformatics & Genetics, Marseille, France.
Laboratoire de Génétique Médicale, APHM Hôpital d'Enfants de la Timone, Marseille, France.
Front Bioinform. 2023 Feb 21;3:1127341. doi: 10.3389/fbinf.2023.1127341. eCollection 2023.
Using the ACMG-AMP guidelines for the interpretation of sequence variants, it remains difficult to meet the criterion associated with the protein domain, PM1, which is assigned in only about 10% of cases, whereas the criteria related to variant frequency, PM2/BA1/BS1, is reported in 50% of cases. To improve the classification of human missense variants using protein domains information, we developed the DOLPHIN system (https://dolphin.mmg-gbit.eu). We used Pfam alignments of eukaryotes to define DOLPHIN scores to identify protein domain residues and variants that have a significant impact. In parallel, we enriched gnomAD variants frequencies for each domains' residue. These were validated using ClinVar data. We applied this method to all potential human transcripts' variants, resulting in 30.0% being assigned a PM1 label, whereas 33.2% were eligible for a new benign support criterion, BP8. We also showed that DOLPHIN provides an extrapolated frequency for 31.8% of the variants, compared to the original frequency available in gnomAD for 7.6% of them. Overall, DOLPHIN allows a simplified use of the PM1 criterion, an expanded application of the PM2/BS1 criteria and the creation of a new BP8 criterion. DOLPHIN could facilitate the classification of amino acid substitutions in protein domains that cover nearly 40% of proteins and represent the sites of most pathogenic variants.
使用美国医学遗传学与基因组学学会(ACMG)和分子病理学会(AMP)的序列变异解读指南,仍然难以满足与蛋白质结构域相关的标准PM1,只有约10%的病例符合该标准,而与变异频率相关的标准PM2/BA1/BS1在50%的病例中被报道。为了利用蛋白质结构域信息改进人类错义变异的分类,我们开发了DOLPHIN系统(https://dolphin.mmg-gbit.eu)。我们使用真核生物的Pfam比对来定义DOLPHIN分数,以识别对蛋白质结构域残基和变异有重大影响的情况。同时,我们丰富了每个结构域残基的gnomAD变异频率。这些通过ClinVar数据进行了验证。我们将此方法应用于所有潜在的人类转录本变异,结果30.0%的变异被赋予PM1标签,而33.2%的变异符合新的良性支持标准BP8。我们还表明,与gnomAD中7.6%的变异的原始频率相比,DOLPHIN为31.8%的变异提供了外推频率。总体而言,DOLPHIN允许简化PM1标准的使用,扩展PM2/BS1标准的应用,并创建新的BP8标准。DOLPHIN可以促进对几乎40%的蛋白质的蛋白质结构域中氨基酸替换的分类,这些结构域代表了大多数致病变异的位点。