Ruiz-Alías Gabriel, Soldevila Sergi, Altafaj Xavier, Cordomí Arnau, Olivella Mireia
Department of Biosciences, Faculty of Sciences and Technology, University of Vic-Central University of Catalonia, Vic, Barcelona 08500, Spain.
Institute for Research and Innovation in Life and Health Sciences (IRIS-CC), University of Vic-Central University of Catalonia, Vic, Barcelona 08500, Spain.
Bioinformatics. 2025 May 6;41(5). doi: 10.1093/bioinformatics/btaf305.
High-throughput DNA sequencing has revealed millions of single nucleotide variants (SNVs) in the human genome, with a small fraction linked to disease. The effect of missense variants, which alter the protein sequence, is particularly challenging to interpret due to the scarcity of clinical annotations and experimental information. While using conservation and structural information, current prediction tools still struggle to predict variant pathogenicity. In this study, we explored the pathogenicity of homologous missense variants-variants in equivalent positions across homologous proteins-focusing on proteins involved in autosomal dominant diseases.
Our analysis of 2976 pathogenic and 17 555 non-pathogenic homologous variants demonstrated that pathogenicity can be extrapolated with 95% accuracy within a family, or up to 98% for closer homologs. Remarkably, the evaluation of 27 commonly used mutation predictor methods revealed that they were not fully capturing this biological feature. To facilitate the exploration of homologous variants, we created HomolVar, a web server that computationally predicts the pathogenesis of missense variants using annotations from homologous variants, freely available at https://rarevariants.org/HomolVar. Overall, these findings and the accompanying tool offer a robust method for predicting the pathogenicity of unannotated variants, enhancing genotype-phenotype correlations, and contributing to diagnosing rare genetic disorders.
HomolVar is freely available at https://rarevariants.org/HomolVar.
高通量DNA测序揭示了人类基因组中数以百万计的单核苷酸变异(SNV),其中一小部分与疾病相关。错义变异会改变蛋白质序列,由于临床注释和实验信息匮乏,其影响尤其难以解释。尽管使用了保守性和结构信息,但当前的预测工具在预测变异致病性方面仍存在困难。在本研究中,我们探讨了同源错义变异(同源蛋白质中相同位置的变异)的致病性,重点关注常染色体显性疾病相关蛋白质。
我们对2976个致病性同源变异和17555个非致病性同源变异的分析表明,在一个家族内,致病性推断的准确率可达95%,对于亲缘关系更近的同源物,准确率可达98%。值得注意的是,对27种常用突变预测方法的评估表明,它们并未完全捕捉到这一生物学特征。为便于探索同源变异,我们创建了HomolVar网络服务器,它利用同源变异的注释通过计算预测错义变异的致病性,可在https://rarevariants.org/HomolVar免费获取。总体而言,这些发现及配套工具为预测未注释变异的致病性、增强基因型-表型相关性以及诊断罕见遗传病提供了一种可靠方法。
HomolVar可在https://rarevariants.org/HomolVar免费获取。