Frédéric Mélissa Yana, Lalande Marine, Boileau Catherine, Hamroun Dalil, Claustres Mireille, Béroud Christophe, Collod-Béroud Gwenaëlle
INSERM, U827, Montpellier, France.
Hum Mutat. 2009 Jun;30(6):952-9. doi: 10.1002/humu.20970.
Approximately half of gene lesions responsible for human inherited diseases are due to an amino acid substitution, showing that this mutational mechanism plays a large role in diseases. Distinguishing neutral sequence variations from those responsible for the phenotype is of major interest in human genetics. Because in vitro validation of mutations is not always possible in diagnostic settings, indirect arguments must be accumulated to define whether a missense variation is causative. To further differentiate neutral variants from pathogenic nucleotide substitutions, we developed a new tool, UMD-Predictor. This tool provides a combinatorial approach that associates the following data: localization within the protein, conservation, biochemical properties of the mutant and wild-type residues, and the potential impact of the variation on mRNA. To evaluate this new tool, we compared it to the SIFT, PolyPhen, and SNAP software, the BLOSUM62 and Yu's Biochemical Matrices. All tools were evaluated using variations from well-validated datasets extracted from four UMD-LSDB databases (UMD-FBN1, UMD-FBN2, UMD-TGFBR1, and UMD-TGFBR2) that contain all published mutations of the corresponding genes, that is, 1,945 mutations, among which 796 different substitutions corresponding to missense mutations. Our results show that the UMD-Predictor algorithm is the most efficient tool to predict pathogenic mutations in this context with a positive predictive value of 99.4%, a sensitivity of 95.4%, and a specificity of 92.2%. It can thus enhance the interpretation of variations in these genes, and could easily be applied to any other disease gene through the freely available UMD generic software (http://www.umd.be).
导致人类遗传性疾病的基因损伤中,约有一半是由氨基酸替换引起的,这表明这种突变机制在疾病中起着重要作用。区分中性序列变异与导致表型的变异是人类遗传学的主要研究兴趣。由于在诊断环境中并非总能对突变进行体外验证,因此必须积累间接证据来确定错义变异是否具有致病性。为了进一步区分中性变异与致病性核苷酸替换,我们开发了一种新工具UMD-Predictor。该工具提供了一种组合方法,将以下数据关联起来:在蛋白质中的定位、保守性、突变型和野生型残基的生化特性,以及变异对mRNA的潜在影响。为了评估这个新工具,我们将其与SIFT、PolyPhen和SNAP软件、BLOSUM62和Yu氏生化矩阵进行了比较。所有工具均使用从四个UMD-LSDB数据库(UMD-FBN1、UMD-FBN2、UMD-TGFBR1和UMD-TGFBR2)中提取的经过充分验证的数据集的变异进行评估,这些数据库包含相应基因的所有已发表突变,即1945个突变,其中796个不同的替换对应错义突变。我们的结果表明,在这种情况下,UMD-Predictor算法是预测致病性突变最有效的工具,其阳性预测值为99.4%,敏感性为95.4%,特异性为92.2%。因此,它可以增强对这些基因变异的解释,并且可以通过免费提供的UMD通用软件(http://www.umd.be)轻松应用于任何其他疾病基因。