Balasubramanian Suganthi, Xia Yu, Freinkman Elizaveta, Gerstein Mark
Department of Molecular Biophysics and Biochemistry, Yale University 266 Whitney Avenue, New Haven, CT 06520-8114, USA.
Nucleic Acids Res. 2005 Mar 22;33(5):1710-21. doi: 10.1093/nar/gki311. Print 2005.
We assessed the disease-causing potential of single nucleotide polymorphisms (SNPs) based on a simple set of sequence-based features. We focused on SNPs from the dbSNP database in G-protein-coupled receptors (GPCRs), a large class of important transmembrane (TM) proteins. Apart from the location of the SNP in the protein, we evaluated the predictive power of three major classes of features to differentiate between disease-causing mutations and neutral changes: (i) properties derived from amino-acid scales, such as volume and hydrophobicity; (ii) position-specific phylogenetic features reflecting evolutionary conservation, such as normalized site entropy, residue frequency and SIFT score; and (iii) substitution-matrix scores, such as those derived from the BLOSUM62, GRANTHAM and PHAT matrices. We validated our approach using a control dataset consisting of known disease-causing mutations and neutral variations. Logistic regression analyses indicated that position-specific phylogenetic features that describe the conservation of an amino acid at a specific site are the best discriminators of disease mutations versus neutral variations, and integration of all our features improves discrimination power. Overall, we identify 115 SNPs in GPCRs from dbSNP that are likely to be associated with disease and thus are good candidates for genotyping in association studies.
我们基于一组简单的序列特征评估了单核苷酸多态性(SNP)的致病潜力。我们重点关注来自dbSNP数据库中G蛋白偶联受体(GPCR)的SNP,GPCR是一大类重要的跨膜(TM)蛋白。除了SNP在蛋白质中的位置外,我们还评估了三大类特征区分致病突变和中性变化的预测能力:(i)源自氨基酸量表的属性,如体积和疏水性;(ii)反映进化保守性的位置特异性系统发育特征,如标准化位点熵、残基频率和SIFT评分;以及(iii)替代矩阵分数,如源自BLOSUM62、GRANTHAM和PHAT矩阵的分数。我们使用由已知致病突变和中性变异组成的对照数据集验证了我们的方法。逻辑回归分析表明,描述特定位点氨基酸保守性的位置特异性系统发育特征是区分疾病突变与中性变异的最佳判别指标,整合所有特征可提高判别能力。总体而言,我们在dbSNP的GPCR中鉴定出115个可能与疾病相关的SNP,因此是关联研究中基因分型的良好候选者。