Sethumadhavan Rao, Doss C George Priya, Rajasekaran R
School of BioSciences and Technology, Vellore Institute of Technology, 632014, Vellore, Tamil Nadu, India.
Methods Mol Biol. 2011;760:239-50. doi: 10.1007/978-1-61779-176-5_15.
Experimental analyses of disease-associated DNA variants have provided significant insights into the functional implications of sequence variation. However, such experiment-based approaches for identifying functional DNA variants from a pool with a large number of neutral variants are challenging. Computational biology has the opportunity to play an important role in the identification of functional DNA variants in large-scale genotyping studies, ultimately yielding new drug targets and biomarkers. This chapter outlines in silico methods to predict disease-associated functional DNA variants so that the number of DNA variants screened for association with disease can be reduced to those that are most likely to alter gene function. To explore possible relationships between genetic mutations and phenotypic variation, different computational methods like Sorting Intolerant from Tolerant (SIFT, an evolutionary-based approach), Polymorphism Phenotyping (PolyPhen, a structure-based approach) and PupaSuite are discussed for prioritization of high-risk DNA variants. The PupaSuite tool aims to predict the phenotypic effect of DNA variants on the structure and function of the affected protein as well as the effect of variants in the non-coding regions of the same genes. To further investigate the possible causes of disease at the molecular level, deleterious nonsynonymous variants can be mapped to 3D protein structures. An analysis of solvent accessibility and secondary structure can also be performed to understand the impact of a mutation on protein function and stability. This chapter demonstrates a 'real-world' application of some existing bioinformatics tools for DNA variant analysis.
对与疾病相关的DNA变异进行实验分析,已为序列变异的功能影响提供了重要见解。然而,从大量中性变异中识别功能性DNA变异的这种基于实验的方法具有挑战性。计算生物学有机会在大规模基因分型研究中识别功能性DNA变异方面发挥重要作用,最终产生新的药物靶点和生物标志物。本章概述了计算机方法,以预测与疾病相关的功能性DNA变异,从而将筛选与疾病相关的DNA变异数量减少到最有可能改变基因功能的那些变异。为了探索基因突变与表型变异之间的可能关系,讨论了不同的计算方法,如从耐受中筛选不耐受(SIFT,一种基于进化的方法)、多态性表型分析(PolyPhen,一种基于结构的方法)和PupaSuite,用于对高风险DNA变异进行优先级排序。PupaSuite工具旨在预测DNA变异对受影响蛋白质的结构和功能的表型效应,以及同一基因非编码区变异的效应。为了在分子水平上进一步研究疾病的可能原因,可以将有害的非同义变异映射到三维蛋白质结构上。还可以进行溶剂可及性和二级结构分析,以了解突变对蛋白质功能和稳定性的影响。本章展示了一些现有生物信息学工具在DNA变异分析中的“实际应用”。