Suppr
超能文献

PredictSNP：用于预测疾病相关突变的强大且准确的一致性分类器。

PredictSNP: robust and accurate consensus classifier for prediction of disease-related mutations.

作者信息

Bendl Jaroslav, Stourac Jan, Salanda Ondrej, Pavelka Antonin, Wieben Eric D, Zendulka Jaroslav, Brezovsky Jan, Damborsky Jiri

机构信息

Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment, Faculty of Science, Masaryk University, Brno, Czech Republic ; Department of Information Systems, Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic ; Center of Biomolecular and Cellular Engineering, International Centre for Clinical Research, St. Anne's University Hospital Brno, Brno, Czech Republic.

Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment, Faculty of Science, Masaryk University, Brno, Czech Republic ; Center of Biomolecular and Cellular Engineering, International Centre for Clinical Research, St. Anne's University Hospital Brno, Brno, Czech Republic.

出版信息

PLoS Comput Biol. 2014 Jan;10(1):e1003440. doi: 10.1371/journal.pcbi.1003440. Epub 2014 Jan 16.

DOI:10.1371/journal.pcbi.1003440

PMID:24453961

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3894168/

Abstract

Single nucleotide variants represent a prevalent form of genetic variation. Mutations in the coding regions are frequently associated with the development of various genetic diseases. Computational tools for the prediction of the effects of mutations on protein function are very important for analysis of single nucleotide variants and their prioritization for experimental characterization. Many computational tools are already widely employed for this purpose. Unfortunately, their comparison and further improvement is hindered by large overlaps between the training datasets and benchmark datasets, which lead to biased and overly optimistic reported performances. In this study, we have constructed three independent datasets by removing all duplicities, inconsistencies and mutations previously used in the training of evaluated tools. The benchmark dataset containing over 43,000 mutations was employed for the unbiased evaluation of eight established prediction tools: MAPP, nsSNPAnalyzer, PANTHER, PhD-SNP, PolyPhen-1, PolyPhen-2, SIFT and SNAP. The six best performing tools were combined into a consensus classifier PredictSNP, resulting into significantly improved prediction performance, and at the same time returned results for all mutations, confirming that consensus prediction represents an accurate and robust alternative to the predictions delivered by individual tools. A user-friendly web interface enables easy access to all eight prediction tools, the consensus classifier PredictSNP and annotations from the Protein Mutant Database and the UniProt database. The web server and the datasets are freely available to the academic community at http://loschmidt.chemi.muni.cz/predictsnp.

摘要

单核苷酸变异是一种常见的遗传变异形式。编码区的突变常常与各种遗传疾病的发生相关。用于预测突变对蛋白质功能影响的计算工具对于单核苷酸变异分析及其实验表征的优先级确定非常重要。许多计算工具已广泛用于此目的。不幸的是，训练数据集和基准数据集之间的大量重叠阻碍了它们的比较和进一步改进，这导致报告的性能存在偏差且过于乐观。在本研究中，我们通过去除先前在评估工具训练中使用的所有重复、不一致和突变构建了三个独立的数据集。包含超过43,000个突变的基准数据集用于对八个既定预测工具进行无偏评估：MAPP、nsSNPAnalyzer、PANTHER、PhD-SNP、PolyPhen-1、PolyPhen-2、SIFT和SNAP。六个性能最佳的工具被组合成一个共识分类器PredictSNP，从而显著提高了预测性能，同时返回了所有突变的结果，证实了共识预测是单个工具预测的准确且稳健的替代方法。一个用户友好的网络界面使人们能够轻松访问所有八个预测工具、共识分类器PredictSNP以及来自蛋白质突变数据库和UniProt数据库的注释。网络服务器和数据集可在http://loschmidt.chemi.muni.cz/predictsnp免费提供给学术界。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/818d/3894168/8f2bceb9c028/pcbi.1003440.g001.jpg

相似文献

PredictSNP: robust and accurate consensus classifier for prediction of disease-related mutations.

PLoS Comput Biol. 2014 Jan;10(1):e1003440. doi: 10.1371/journal.pcbi.1003440. Epub 2014 Jan 16.

PredictSNP2: A Unified Platform for Accurately Evaluating SNP Effects by Exploiting the Different Characteristics of Variants in Distinct Genomic Regions.

PLoS Comput Biol. 2016 May 25;12(5):e1004962. doi: 10.1371/journal.pcbi.1004962. eCollection 2016 May.

Performance of mutation pathogenicity prediction methods on missense variants.

Hum Mutat. 2011 Apr;32(4):358-68. doi: 10.1002/humu.21445. Epub 2011 Feb 22.

Assessment of the predictive accuracy of five in silico prediction tools, alone or in combination, and two metaservers to classify long QT syndrome gene mutations.

BMC Med Genet. 2015 May 13;16:34. doi: 10.1186/s12881-015-0176-z.

MutTMPredictor: Robust and accurate cascade XGBoost classifier for prediction of mutations in transmembrane proteins.

Comput Struct Biotechnol J. 2021 Nov 19;19:6400-6416. doi: 10.1016/j.csbj.2021.11.024. eCollection 2021.

The evaluation of tools used to predict the impact of missense variants is hindered by two types of circularity.

Hum Mutat. 2015 May;36(5):513-23. doi: 10.1002/humu.22768. Epub 2015 Mar 26.

Prediction of the pathogenicity of antithrombin sequence variations by in silico methods.

Thromb Res. 2015 Feb;135(2):404-9. doi: 10.1016/j.thromres.2014.11.022. Epub 2014 Dec 4.

Collective judgment predicts disease-associated single nucleotide variants.

BMC Genomics. 2013;14 Suppl 3(Suppl 3):S2. doi: 10.1186/1471-2164-14-S3-S2. Epub 2013 May 28.

Structural modeling and in silico analysis of human superoxide dismutase 2.

PLoS One. 2013 Jun 13;8(6):e65558. doi: 10.1371/journal.pone.0065558. Print 2013.

An integrated database-pipeline system for studying single nucleotide polymorphisms and diseases.

BMC Bioinformatics. 2008 Dec 12;9 Suppl 12(Suppl 12):S19. doi: 10.1186/1471-2105-9-S12-S19.

引用本文的文献

Computational prediction of high-risk non-synonymous SNPs in human ApoE and their structural impact on amyloid-β interaction in Alzheimer's disease pathogenesis.

PLoS One. 2025 Sep 2;20(9):e0331339. doi: 10.1371/journal.pone.0331339. eCollection 2025.

Missense Variant in Dogs with Cerebellar Ataxia.

Genes (Basel). 2025 Aug 4;16(8):934. doi: 10.3390/genes16080934.

Uncovering the protein aggregation process through effect of G41D mutant SOD1 charge variation in ALS disease.

Sci Rep. 2025 Aug 27;15(1):31661. doi: 10.1038/s41598-025-16910-9.

HTSNPedia: A Molecular Perspective and Risk Estimator Database for Hypertension-Associated Genes.

Biochem Genet. 2025 Aug 25. doi: 10.1007/s10528-025-11232-x.

"In silico analysis of human TLR3 missense single nucleotide polymorphisms and their potential association with cancer".

Sci Rep. 2025 Aug 22;15(1):30837. doi: 10.1038/s41598-025-05599-5.

ConPath 2.0: an optimized consensus strategy for assessing the potential pathogenicity of hRPE65 missense variants.

J Mol Model. 2025 Aug 20;31(9):251. doi: 10.1007/s00894-025-06481-x.

Computational association in parkinson's disease SNPs with brain structural and functional alterations.

Neurogenetics. 2025 Aug 9;26(1):59. doi: 10.1007/s10048-025-00843-6.

Genetic and computational analysis of AKR1C4 gene rs17134592 polymorphism in breast cancer among the Bangladeshi population.

Sci Rep. 2025 Jul 28;15(1):27526. doi: 10.1038/s41598-025-13411-7.

Functional and structural analysis of missense variants in the human Gene.

J Public Health Afr. 2025 Jun 20;16(4):1348. doi: 10.4102/jphia.v16i4.1348. eCollection 2025.

Clinical Characteristics and Genetic Variants in Children with Mutation-Associated Disorders.

Medicina (Kaunas). 2025 May 22;61(6):959. doi: 10.3390/medicina61060959.

本文引用的文献

Collective judgment predicts disease-associated single nucleotide variants.

BMC Genomics. 2013;14 Suppl 3(Suppl 3):S2. doi: 10.1186/1471-2164-14-S3-S2. Epub 2013 May 28.

Residue mutations and their impact on protein structure and function: detecting beneficial and pathogenic changes.

Biochem J. 2013 Feb 1;449(3):581-94. doi: 10.1042/BJ20121221.

The Human Gene Mutation Database (HGMD) and its exploitation in the fields of personalized genomics and molecular evolution.

Curr Protoc Bioinformatics. 2012 Sep;Chapter 1:1.13.1-1.13.20. doi: 10.1002/0471250953.bi0113s39.

SIFT web server: predicting effects of amino acid substitutions on proteins.

Nucleic Acids Res. 2012 Jul;40(Web Server issue):W452-7. doi: 10.1093/nar/gks539. Epub 2012 Jun 11.

PON-P: integrated predictor for pathogenicity of missense variants.

Hum Mutat. 2012 Aug;33(8):1166-74. doi: 10.1002/humu.22102. Epub 2012 May 7.

Bioinformatics for personal genome interpretation.

Brief Bioinform. 2012 Jul;13(4):495-512. doi: 10.1093/bib/bbr070. Epub 2012 Jan 13.

Reorganizing the protein space at the Universal Protein Resource (UniProt).

Nucleic Acids Res. 2012 Jan;40(Database issue):D71-5. doi: 10.1093/nar/gkr981. Epub 2011 Nov 18.

Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data.

Nat Rev Genet. 2011 Aug 18;12(9):628-40. doi: 10.1038/nrg3046.

Improving the assessment of the outcome of nonsynonymous SNVs with a consensus deleteriousness score, Condel.

Am J Hum Genet. 2011 Apr 8;88(4):440-9. doi: 10.1016/j.ajhg.2011.03.004. Epub 2011 Mar 31.

Performance of mutation pathogenicity prediction methods on missense variants.

Hum Mutat. 2011 Apr;32(4):358-68. doi: 10.1002/humu.21445. Epub 2011 Feb 22.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

PredictSNP：用于预测疾病相关突变的强大且准确的一致性分类器。

PredictSNP: robust and accurate consensus classifier for prediction of disease-related mutations.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译