文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies.

作者信息

Dong Chengliang, Wei Peng, Jian Xueqiu, Gibbs Richard, Boerwinkle Eric, Wang Kai, Liu Xiaoming

机构信息

Zilkha Neurogenetic Institute, Biostatistics Division, Department of Preventive Medicine and.

Human Genetics Center, Division of Biostatistics, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA and.

出版信息

Hum Mol Genet. 2015 Apr 15;24(8):2125-37. doi: 10.1093/hmg/ddu733. Epub 2014 Dec 30.


DOI:10.1093/hmg/ddu733
PMID:25552646
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4375422/
Abstract

Accurate deleteriousness prediction for nonsynonymous variants is crucial for distinguishing pathogenic mutations from background polymorphisms in whole exome sequencing (WES) studies. Although many deleteriousness prediction methods have been developed, their prediction results are sometimes inconsistent with each other and their relative merits are still unclear in practical applications. To address these issues, we comprehensively evaluated the predictive performance of 18 current deleteriousness-scoring methods, including 11 function prediction scores (PolyPhen-2, SIFT, MutationTaster, Mutation Assessor, FATHMM, LRT, PANTHER, PhD-SNP, SNAP, SNPs&GO and MutPred), 3 conservation scores (GERP++, SiPhy and PhyloP) and 4 ensemble scores (CADD, PON-P, KGGSeq and CONDEL). We found that FATHMM and KGGSeq had the highest discriminative power among independent scores and ensemble scores, respectively. Moreover, to ensure unbiased performance evaluation of these prediction scores, we manually collected three distinct testing datasets, on which no current prediction scores were tuned. In addition, we developed two new ensemble scores that integrate nine independent scores and allele frequency. Our scores achieved the highest discriminative power compared with all the deleteriousness prediction scores tested and showed low false-positive prediction rate for benign yet rare nonsynonymous variants, which demonstrated the value of combining information from multiple orthologous approaches. Finally, to facilitate variant prioritization in WES studies, we have pre-computed our ensemble scores for 87 347 044 possible variants in the whole-exome and made them publicly available through the ANNOVAR software and the dbNSFP database.

摘要

相似文献

[1]
Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies.

Hum Mol Genet. 2015-4-15

[2]
REVEL: An Ensemble Method for Predicting the Pathogenicity of Rare Missense Variants.

Am J Hum Genet. 2016-10-6

[3]
dbNSFP v2.0: a database of human non-synonymous SNVs and their functional predictions and annotations.

Hum Mutat. 2013-7-10

[4]
dbNSFP v3.0: A One-Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice-Site SNVs.

Hum Mutat. 2016-3

[5]
dbNSFP v4: a comprehensive database of transcript-specific functional predictions and annotations for human nonsynonymous and splice-site SNVs.

Genome Med. 2020-12-2

[6]
Integrating multiple genomic data to predict disease-causing nonsynonymous single nucleotide variants in exome sequencing studies.

PLoS Genet. 2014-3-20

[7]
PERCH: A Unified Framework for Disease Gene Prioritization.

Hum Mutat. 2017-3

[8]
dbNSFP: a lightweight database of human nonsynonymous SNPs and their functional predictions.

Hum Mutat. 2011-8

[9]
The evaluation of tools used to predict the impact of missense variants is hindered by two types of circularity.

Hum Mutat. 2015-5

[10]
IMHOTEP-a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants.

Nucleic Acids Res. 2017-2-17

引用本文的文献

[1]
Prediction of human pathogenic start loss variants based on self-supervised contrastive learning.

BMC Biol. 2025-8-8

[2]
Exploring Molecular and Phenotypic Characteristics of Arg234Gly and Asp312Asn Variants.

Mol Syndromol. 2025-8

[3]
Health risks and genetic architecture of objectively measured multidimensional sleep health.

Nat Commun. 2025-7-31

[4]
Phenotypic and Genotypic Characterization of 171 Patients with Syndromic Inherited Retinal Diseases Highlights the Importance of Genetic Testing for Accurate Clinical Diagnosis.

Genes (Basel). 2025-6-26

[5]
Predicting the Damaging Potential of Uncharacterized and Variants.

Int J Mol Sci. 2025-7-8

[6]
An Integrated Clinical, Germline, Somatic, and In Silico Approach to Assess a Novel PMS2 Gene Variant Identified in Two Unrelated Lynch Syndrome Families.

Cancers (Basel). 2025-7-11

[7]
Leukemia mutated proteins PHF6 and PHIP form a chromatin complex that represses acute myeloid leukemia stemness.

Genes Dev. 2025-7-28

[8]
Sequencing validates deep learning models for EHR-based detection of Noonan syndrome in pediatric patients.

NPJ Genom Med. 2025-7-21

[9]
De Novo Variant Associated With Juvenile-Onset Temporal Lobe Epilepsy With Favorable Outcomes.

Hum Mutat. 2025-2-12

[10]
Insights on SNPs of Human Activation-Induced Cytidine Deaminase AID.

Int J Mol Sci. 2025-6-25

本文引用的文献

[1]
A general framework for estimating the relative pathogenicity of human genetic variants.

Nat Genet. 2014-2-2

[2]
Improved exome prioritization of disease genes through cross-species phenotype comparison.

Genome Res. 2013-10-25

[3]
Integrated enrichment analysis of variants and pathways in genome-wide association studies indicates central role for IL-2 signaling genes in type 1 diabetes, and cytokine signaling genes in Crohn's disease.

PLoS Genet. 2013-10-3

[4]
eXtasy: variant prioritization by genomic data fusion.

Nat Methods. 2013-9-29

[5]
dbNSFP v2.0: a database of human non-synonymous SNVs and their functional predictions and annotations.

Hum Mutat. 2013-7-10

[6]
Whole-genome sequence-based analysis of high-density lipoprotein cholesterol.

Nat Genet. 2013-6-16

[7]
Predicting the functional consequences of cancer-associated amino acid substitutions.

Bioinformatics. 2013-4-25

[8]
An integrated map of genetic variation from 1,092 human genomes.

Nature. 2012-11-1

[9]
Exploiting protein-protein interaction networks for genome-wide disease-gene prioritization.

PLoS One. 2012-9-21

[10]
VariBench: a benchmark database for variations.

Hum Mutat. 2012-10-11

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索