• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用基于独立功能测定数据集评估的变异效应预测工具:对发现和诊断的影响。

Variant effect prediction tools assessed using independent, functional assay-based datasets: implications for discovery and diagnostics.

作者信息

Mahmood Khalid, Jung Chol-Hee, Philip Gayle, Georgeson Peter, Chung Jessica, Pope Bernard J, Park Daniel J

机构信息

Melbourne Bioinformatics, The University of Melbourne, Melbourne, Australia.

出版信息

Hum Genomics. 2017 May 16;11(1):10. doi: 10.1186/s40246-017-0104-8.

DOI:10.1186/s40246-017-0104-8
PMID:28511696
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5433009/
Abstract

BACKGROUND

Genetic variant effect prediction algorithms are used extensively in clinical genomics and research to determine the likely consequences of amino acid substitutions on protein function. It is vital that we better understand their accuracies and limitations because published performance metrics are confounded by serious problems of circularity and error propagation. Here, we derive three independent, functionally determined human mutation datasets, UniFun, BRCA1-DMS and TP53-TA, and employ them, alongside previously described datasets, to assess the pre-eminent variant effect prediction tools.

RESULTS

Apparent accuracies of variant effect prediction tools were influenced significantly by the benchmarking dataset. Benchmarking with the assay-determined datasets UniFun and BRCA1-DMS yielded areas under the receiver operating characteristic curves in the modest ranges of 0.52 to 0.63 and 0.54 to 0.75, respectively, considerably lower than observed for other, potentially more conflicted datasets.

CONCLUSIONS

These results raise concerns about how such algorithms should be employed, particularly in a clinical setting. Contemporary variant effect prediction tools are unlikely to be as accurate at the general prediction of functional impacts on proteins as reported prior. Use of functional assay-based datasets that avoid prior dependencies promises to be valuable for the ongoing development and accurate benchmarking of such tools.

摘要

背景

基因变异效应预测算法在临床基因组学和研究中被广泛用于确定氨基酸替换对蛋白质功能可能产生的后果。我们必须更好地了解它们的准确性和局限性,因为已发表的性能指标受到严重的循环性和误差传播问题的混淆。在这里,我们推导了三个独立的、功能确定的人类突变数据集,即UniFun、BRCA1-DMS和TP53-TA,并将它们与先前描述的数据集一起用于评估卓越的变异效应预测工具。

结果

变异效应预测工具的表观准确性受到基准数据集的显著影响。使用通过实验确定的数据集UniFun和BRCA1-DMS进行基准测试时,受试者工作特征曲线下的面积分别在0.52至0.63和0.54至0.75的适度范围内,远低于在其他可能更具冲突性的数据集上观察到的结果。

结论

这些结果引发了对如何使用此类算法的担忧,尤其是在临床环境中。当代变异效应预测工具在对蛋白质功能影响的一般预测方面不太可能像之前报道的那样准确。使用避免先前依赖性的基于功能测定的数据集有望对这类工具的持续开发和准确基准测试有价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2b5/5433009/efbb5eb5a27f/40246_2017_104_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2b5/5433009/0a4af094b1c0/40246_2017_104_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2b5/5433009/a152a519e439/40246_2017_104_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2b5/5433009/efbb5eb5a27f/40246_2017_104_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2b5/5433009/0a4af094b1c0/40246_2017_104_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2b5/5433009/a152a519e439/40246_2017_104_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2b5/5433009/efbb5eb5a27f/40246_2017_104_Fig3_HTML.jpg

相似文献

1
Variant effect prediction tools assessed using independent, functional assay-based datasets: implications for discovery and diagnostics.使用基于独立功能测定数据集评估的变异效应预测工具:对发现和诊断的影响。
Hum Genomics. 2017 May 16;11(1):10. doi: 10.1186/s40246-017-0104-8.
2
A Comprehensive Evaluation of the Performance of Prediction Algorithms on Clinically Relevant Missense Variants.临床相关错义变异预测算法性能的综合评估。
Int J Mol Sci. 2022 Jul 19;23(14):7946. doi: 10.3390/ijms23147946.
3
Quantitative Missense Variant Effect Prediction Using Large-Scale Mutagenesis Data.利用大规模诱变数据进行定量错义变异效应预测。
Cell Syst. 2018 Jan 24;6(1):116-124.e3. doi: 10.1016/j.cels.2017.11.003. Epub 2017 Dec 6.
4
Representativeness of variation benchmark datasets.变异性基准数据集的代表性。
BMC Bioinformatics. 2018 Nov 29;19(1):461. doi: 10.1186/s12859-018-2478-6.
5
Updated benchmarking of variant effect predictors using deep mutational scanning.使用深度突变扫描对变异效应预测器进行更新的基准测试。
Mol Syst Biol. 2023 Aug 8;19(8):e11474. doi: 10.15252/msb.202211474. Epub 2023 Jun 13.
6
A variant by any name: quantifying annotation discordance across tools and clinical databases.无论名称如何的变体:量化不同工具和临床数据库之间的注释不一致性。
Genome Med. 2017 Jan 26;9(1):7. doi: 10.1186/s13073-016-0396-7.
7
Family-Based Benchmarking of Copy Number Variation Detection Software.基于家族的拷贝数变异检测软件基准测试
PLoS One. 2015 Jul 21;10(7):e0133465. doi: 10.1371/journal.pone.0133465. eCollection 2015.
8
Assessing performance of pathogenicity predictors using clinically relevant variant datasets.使用与临床相关的变异数据集评估致病性预测因子的性能。
J Med Genet. 2021 Aug;58(8):547-555. doi: 10.1136/jmedgenet-2020-107003. Epub 2020 Aug 25.
9
COGNIZER: A Framework for Functional Annotation of Metagenomic Datasets.认知器:宏基因组数据集功能注释框架
PLoS One. 2015 Nov 11;10(11):e0142102. doi: 10.1371/journal.pone.0142102. eCollection 2015.
10
A comprehensive benchmark of RNA-RNA interaction prediction tools for all domains of life.针对生命所有领域的RNA - RNA相互作用预测工具的全面基准测试。
Bioinformatics. 2017 Apr 1;33(7):988-996. doi: 10.1093/bioinformatics/btw728.

引用本文的文献

1
Dynamicasome-a molecular dynamics-guided and AI-driven pathogenicity prediction catalogue for all genetic mutations.动力体组——一个由分子动力学引导并由人工智能驱动的针对所有基因突变的致病性预测目录。
Commun Biol. 2025 Jul 7;8(1):958. doi: 10.1038/s42003-025-08334-y.
2
Selecting variant masks to improve power and replicability of gene-level burden tests.选择变异掩码以提高基因水平负担测试的效能和可重复性。
Res Sq. 2025 Apr 15:rs.3.rs-6322956. doi: 10.21203/rs.3.rs-6322956/v1.
3
Variant effect predictor correlation with functional assays is reflective of clinical classification performance.

本文引用的文献

1
REVEL: An Ensemble Method for Predicting the Pathogenicity of Rare Missense Variants.REVEL:一种预测罕见错义变异致病性的集成方法。
Am J Hum Genet. 2016 Oct 6;99(4):877-885. doi: 10.1016/j.ajhg.2016.08.016. Epub 2016 Sep 22.
2
Analysis of protein-coding genetic variation in 60,706 humans.对60706名人类的蛋白质编码基因变异进行分析。
Nature. 2016 Aug 18;536(7616):285-91. doi: 10.1038/nature19057.
3
TP53 Variations in Human Cancers: New Lessons from the IARC TP53 Database and Genomics Data.人类癌症中的TP53变异:来自国际癌症研究机构TP53数据库和基因组数据的新认识
变异效应预测器与功能测定的相关性反映了临床分类性能。
Genome Biol. 2025 Apr 22;26(1):104. doi: 10.1186/s13059-025-03575-w.
4
Meta-EA: a gene-specific combination of available computational tools for predicting missense variant effects.Meta-EA:一种用于预测错义变异效应的现有计算工具的基因特异性组合。
Nat Commun. 2025 Jan 2;16(1):159. doi: 10.1038/s41467-024-55066-4.
5
Development and validation of animal variant classification guidelines to objectively evaluate genetic variant pathogenicity in domestic animals.制定和验证动物变异分类指南以客观评估家畜基因变异的致病性。
Front Vet Sci. 2024 Dec 5;11:1497817. doi: 10.3389/fvets.2024.1497817. eCollection 2024.
6
Ensemble and consensus approaches to prediction of recessive inheritance for missense variants in human disease.用于预测人类疾病错义变异隐性遗传的集成和共识方法。
Cell Rep Methods. 2024 Dec 16;4(12):100914. doi: 10.1016/j.crmeth.2024.100914. Epub 2024 Dec 9.
7
Understanding the heterogeneous performance of variant effect predictors across human protein-coding genes.理解变异效应预测因子在人类蛋白质编码基因中的异质性表现。
Sci Rep. 2024 Oct 30;14(1):26114. doi: 10.1038/s41598-024-76202-6.
8
PanEffect: a pan-genome visualization tool for variant effects in maize.泛基因组效应:一种用于玉米变异效应的泛基因组可视化工具。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae073.
9
Translating pharmacogenomic sequencing data into drug response predictions-How to interpret variants of unknown significance.将药物基因组测序数据转化为药物反应预测——如何解读意义未明的变异
Br J Clin Pharmacol. 2025 Feb;91(2):252-263. doi: 10.1111/bcp.15915. Epub 2023 Oct 16.
10
In vivo versus in silico assessment of potentially pathogenic missense variants in human reproductive genes.体内与计算机模拟评估人类生殖基因中潜在致病性错义变异。
Proc Natl Acad Sci U S A. 2023 Jul 25;120(30):e2219925120. doi: 10.1073/pnas.2219925120. Epub 2023 Jul 17.
Hum Mutat. 2016 Sep;37(9):865-76. doi: 10.1002/humu.23035. Epub 2016 Jul 8.
4
The Ensembl Variant Effect Predictor.Ensembl变异效应预测器。
Genome Biol. 2016 Jun 6;17(1):122. doi: 10.1186/s13059-016-0974-4.
5
A Review of Whole-Exome Sequencing Efforts Toward Hereditary Breast Cancer Susceptibility Gene Discovery.全外显子组测序用于发现遗传性乳腺癌易感基因的研究综述。
Hum Mutat. 2016 Sep;37(9):835-46. doi: 10.1002/humu.23017. Epub 2016 Jun 27.
6
A spectral approach integrating functional genomic annotations for coding and noncoding variants.一种整合编码和非编码变异功能基因组注释的光谱方法。
Nat Genet. 2016 Feb;48(2):214-20. doi: 10.1038/ng.3477. Epub 2016 Jan 4.
7
Comparison of predicted and actual consequences of missense mutations.错义突变预测结果与实际结果的比较。
Proc Natl Acad Sci U S A. 2015 Sep 15;112(37):E5189-98. doi: 10.1073/pnas.1511585112. Epub 2015 Aug 12.
8
GESPA: classifying nsSNPs to predict disease association.GESPA:对非同义单核苷酸多态性进行分类以预测疾病关联性。
BMC Bioinformatics. 2015 Jul 25;16:228. doi: 10.1186/s12859-015-0673-2.
9
Massively Parallel Functional Analysis of BRCA1 RING Domain Variants.BRCA1 环状结构域变体的大规模平行功能分析
Genetics. 2015 Jun;200(2):413-22. doi: 10.1534/genetics.115.175802. Epub 2015 Mar 30.
10
Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology.序列变异解读的标准与指南:美国医学遗传学与基因组学学会和分子病理学协会的联合共识推荐
Genet Med. 2015 May;17(5):405-24. doi: 10.1038/gim.2015.30. Epub 2015 Mar 5.