• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在 CAGI 6 实验中快速区分有害和良性错义突变。

Rapid discrimination between deleterious and benign missense mutations in the CAGI 6 experiment.

机构信息

Research and Information Systems, LLC, 1620 E. 72nd ST., Indianapolis, IN, 46240, USA.

Physics Department, Indiana University Purdue University Indianapolis, Indianapolis, IN, 46202, USA.

出版信息

Hum Genomics. 2024 Aug 27;18(1):89. doi: 10.1186/s40246-024-00655-z.

DOI:10.1186/s40246-024-00655-z
PMID:39192324
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11350969/
Abstract

We describe the machine learning tool that we applied in the CAGI 6 experiment to predict whether single residue mutations in proteins are deleterious or benign. This tool was trained using only single sequences, i.e., without multiple sequence alignments or structural information. Instead, we used global characterizations of the protein sequence. Training and testing data for human gene mutations was obtained from ClinVar (ncbi.nlm.nih.gov/pub/ClinVar/), and for non-human gene mutations from Uniprot (www.uniprot.org). Testing was done on post-training data from ClinVar. This testing yielded high AUC and Matthews correlation coefficient (MCC) for well trained examples but low generalizability. For genes with either sparse or unbalanced training data, the prediction accuracy is poor. The resulting prediction server is available online at http://www.mamiris.com/Shoni.cagi6.

摘要

我们描述了在 CAGI 6 实验中应用的机器学习工具,用于预测蛋白质中的单个残基突变是有害的还是良性的。该工具仅使用单个序列进行训练,即不使用多序列比对或结构信息。相反,我们使用了蛋白质序列的全局特征。人类基因突变的训练和测试数据来自 ClinVar(ncbi.nlm.nih.gov/pub/ClinVar/),非人类基因突变的数据来自 Uniprot(www.uniprot.org)。测试是在 ClinVar 的培训后数据上进行的。对于训练有素的示例,该测试产生了较高的 AUC 和马修斯相关系数(MCC),但通用性较低。对于训练数据稀疏或不平衡的基因,预测准确性较差。生成的预测服务器可在线获得,网址为 http://www.mamiris.com/Shoni.cagi6。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd4/11350969/5d0b61db1830/40246_2024_655_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd4/11350969/5d0b61db1830/40246_2024_655_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd4/11350969/5d0b61db1830/40246_2024_655_Fig1_HTML.jpg

相似文献

1
Rapid discrimination between deleterious and benign missense mutations in the CAGI 6 experiment.在 CAGI 6 实验中快速区分有害和良性错义突变。
Hum Genomics. 2024 Aug 27;18(1):89. doi: 10.1186/s40246-024-00655-z.
2
Assessment of computational methods for predicting the effects of missense mutations in human cancers.评估计算方法预测人类癌症中错义突变影响的研究。
BMC Genomics. 2013;14 Suppl 3(Suppl 3):S7. doi: 10.1186/1471-2164-14-S3-S7. Epub 2013 May 28.
3
Missense variant pathogenicity predictors generalize well across a range of function-specific prediction challenges.错义变异致病性预测工具在一系列特定功能的预测挑战中具有良好的通用性。
Hum Mutat. 2017 Sep;38(9):1092-1108. doi: 10.1002/humu.23258. Epub 2017 Jun 12.
4
Fast and Accurate Accessible Surface Area Prediction Without a Sequence Profile.无需序列概况即可快速准确地预测可及表面积
Methods Mol Biol. 2017;1484:127-136. doi: 10.1007/978-1-4939-6406-2_10.
5
Are machine learning based methods suited to address complex biological problems? Lessons from CAGI-5 challenges.基于机器学习的方法是否适合解决复杂的生物学问题?来自 CAGI-5 挑战赛的经验教训。
Hum Mutat. 2019 Sep;40(9):1455-1462. doi: 10.1002/humu.23784. Epub 2019 Jun 18.
6
Identifying Mendelian disease genes with the variant effect scoring tool.使用变异效应评分工具鉴定孟德尔疾病基因。
BMC Genomics. 2013;14 Suppl 3(Suppl 3):S3. doi: 10.1186/1471-2164-14-S3-S3. Epub 2013 May 28.
7
A gene-specific method for predicting hemophilia-causing point mutations.一种用于预测导致血友病的点突变的基因特异性方法。
J Mol Biol. 2013 Nov 1;425(21):4023-33. doi: 10.1016/j.jmb.2013.07.037. Epub 2013 Aug 3.
8
Predicting functional effect of human missense mutations using PolyPhen-2.使用PolyPhen-2预测人类错义突变的功能效应。
Curr Protoc Hum Genet. 2013 Jan;Chapter 7:Unit7.20. doi: 10.1002/0471142905.hg0720s76.
9
Objective assessment of the evolutionary action equation for the fitness effect of missense mutations across CAGI-blinded contests.对跨CAGI盲测的错义突变适应性效应的进化作用方程进行客观评估。
Hum Mutat. 2017 Sep;38(9):1072-1084. doi: 10.1002/humu.23266. Epub 2017 Jun 21.
10
Assessing predictions of fitness effects of missense mutations in SUMO-conjugating enzyme UBE2I.评估小泛素样修饰物(SUMO)缀合酶UBE2I中错义突变的适应性效应预测。
Hum Mutat. 2017 Sep;38(9):1051-1063. doi: 10.1002/humu.23293.

本文引用的文献

1
CAGI, the Critical Assessment of Genome Interpretation, establishes progress and prospects for computational genetic variant interpretation methods.CAGI,即基因组解读的关键评估,旨在评估计算遗传变异解读方法的进展和前景。
Genome Biol. 2024 Feb 22;25(1):53. doi: 10.1186/s13059-023-03113-6.
2
Germline genetic variation and predicting immune checkpoint inhibitor induced toxicity.生殖系基因变异与预测免疫检查点抑制剂诱导的毒性。
NPJ Genom Med. 2022 Dec 24;7(1):73. doi: 10.1038/s41525-022-00345-6.
3
Deep learning and protein structure modeling.
深度学习与蛋白质结构建模。
Nat Methods. 2022 Jan;19(1):13-14. doi: 10.1038/s41592-021-01360-8.
4
Critical assessment of methods of protein structure prediction (CASP)-Round XIV.蛋白质结构预测方法的关键性评估(CASP)-第十四轮。
Proteins. 2021 Dec;89(12):1607-1617. doi: 10.1002/prot.26237. Epub 2021 Oct 7.
5
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
6
UniProt: the universal protein knowledgebase in 2021.UniProt:2021 年的通用蛋白质知识库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D480-D489. doi: 10.1093/nar/gkaa1100.
7
Using common genetic variation to examine phenotypic expression and risk prediction in 22q11.2 deletion syndrome.利用常见遗传变异研究 22q11.2 缺失综合征的表型表达和风险预测。
Nat Med. 2020 Dec;26(12):1912-1918. doi: 10.1038/s41591-020-1103-1. Epub 2020 Nov 9.
8
Predicting geographic location from genetic variation with deep neural networks.利用深度神经网络从遗传变异中预测地理位置。
Elife. 2020 Jun 8;9:e54507. doi: 10.7554/eLife.54507.
9
Update on NAFLD genetics: From new variants to the clinic.非酒精性脂肪性肝病遗传学研究进展:从新的变异到临床应用。
J Hepatol. 2020 Jun;72(6):1196-1209. doi: 10.1016/j.jhep.2020.02.020. Epub 2020 Mar 4.
10
Rhapsody: predicting the pathogenicity of human missense variants.Rhapsody:预测人类错义变异的致病性。
Bioinformatics. 2020 May 1;36(10):3084-3092. doi: 10.1093/bioinformatics/btaa127.