• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大规模计算机统计诱变分析揭示了人类蛋白质组的有害性景观。

Large-scale in-silico statistical mutagenesis analysis sheds light on the deleteriousness landscape of the human proteome.

机构信息

Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, La Plaine Campus, Triomflaan, 1050, Brussels, Belgium.

ESAT-STADIUS, KU Leuven, Kasteelpark Arenberg 10, 3001, Leuven, Belgium.

出版信息

Sci Rep. 2018 Nov 19;8(1):16980. doi: 10.1038/s41598-018-34959-7.

DOI:10.1038/s41598-018-34959-7
PMID:30451933
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6242909/
Abstract

Next generation sequencing technologies are providing increasing amounts of sequencing data, paving the way for improvements in clinical genetics and precision medicine. The interpretation of the observed genomic variants in the light of their phenotypic effects is thus emerging as a crucial task to solve in order to advance our understanding of how exomic variants affect proteins and how the proteins' functional changes affect human health. Since the experimental evaluation of the effects of every observed variant is unfeasible, Bioinformatics methods are being developed to address this challenge in-silico, by predicting the impact of millions of variants, thus providing insight into the deleteriousness landscape of entire proteomes. Here we show the feasibility of this approach by using the recently developed DEOGEN2 variant-effect predictor to perform the largest in-silico mutagenesis scan to date. We computed the deleteriousness score of 170 million variants over 15000 human proteins and we analysed the results, investigating how the predicted deleteriousness landscape of the proteins relates to known functionally and structurally relevant protein regions and biophysical properties. Moreover, we qualitatively validated our results by comparing them with two mutagenesis studies targeting two specific proteins, showing the consistency of DEOGEN2 predictions with respect to experimental data.

摘要

下一代测序技术正在提供越来越多的测序数据,为临床遗传学和精准医学的发展铺平了道路。因此,如何根据表型效应来解释观察到的基因组变异,已成为解决这一问题的关键任务,这有助于我们了解外显子变异如何影响蛋白质,以及蛋白质功能变化如何影响人类健康。由于对每个观察到的变异的实验评估是不可行的,因此正在开发生物信息学方法来通过预测数百万个变异的影响来解决这个问题,从而深入了解整个蛋白质组的有害性景观。在这里,我们通过使用最近开发的 DEOGEN2 变体效应预测器来展示这种方法的可行性,进行了迄今为止最大的基于计算的诱变扫描。我们计算了 1.7 亿个变体在 15000 个人类蛋白质上的有害性评分,并对结果进行了分析,研究了蛋白质的预测有害性景观与已知功能和结构相关的蛋白质区域和生物物理性质之间的关系。此外,我们通过将结果与针对两个特定蛋白质的两个诱变研究进行定性比较来验证我们的结果,表明 DEOGEN2 预测与实验数据的一致性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/cd8d30b654d1/41598_2018_34959_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/7c6a8fd22810/41598_2018_34959_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/3a795fbc6f55/41598_2018_34959_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/0f6c7af90981/41598_2018_34959_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/8db8c24e0c91/41598_2018_34959_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/720704416cdf/41598_2018_34959_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/f286d2602074/41598_2018_34959_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/27741fd4eff6/41598_2018_34959_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/22e6e2e21db6/41598_2018_34959_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/19e4f74af542/41598_2018_34959_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/cd8d30b654d1/41598_2018_34959_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/7c6a8fd22810/41598_2018_34959_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/3a795fbc6f55/41598_2018_34959_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/0f6c7af90981/41598_2018_34959_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/8db8c24e0c91/41598_2018_34959_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/720704416cdf/41598_2018_34959_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/f286d2602074/41598_2018_34959_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/27741fd4eff6/41598_2018_34959_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/22e6e2e21db6/41598_2018_34959_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/19e4f74af542/41598_2018_34959_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a18/6242909/cd8d30b654d1/41598_2018_34959_Fig10_HTML.jpg

相似文献

1
Large-scale in-silico statistical mutagenesis analysis sheds light on the deleteriousness landscape of the human proteome.大规模计算机统计诱变分析揭示了人类蛋白质组的有害性景观。
Sci Rep. 2018 Nov 19;8(1):16980. doi: 10.1038/s41598-018-34959-7.
2
DEOGEN2: prediction and interactive visualization of single amino acid variant deleteriousness in human proteins.DEOGEN2:人类蛋白质中单氨基酸变异有害性的预测和交互式可视化。
Nucleic Acids Res. 2017 Jul 3;45(W1):W201-W206. doi: 10.1093/nar/gkx390.
3
In Silico Prediction of Deleteriousness for Nonsynonymous and Splice-Altering Single Nucleotide Variants in the Human Genome.人类基因组中非同义及剪接改变单核苷酸变异的有害性的计算机模拟预测
Methods Mol Biol. 2017;1498:191-197. doi: 10.1007/978-1-4939-6472-7_13.
4
MutaFrame-an interpretative visualization framework for deleteriousness prediction of missense variants in the human exome.MutaFrame——一种用于预测人类外显子错义变异有害性的解释性可视化框架。
Bioinformatics. 2021 Dec 22;38(1):265-266. doi: 10.1093/bioinformatics/btab453.
5
pycofitness-Evaluating the fitness landscape of RNA and protein sequences.pycofitness—评估 RNA 和蛋白质序列的适应性景观。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae074.
6
Interpreting functional effects of coding variants: challenges in proteome-scale prediction, annotation and assessment.解读编码变体的功能效应:蛋白质组规模预测、注释及评估中的挑战
Brief Bioinform. 2016 Sep;17(5):841-62. doi: 10.1093/bib/bbv084. Epub 2015 Oct 22.
7
Comparison of different functional prediction scores using a gene-based permutation model for identifying cancer driver genes.基于基因排列模型的不同功能预测评分比较,用于鉴定癌症驱动基因。
BMC Med Genomics. 2019 Jan 31;12(Suppl 1):22. doi: 10.1186/s12920-018-0452-9.
8
In silico functional dissection of saturation mutagenesis: Interpreting the relationship between phenotypes and changes in protein stability, interactions and activity.饱和诱变的计算机功能剖析:解读表型与蛋白质稳定性、相互作用及活性变化之间的关系。
Sci Rep. 2016 Jan 22;6:19848. doi: 10.1038/srep19848.
9
10
Taxonomic Landscape of the Dark Proteomes: Whole-Proteome Scale Interplay Between Structural Darkness, Intrinsic Disorder, and Crystallization Propensity.暗蛋白质组的分类学景观:结构黑暗、内在无序和结晶倾向之间的全蛋白质组规模相互作用。
Proteomics. 2018 Nov;18(21-22):e1800243. doi: 10.1002/pmic.201800243. Epub 2018 Oct 8.

引用本文的文献

1
Insights into the inner workings of transformer models for protein function prediction.揭示用于蛋白质功能预测的变压器模型的内部工作原理。
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae031.
2
Mutations observed in somatic evolution reveal underlying gene mechanisms.体细胞进化中观察到的突变揭示了潜在的基因机制。
Commun Biol. 2023 Jul 19;6(1):753. doi: 10.1038/s42003-023-05136-y.
3
i4mC-Deep: An Intelligent Predictor of N4-Methylcytosine Sites Using a Deep Learning Approach with Chemical Properties.i4mC-Deep:一种基于深度学习方法并结合化学性质预测 N4-甲基胞嘧啶位点的智能预测器。

本文引用的文献

1
Inferring the molecular and phenotypic impact of amino acid variants with MutPred2.使用 MutPred2 推断氨基酸变异的分子和表型影响。
Nat Commun. 2020 Nov 20;11(1):5918. doi: 10.1038/s41467-020-19669-x.
2
Understanding mutational effects in digenic diseases.了解双基因疾病中的突变效应。
Nucleic Acids Res. 2017 Sep 6;45(15):e140. doi: 10.1093/nar/gkx557.
3
Charged residues next to transmembrane regions revisited: "Positive-inside rule" is complemented by the "negative inside depletion/outside enrichment rule".重新审视跨膜区域附近的带电残基:“正内规则”被“负内耗尽/外富集规则”所补充。
Genes (Basel). 2021 Jul 23;12(8):1117. doi: 10.3390/genes12081117.
4
Fast computational mutation-response scanning of proteins.蛋白质的快速计算突变-反应扫描
PeerJ. 2021 Apr 21;9:e11330. doi: 10.7717/peerj.11330. eCollection 2021.
5
DNA sequences performs as natural language processing by exploiting deep learning algorithm for the identification of N4-methylcytosine.DNA 序列通过利用深度学习算法进行自然语言处理,用于识别 N4-甲基胞嘧啶。
Sci Rep. 2021 Jan 8;11(1):212. doi: 10.1038/s41598-020-80430-x.
6
DNC4mC-Deep: Identification and Analysis of DNA N4-Methylcytosine Sites Based on Different Encoding Schemes By Using Deep Learning.DNC4mC-Deep:基于深度学习的不同编码方案识别和分析 DNA N4-甲基胞嘧啶位点。
Cells. 2020 Jul 22;9(8):1756. doi: 10.3390/cells9081756.
7
Biophysical and Mechanistic Models for Disease-Causing Protein Variants.疾病相关蛋白变异的生物物理和机械模型。
Trends Biochem Sci. 2019 Jul;44(7):575-588. doi: 10.1016/j.tibs.2019.01.003. Epub 2019 Jan 31.
BMC Biol. 2017 Jul 24;15(1):66. doi: 10.1186/s12915-017-0404-4.
4
DEOGEN2: prediction and interactive visualization of single amino acid variant deleteriousness in human proteins.DEOGEN2:人类蛋白质中单氨基酸变异有害性的预测和交互式可视化。
Nucleic Acids Res. 2017 Jul 3;45(W1):W201-W206. doi: 10.1093/nar/gkx390.
5
Alpha Helices Are More Robust to Mutations than Beta Strands.α螺旋比β链对突变更具耐受性。
PLoS Comput Biol. 2016 Dec 9;12(12):e1005242. doi: 10.1371/journal.pcbi.1005242. eCollection 2016 Dec.
6
Prediction of phosphothreonine sites in human proteins by fusing different features.通过融合不同特征预测人类蛋白质中的磷酸苏氨酸位点。
Sci Rep. 2016 Oct 4;6:34817. doi: 10.1038/srep34817.
7
Investigating the Molecular Mechanisms Behind Uncharacterized Cysteine Losses from Prediction of Their Oxidation State.通过预测其氧化态来研究未表征的半胱氨酸损失背后的分子机制。
Hum Mutat. 2017 Jan;38(1):86-94. doi: 10.1002/humu.23129. Epub 2016 Oct 13.
8
Analysis of protein-coding genetic variation in 60,706 humans.对60706名人类的蛋白质编码基因变异进行分析。
Nature. 2016 Aug 18;536(7616):285-91. doi: 10.1038/nature19057.
9
Multilevel biological characterization of exomic variants at the protein level significantly improves the identification of their deleterious effects.在蛋白质水平对外显子变异进行多层次生物学特征分析,可显著提高对其有害效应的识别能力。
Bioinformatics. 2016 Jun 15;32(12):1797-804. doi: 10.1093/bioinformatics/btw094. Epub 2016 Feb 18.
10
Protein lipid modifications--More than just a greasy ballast.蛋白质脂质修饰——不仅仅是一个油腻的负担。
Proteomics. 2016 Mar;16(5):759-82. doi: 10.1002/pmic.201500353.