• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

数千种蛋白质线性基序类别可能仍未被发现。

Thousands of protein linear motif classes may still be undiscovered.

机构信息

Laboratorio de Fisiología de Proteínas, Facultad de Ciencias Exactas y Naturales, Consejo Nacional de lnvestigaciones Cientificas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos Aires, Argentina.

Departamento de Matematica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina.

出版信息

PLoS One. 2021 May 3;16(5):e0248841. doi: 10.1371/journal.pone.0248841. eCollection 2021.

DOI:10.1371/journal.pone.0248841
PMID:33939703
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8092775/
Abstract

Linear motifs are short protein subsequences that mediate protein interactions. Hundreds of motif classes including thousands of motif instances are known. Our theory estimates how many motif classes remain undiscovered. As commonly done, we describe motif classes as regular expressions specifying motif length and the allowed amino acids at each motif position. We measure motif specificity for a pair of motif classes by quantifying how many motif-discriminating positions prevent a protein subsequence from matching the two classes at once. We derive theorems for the maximal number of motif classes that can simultaneously maintain a certain number of motif-discriminating positions between all pairs of classes in the motif universe, for a given amino acid alphabet. We also calculate the fraction of all protein subsequences that would belong to a motif class if all potential motif classes came into existence. Naturally occurring pairs of motif classes present most often a single motif-discriminating position. This mild specificity maximizes the potential number of coexisting motif classes, the expansion of the motif universe due to amino acid modifications and the fraction of amino acid sequences that code for a motif instance. As a result, thousands of linear motif classes may remain undiscovered.

摘要

线性基序是介导蛋白质相互作用的短蛋白质序列。已知数百种基序类别,包括数千个基序实例。我们的理论估计了仍未发现的基序类别数量。通常,我们将基序类别描述为指定基序长度和每个基序位置允许的氨基酸的正则表达式。我们通过量化阻止蛋白质序列同时匹配两个类别的基序区分位置的数量来衡量基序类别的特异性。我们为给定的氨基酸字母表推导出了在基序宇宙中所有类别对之间同时保持特定数量的基序区分位置的最大基序类别数量的定理。我们还计算了如果所有潜在的基序类别都存在,那么属于基序类别的所有蛋白质序列的分数。自然存在的基序类别对通常只有一个基序区分位置。这种温和的特异性最大限度地提高了共存基序类别的潜在数量、由于氨基酸修饰而扩展的基序宇宙以及编码基序实例的氨基酸序列的分数。因此,可能仍有数千种线性基序类别未被发现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bd2/8092775/a7a1cc7313c6/pone.0248841.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bd2/8092775/2523e671f418/pone.0248841.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bd2/8092775/dd4c51705dee/pone.0248841.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bd2/8092775/04d2b5b20e02/pone.0248841.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bd2/8092775/7bd89ae8f67f/pone.0248841.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bd2/8092775/3b8e2246b58f/pone.0248841.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bd2/8092775/a7a1cc7313c6/pone.0248841.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bd2/8092775/2523e671f418/pone.0248841.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bd2/8092775/dd4c51705dee/pone.0248841.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bd2/8092775/04d2b5b20e02/pone.0248841.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bd2/8092775/7bd89ae8f67f/pone.0248841.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bd2/8092775/3b8e2246b58f/pone.0248841.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bd2/8092775/a7a1cc7313c6/pone.0248841.g006.jpg

相似文献

1
Thousands of protein linear motif classes may still be undiscovered.数千种蛋白质线性基序类别可能仍未被发现。
PLoS One. 2021 May 3;16(5):e0248841. doi: 10.1371/journal.pone.0248841. eCollection 2021.
2
Mining for class-specific motifs in protein sequence classification.蛋白质序列分类中的类特异性基序挖掘。
BMC Bioinformatics. 2013 Mar 15;14:96. doi: 10.1186/1471-2105-14-96.
3
Evidence for the concerted evolution between short linear protein motifs and their flanking regions.短线性蛋白质基序与其侧翼区域之间协同进化的证据。
PLoS One. 2009 Jul 8;4(7):e6052. doi: 10.1371/journal.pone.0006052.
4
Reduced alphabet motif methodology for GPCR annotation.用于G蛋白偶联受体注释的简化字母基序方法。
J Biomol Struct Dyn. 2007 Dec;25(3):299-310. doi: 10.1080/07391102.2007.10507178.
5
Estimation and efficient computation of the true probability of recurrence of short linear protein sequence motifs in unrelated proteins.估计和有效计算短线性蛋白质序列基序在无关蛋白质中的真实重现概率。
BMC Bioinformatics. 2010 Jan 7;11:14. doi: 10.1186/1471-2105-11-14.
6
Prediction of short linear protein binding regions.预测短线性蛋白结合区域。
J Mol Biol. 2012 Jan 6;415(1):193-204. doi: 10.1016/j.jmb.2011.10.025. Epub 2011 Oct 21.
7
Profile-based short linear protein motif discovery.基于轮廓的短线性蛋白质基序发现。
BMC Bioinformatics. 2012 May 18;13:104. doi: 10.1186/1471-2105-13-104.
8
Structural alphabet motif discovery and a structural motif database.结构字母基元发现和结构基元数据库。
Comput Biol Med. 2012 Jan;42(1):93-105. doi: 10.1016/j.compbiomed.2011.10.013. Epub 2011 Nov 17.
9
Modeling within-motif dependence for transcription factor binding site predictions.用于转录因子结合位点预测的模体内依赖性建模。
Bioinformatics. 2004 Apr 12;20(6):909-16. doi: 10.1093/bioinformatics/bth006. Epub 2004 Jan 29.
10
Clustering-based approach for predicting motif pairs from protein interaction data.
J Bioinform Comput Biol. 2009 Aug;7(4):701-16. doi: 10.1142/s0219720009004266.

引用本文的文献

1
Exploiting the endogenous yeast nuclear proteome to identify short linear motifs in vivo.利用内源性酵母核蛋白组鉴定体内的短线性基序。
Cell Rep Methods. 2023 Nov 20;3(11):100637. doi: 10.1016/j.crmeth.2023.100637. Epub 2023 Nov 9.
2
Multi-dimensional feature recognition model based on capsule network for ubiquitination site prediction.基于胶囊网络的多维特征识别模型在泛素化位点预测中的应用。
PeerJ. 2022 Dec 6;10:e14427. doi: 10.7717/peerj.14427. eCollection 2022.
3
Transcription factor specificity limits the number of DNA-binding motifs.

本文引用的文献

1
ELM-the eukaryotic linear motif resource in 2020.ELM-the eukaryotic linear motif resource in 2020. ELM-the 2020 eukaryotic linear motif resource.
Nucleic Acids Res. 2020 Jan 8;48(D1):D296-D306. doi: 10.1093/nar/gkz1030.
2
Comprehensive Analysis of the Human SH3 Domain Family Reveals a Wide Variety of Non-canonical Specificities.人类SH3结构域家族的综合分析揭示了多种非典型特异性。
Structure. 2017 Oct 3;25(10):1598-1610.e3. doi: 10.1016/j.str.2017.07.017. Epub 2017 Sep 7.
3
Discovery of short linear motif-mediated interactions through phage display of intrinsically disordered regions of the human proteome.
转录因子特异性限制了 DNA 结合基序的数量。
PLoS One. 2022 Jan 28;17(1):e0263307. doi: 10.1371/journal.pone.0263307. eCollection 2022.
通过人蛋白质组内在无序区域的噬菌体展示发现短线性基序介导的相互作用。
FEBS J. 2017 Feb;284(3):485-498. doi: 10.1111/febs.13995. Epub 2017 Jan 18.
4
Experimental detection of short regulatory motifs in eukaryotic proteins: tips for good practice as well as for bad.真核生物蛋白质中短调控基序的实验检测:实践中的技巧与失误
Cell Commun Signal. 2015 Nov 18;13:42. doi: 10.1186/s12964-015-0121-y.
5
Unexpected features of the dark proteome.黑暗蛋白质组的意外特征。
Proc Natl Acad Sci U S A. 2015 Dec 29;112(52):15898-903. doi: 10.1073/pnas.1508380112. Epub 2015 Nov 17.
6
Convergent evolution and mimicry of protein linear motifs in host-pathogen interactions.宿主-病原体相互作用中蛋白质线性基序的趋同进化与模拟
Curr Opin Struct Biol. 2015 Jun;32:91-101. doi: 10.1016/j.sbi.2015.03.004. Epub 2015 Apr 2.
7
Linear motif-mediated interactions have contributed to the evolution of modularity in complex protein interaction networks.线性基序介导的相互作用促进了复杂蛋白质相互作用网络中模块化的进化。
PLoS Comput Biol. 2014 Oct 9;10(10):e1003881. doi: 10.1371/journal.pcbi.1003881. eCollection 2014 Oct.
8
A million peptide motifs for the molecular biologist.分子生物学家的百万肽基序。
Mol Cell. 2014 Jul 17;55(2):161-9. doi: 10.1016/j.molcel.2014.05.032.
9
Short linear motifs: ubiquitous and functionally diverse protein interaction modules directing cell regulation.短线性基序:指导细胞调控的普遍存在且功能多样的蛋白质相互作用模块
Chem Rev. 2014 Jul 9;114(13):6733-78. doi: 10.1021/cr400585q. Epub 2014 Jun 13.
10
Design of orthogonal genetic switches based on a crosstalk map of σs, anti-σs, and promoters.基于 σs、anti-σs 和启动子的串扰图谱设计正交遗传开关。
Mol Syst Biol. 2013 Oct 29;9:702. doi: 10.1038/msb.2013.58.