• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

被忽视的巨人:揭示蛋白质组中巨大蛋白质的流行程度和功能群。

The neglected giants: Uncovering the prevalence and functional groups of huge proteins in proteomes.

机构信息

Centro Andaluz de Biología del Desarrollo, Consejo Superior de Investigaciones Científicas, Campus Universidad Pablo de Olavide, Seville, Spain.

CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, U1019-UMR 9017-CIIL-Centre d'Infection et d'Immunité de Lille, University of Lille, Lille, France.

出版信息

PLoS Comput Biol. 2024 Sep 16;20(9):e1012459. doi: 10.1371/journal.pcbi.1012459. eCollection 2024 Sep.

DOI:10.1371/journal.pcbi.1012459
PMID:39283951
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11573180/
Abstract

An often-overlooked aspect of biology is formed by the outliers of the protein length distribution, specifically those proteins with more than 5000 amino acids, which we refer to as huge proteins (HPs). By examining UniprotKB, we discovered more than 41 000 HPs throughout the tree of life, with the majority found in eukaryotes. Notably, the phyla with the highest propensity for HPs are Apicomplexa and Fornicata. Moreover, we observed that certain bacteria, such as Elusimicrobiota or Planctomycetota, have a higher tendency for encoding HPs, even more than the average eukaryote. To investigate if these macro-polypeptides represent "real" proteins, we explored several indirect metrics. Additionally, orthology analyses reveals thousands of clusters of homologous sequences of HPs, revealing functional groups related to key cellular processes such as cytoskeleton organization and functioning as chaperones or as E3-ubiquitin ligases in eukaryotes. In the case of bacteria, the major clusters have functions related to non-ribosomomal peptide synthesis/polyketide synthesis, followed by pathogen-host attachment or recognition surface proteins. Further exploration of the annotations for each HPs supported the previously identified functional groups. These findings underscore the need for further investigation of the cellular and ecological roles of these HPs and their potential impact on biology and biotechnology.

摘要

生物学中一个经常被忽视的方面是由蛋白质长度分布的离群值形成的,特别是那些长度超过 5000 个氨基酸的蛋白质,我们称之为巨大蛋白质(HPs)。通过检查 UniprotKB,我们在生命之树中发现了超过 41000 种 HPs,其中大部分存在于真核生物中。值得注意的是,具有最高 HPs 倾向的门是顶复门和纤毛门。此外,我们观察到某些细菌,如 Elusimicrobiota 或 Planctomycetota,编码 HPs 的倾向更高,甚至超过平均真核生物。为了研究这些大的多肤是否代表“真正的”蛋白质,我们探索了几种间接指标。此外,同源性分析揭示了数千个 HPs 同源序列簇,揭示了与关键细胞过程相关的功能组,如细胞骨架组织和在真核生物中作为伴侣或 E3-泛素连接酶的功能。对于细菌,主要的簇具有与非核糖体肽合成/聚酮合成相关的功能,其次是病原体-宿主附着或识别表面蛋白。对每个 HPs 的注释的进一步探索支持了先前确定的功能组。这些发现强调了需要进一步研究这些 HPs 的细胞和生态作用及其对生物学和生物技术的潜在影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68f3/11573180/11b172feac89/pcbi.1012459.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68f3/11573180/13e648adcd5f/pcbi.1012459.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68f3/11573180/4faf6022550a/pcbi.1012459.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68f3/11573180/a938df604178/pcbi.1012459.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68f3/11573180/11b172feac89/pcbi.1012459.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68f3/11573180/13e648adcd5f/pcbi.1012459.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68f3/11573180/4faf6022550a/pcbi.1012459.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68f3/11573180/a938df604178/pcbi.1012459.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68f3/11573180/11b172feac89/pcbi.1012459.g004.jpg

相似文献

1
The neglected giants: Uncovering the prevalence and functional groups of huge proteins in proteomes.被忽视的巨人:揭示蛋白质组中巨大蛋白质的流行程度和功能群。
PLoS Comput Biol. 2024 Sep 16;20(9):e1012459. doi: 10.1371/journal.pcbi.1012459. eCollection 2024 Sep.
2
Identification of Low-Complexity Domains by Compositional Signatures Reveals Class-Specific Frequencies and Functions Across the Domains of Life.通过组成特征鉴定低复杂度结构域揭示了生命领域中特定类别出现的频率和功能。
PLoS Comput Biol. 2024 May 15;20(5):e1011372. doi: 10.1371/journal.pcbi.1011372. eCollection 2024 May.
3
Unexpected features of the dark proteome.黑暗蛋白质组的意外特征。
Proc Natl Acad Sci U S A. 2015 Dec 29;112(52):15898-903. doi: 10.1073/pnas.1508380112. Epub 2015 Nov 17.
4
Identification of Novel Abiotic Stress Proteins in Triticum aestivum Through Functional Annotation of Hypothetical Proteins.通过对小麦假想蛋白的功能注释鉴定新型非生物胁迫蛋白。
Interdiscip Sci. 2018 Mar;10(1):205-220. doi: 10.1007/s12539-016-0178-3. Epub 2016 Jul 16.
5
3D-GENOMICS: a database to compare structural and functional annotations of proteins between sequenced genomes.3D基因组学:一个用于比较已测序基因组之间蛋白质的结构和功能注释的数据库。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D245-50. doi: 10.1093/nar/gkh064.
6
Abundance and distributions of eukaryote protein simple sequences.真核生物蛋白质简单序列的丰度和分布
Mol Cell Proteomics. 2002 Dec;1(12):983-95. doi: 10.1074/mcp.m200032-mcp200.
7
Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment.通过生物信息学实验阐明人类和人科骨骼蛋白质组鉴定中的跨物种蛋白质组效应。
BMC Evol Biol. 2018 Feb 20;18(1):23. doi: 10.1186/s12862-018-1141-1.
8
Screening for clusters of charge in human virus proteomes.人类病毒蛋白质组中电荷簇的筛选。
BMC Genomics. 2016 Oct 17;17(Suppl 9):758. doi: 10.1186/s12864-016-3086-3.
9
Environmental proteomics: applications of proteome profiling in environmental microbiology and biotechnology.环境蛋白质组学:蛋白质组分析在环境微生物学和生物技术中的应用
Brief Funct Genomic Proteomic. 2009 Jan;8(1):75-87. doi: 10.1093/bfgp/elp005. Epub 2009 Mar 11.
10
Metaproteomics reveals persistent and phylum-redundant metabolic functional stability in adult human gut microbiomes of Crohn's remission patients despite temporal variations in microbial taxa, genomes, and proteomes.代谢蛋白质组学揭示了克罗恩病缓解患者成人肠道微生物组中持久且门类冗余的代谢功能稳定性,尽管微生物类群、基因组和蛋白质组存在时间变化。
Microbiome. 2019 Feb 11;7(1):18. doi: 10.1186/s40168-019-0631-8.

本文引用的文献

1
Protein length distribution is remarkably uniform across the tree of life.蛋白质长度分布在整个生命之树上都非常均匀。
Genome Biol. 2023 Jun 8;24(1):135. doi: 10.1186/s13059-023-02973-2.
2
Hyperactive nanobacteria with host-dependent traits pervade Omnitrophota.具有宿主依赖性特征的超活性纳米细菌普遍存在于 Omnitrophota 中。
Nat Microbiol. 2023 Apr;8(4):727-744. doi: 10.1038/s41564-022-01319-1. Epub 2023 Mar 16.
3
SARS-CoV-2 Africa dashboard for real-time COVID-19 information.SARS-CoV-2 非洲数据仪表盘,提供实时 COVID-19 信息。
Nat Microbiol. 2023 Jan;8(1):1-4. doi: 10.1038/s41564-022-01276-9.
4
Nonribosomal Peptide Synthesis Definitely Working Out of the Rules.非核糖体肽合成绝对是在突破常规。
Microorganisms. 2022 Mar 7;10(3):577. doi: 10.3390/microorganisms10030577.
5
and " Velamenicoccus archaeovorus".以及“古生菌厚壁菌门球菌属”。
Appl Environ Microbiol. 2022 Apr 12;88(7):e0240721. doi: 10.1128/aem.02407-21. Epub 2022 Mar 21.
6
SignalP 6.0 predicts all five types of signal peptides using protein language models.SignalP 6.0 使用蛋白质语言模型预测所有五种类型的信号肽。
Nat Biotechnol. 2022 Jul;40(7):1023-1025. doi: 10.1038/s41587-021-01156-3. Epub 2022 Jan 3.
7
Small proteins: overcoming size restrictions.小蛋白:克服尺寸限制。
Nat Rev Microbiol. 2022 Feb;20(2):65. doi: 10.1038/s41579-021-00672-y.
8
eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale.eggNOG-mapper v2:宏基因组尺度的功能注释、直系同源物分配和结构域预测。
Mol Biol Evol. 2021 Dec 9;38(12):5825-5829. doi: 10.1093/molbev/msab293.
9
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
10
IUPred3: prediction of protein disorder enhanced with unambiguous experimental annotation and visualization of evolutionary conservation.IUPred3:利用明确的实验注释和进化保守性可视化增强的蛋白质无序性预测。
Nucleic Acids Res. 2021 Jul 2;49(W1):W297-W303. doi: 10.1093/nar/gkab408.