• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过组成特征鉴定低复杂度结构域揭示了生命领域中特定类别出现的频率和功能。

Identification of Low-Complexity Domains by Compositional Signatures Reveals Class-Specific Frequencies and Functions Across the Domains of Life.

机构信息

Department of Biochemistry and Molecular Biology, Colorado State University, Fort Collins, Colorado, United States of America.

出版信息

PLoS Comput Biol. 2024 May 15;20(5):e1011372. doi: 10.1371/journal.pcbi.1011372. eCollection 2024 May.

DOI:10.1371/journal.pcbi.1011372
PMID:38748749
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11132505/
Abstract

Low-complexity domains (LCDs) in proteins are typically enriched in one or two predominant amino acids. As a result, LCDs often exhibit unusual structural/biophysical tendencies and can occupy functional niches. However, for each organism, protein sequences must be compatible with intracellular biomolecules and physicochemical environment, both of which vary from organism to organism. This raises the possibility that LCDs may occupy sequence spaces in select organisms that are otherwise prohibited in most organisms. Here, we report a comprehensive survey and functional analysis of LCDs in all known reference proteomes (>21k organisms), with added focus on rare and unusual types of LCDs. LCDs were classified according to both the primary amino acid and secondary amino acid in each LCD sequence, facilitating detailed comparisons of LCD class frequencies across organisms. Examination of LCD classes at different depths (i.e., domain of life, organism, protein, and per-residue levels) reveals unique facets of LCD frequencies and functions. To our surprise, all 400 LCD classes occur in nature, although some are exceptionally rare. A number of rare classes can be defined for each domain of life, with many LCD classes appearing to be eukaryote-specific. Certain LCD classes were consistently associated with identical functions across many organisms, particularly in eukaryotes. Our analysis methods enable simultaneous, direct comparison of all LCD classes between individual organisms, resulting in a proteome-scale view of differences in LCD frequencies and functions. Together, these results highlight the remarkable diversity and functional specificity of LCDs across all known life forms.

摘要

蛋白质中的低复杂度结构域(LCDs)通常富含一种或两种主要氨基酸。因此,LCDs 通常表现出异常的结构/生物物理倾向,并能占据功能生态位。然而,对于每个生物体,蛋白质序列必须与细胞内生物分子和物理化学环境兼容,而这些在不同的生物体中都有所不同。这就提出了一种可能性,即 LCDs 可能占据某些特定生物体的序列空间,而在大多数生物体中则被禁止。在这里,我们对所有已知参考蛋白质组(>21000 个生物体)中的 LCD 进行了全面的调查和功能分析,并特别关注罕见和不常见的 LCD 类型。根据每个 LCD 序列中的主要氨基酸和次要氨基酸对 LCD 进行分类,便于在生物体之间对 LCD 类频率进行详细比较。在不同的深度(即生命领域、生物体、蛋白质和每个残基水平)检查 LCD 类,揭示了 LCD 频率和功能的独特方面。令我们惊讶的是,所有 400 个 LCD 类都存在于自然界中,尽管有些非常罕见。对于每一个生命领域,都可以定义一些罕见的类别,其中许多 LCD 类别似乎是真核生物特有的。某些 LCD 类在许多生物体中始终与相同的功能相关联,尤其是在真核生物中。我们的分析方法使我们能够在个体生物体之间同时直接比较所有的 LCD 类,从而在蛋白质组范围内观察到 LCD 频率和功能的差异。总之,这些结果突出了 LCD 在所有已知生命形式中的显著多样性和功能特异性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/2a97a8c808ac/pcbi.1011372.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/82b934fc77c4/pcbi.1011372.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/1f4f0cd5149e/pcbi.1011372.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/897d45191794/pcbi.1011372.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/cac663aa1127/pcbi.1011372.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/1cdbfb69e80d/pcbi.1011372.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/c0eeba1dcecd/pcbi.1011372.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/b3000d6f9f0b/pcbi.1011372.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/2571e9b93eaf/pcbi.1011372.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/7ece0f433c0f/pcbi.1011372.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/76037dd7c97d/pcbi.1011372.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/cc86cbb259fd/pcbi.1011372.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/d15545a1f0b6/pcbi.1011372.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/2a97a8c808ac/pcbi.1011372.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/82b934fc77c4/pcbi.1011372.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/1f4f0cd5149e/pcbi.1011372.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/897d45191794/pcbi.1011372.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/cac663aa1127/pcbi.1011372.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/1cdbfb69e80d/pcbi.1011372.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/c0eeba1dcecd/pcbi.1011372.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/b3000d6f9f0b/pcbi.1011372.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/2571e9b93eaf/pcbi.1011372.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/7ece0f433c0f/pcbi.1011372.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/76037dd7c97d/pcbi.1011372.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/cc86cbb259fd/pcbi.1011372.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/d15545a1f0b6/pcbi.1011372.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3de/11132505/2a97a8c808ac/pcbi.1011372.g013.jpg

相似文献

1
Identification of Low-Complexity Domains by Compositional Signatures Reveals Class-Specific Frequencies and Functions Across the Domains of Life.通过组成特征鉴定低复杂度结构域揭示了生命领域中特定类别出现的频率和功能。
PLoS Comput Biol. 2024 May 15;20(5):e1011372. doi: 10.1371/journal.pcbi.1011372. eCollection 2024 May.
2
Atypical structural tendencies among low-complexity domains in the Protein Data Bank proteome.蛋白质数据库(PDB)蛋白质组中低复杂度结构域的非典型结构趋势。
PLoS Comput Biol. 2020 Jan 27;16(1):e1007487. doi: 10.1371/journal.pcbi.1007487. eCollection 2020 Jan.
3
LCD-Composer: an intuitive, composition-centric method enabling the identification and detailed functional mapping of low-complexity domains.LCD-Composer:一种直观的、以组成为中心的方法,可实现低复杂性结构域的识别和详细功能映射。
NAR Genom Bioinform. 2021 May 26;3(2):lqab048. doi: 10.1093/nargab/lqab048. eCollection 2021 Jun.
4
The LCD-Composer webserver: high-specificity identification and functional analysis of low-complexity domains in proteins.LCD-Composer 网络服务器:在蛋白质中进行低复杂度区域的高特异性识别和功能分析。
Bioinformatics. 2022 Dec 13;38(24):5446-5448. doi: 10.1093/bioinformatics/btac699.
5
Proteome-scale relationships between local amino acid composition and protein fates and functions.蛋白质组范围内的局部氨基酸组成与蛋白质命运和功能之间的关系。
PLoS Comput Biol. 2018 Sep 24;14(9):e1006256. doi: 10.1371/journal.pcbi.1006256. eCollection 2018 Sep.
6
Abundance and distributions of eukaryote protein simple sequences.真核生物蛋白质简单序列的丰度和分布
Mol Cell Proteomics. 2002 Dec;1(12):983-95. doi: 10.1074/mcp.m200032-mcp200.
7
The hnRNP-like Nab3 termination factor can employ heterologous prion-like domains in place of its own essential low complexity domain.类hnRNP的Nab3终止因子可以利用异源的朊病毒样结构域来替代其自身必需的低复杂性结构域。
PLoS One. 2017 Oct 12;12(10):e0186187. doi: 10.1371/journal.pone.0186187. eCollection 2017.
8
Projection of gene-protein networks to the functional space of the proteome and its application to analysis of organism complexity.将基因-蛋白质网络投射到蛋白质组的功能空间及其在分析生物复杂性中的应用。
BMC Genomics. 2010 Feb 10;11 Suppl 1(Suppl 1):S4. doi: 10.1186/1471-2164-11-S1-S4.
9
Systematic analysis of compositional order of proteins reveals new characteristics of biological functions and a universal correlate of macroevolution.系统分析蛋白质的组成顺序揭示了生物功能的新特征和宏观进化的普遍相关性。
PLoS Comput Biol. 2013;9(11):e1003346. doi: 10.1371/journal.pcbi.1003346. Epub 2013 Nov 21.
10
Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment.通过生物信息学实验阐明人类和人科骨骼蛋白质组鉴定中的跨物种蛋白质组效应。
BMC Evol Biol. 2018 Feb 20;18(1):23. doi: 10.1186/s12862-018-1141-1.

引用本文的文献

1
A Perspective on the Role of Mitochondrial Biomolecular Condensates (mtBCs) in Neurodegenerative Diseases and Evolutionary Links to Bacterial BCs.线粒体生物分子凝聚物(mtBCs)在神经退行性疾病中的作用及与细菌凝聚物的进化联系之展望
Int J Mol Sci. 2025 Aug 24;26(17):8216. doi: 10.3390/ijms26178216.

本文引用的文献

1
How many species are there on Earth? Progress and problems.地球上有多少物种?进展与问题。
PLoS Biol. 2023 Nov 20;21(11):e3002388. doi: 10.1371/journal.pbio.3002388. eCollection 2023 Nov.
2
One Step Closer to the Understanding of the Relationship IDR-LCR-Structure.离理解 IDR-LCR-结构的关系又近了一步。
Genes (Basel). 2023 Aug 28;14(9):1711. doi: 10.3390/genes14091711.
3
Functional Tuning of Intrinsically Disordered Regions in Human Proteins by Composition Bias.通过组成偏见对人类蛋白质中的无规卷曲区域进行功能调节。
Biomolecules. 2022 Oct 15;12(10):1486. doi: 10.3390/biom12101486.
4
The LCD-Composer webserver: high-specificity identification and functional analysis of low-complexity domains in proteins.LCD-Composer 网络服务器:在蛋白质中进行低复杂度区域的高特异性识别和功能分析。
Bioinformatics. 2022 Dec 13;38(24):5446-5448. doi: 10.1093/bioinformatics/btac699.
5
A spatiotemporal reconstruction of the pharyngeal cuticle reveals a structure rich in phase-separating proteins.咽表皮的时空重建揭示了富含相分离蛋白的结构。
Elife. 2022 Oct 19;11:e79396. doi: 10.7554/eLife.79396.
6
Regions with two amino acids in protein sequences: A step forward from homorepeats into the low complexity landscape.蛋白质序列中含有两个氨基酸的区域:从同聚物重复迈向低复杂性格局的一步。
Comput Struct Biotechnol J. 2022 Sep 18;20:5516-5523. doi: 10.1016/j.csbj.2022.09.011. eCollection 2022.
7
Compositional Bias of Intrinsically Disordered Proteins and Regions and Their Predictions.固有无序蛋白质和区域的组成偏倚及其预测。
Biomolecules. 2022 Jun 25;12(7):888. doi: 10.3390/biom12070888.
8
Expansion and functional analysis of the SR-related protein family across the domains of life.SR 相关蛋白家族在生命领域的扩张和功能分析。
RNA. 2022 Oct;28(10):1298-1314. doi: 10.1261/rna.079170.122. Epub 2022 Jul 21.
9
The intrinsically disordered region from PP2C phosphatases functions as a conserved CO sensor.来自PP2C磷酸酶的内在无序区域充当保守的一氧化碳传感器。
Nat Cell Biol. 2022 Jul;24(7):1029-1037. doi: 10.1038/s41556-022-00936-6. Epub 2022 Jun 16.
10
F/YGG-motif is an intrinsically disordered nucleic-acid binding motif.F/YGG 基序是一种固有无序的核酸结合基序。
RNA Biol. 2022;19(1):622-635. doi: 10.1080/15476286.2022.2066336. Epub 2021 Dec 31.