• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

真核生物蛋白质组中的未知蛋白。

On the Unknown Proteins of Eukaryotic Proteomes.

机构信息

US2B, UMR 6286 of CNRS, Nantes University, rue de la Houssinière, 44322, Nantes, France.

出版信息

J Mol Evol. 2023 Aug;91(4):492-501. doi: 10.1007/s00239-023-10116-1. Epub 2023 May 23.

DOI:10.1007/s00239-023-10116-1
PMID:37219573
Abstract

To study unknown proteins on a large scale, a reference system has been set up for the three better studied eukaryotic kingdoms, built with 36 proteomes as taxonomically diverse as possible. Proteins from 362 other eukaryotic proteomes with no known homologue in this set were then analyzed, focusing noteworthy on singletons, that is, on such proteins with no known homologue in their own proteome. Consistently, for a given species, no more than 12% of the singletons thus found are known at the protein level, according to Uniprot. In addition, since they rely on the information found in the alignment of homologous sequences, predictions of AlphaFold2 for their tridimensional structure are poor. In the case of metazoan species, the number of singletons rarely exceeds 1000 for the species the closest to the reference system (divergence times below 75 Myr). Interestingly, in the cases of viridiplantae and fungi, larger amounts of singletons are found for such species, as if the timescale on which singletons are added to proteomes were different in metazoa and in other eukaryotic kingdoms. In order to confirm this phenomenon, further studies of proteomes closer to those of the reference system are, however, needed.

摘要

为了大规模研究未知蛋白质,已经为三个研究较好的真核生物王国建立了一个参考系统,其中包含尽可能多样化的 36 个蛋白质组。然后分析了来自其他 362 个真核蛋白质组的蛋白质,这些蛋白质在这个集合中没有已知的同源物,特别关注单体,也就是说,在它们自己的蛋白质组中没有已知同源物的蛋白质。一致地,根据 Uniprot,对于给定的物种,在这种情况下发现的单体中不超过 12%是在蛋白质水平上已知的。此外,由于它们依赖于同源序列比对中发现的信息,因此 AlphaFold2 对其三维结构的预测很差。在后生动物物种的情况下,对于与参考系统最接近的物种(分歧时间低于 7500 万年),单体的数量很少超过 1000 个。有趣的是,在绿藻门和真菌门的情况下,对于这些物种,发现了更多的单体,好像单体添加到蛋白质组的时间尺度在后生动物和其他真核生物王国中是不同的。为了证实这一现象,然而,需要对更接近参考系统的蛋白质组进行进一步研究。

相似文献

1
On the Unknown Proteins of Eukaryotic Proteomes.真核生物蛋白质组中的未知蛋白。
J Mol Evol. 2023 Aug;91(4):492-501. doi: 10.1007/s00239-023-10116-1. Epub 2023 May 23.
2
Phyloproteomic Analysis of 11780 Six-Residue-Long Motifs Occurrences.对11780个六残基长基序出现情况的系统发育蛋白质组学分析
Biomed Res Int. 2015;2015:208346. doi: 10.1155/2015/208346. Epub 2015 May 31.
3
[Comparative analysis of internal repeating segments in proteins of species from the three kingdoms of life].[生命三域物种蛋白质内部重复片段的比较分析]
Yi Chuan Xue Bao. 2005 Mar;32(3):315-21.
4
Occurrence of disordered patterns and homorepeats in eukaryotic and bacterial proteomes.真核生物和细菌蛋白质组中无序模式和同型重复序列的出现情况。
Mol Biosyst. 2012 Jan;8(1):327-37. doi: 10.1039/c1mb05318c. Epub 2011 Oct 18.
5
The relationships between the isoelectric point and: length of proteins, taxonomy and ecology of organisms.蛋白质的等电点与蛋白质长度、生物体的分类学和生态学之间的关系。
BMC Genomics. 2007 Jun 12;8:163. doi: 10.1186/1471-2164-8-163.
6
Evolution of protein indels in plants, animals and fungi.蛋白质插入缺失在植物、动物和真菌中的进化。
BMC Evol Biol. 2013 Jul 4;13:140. doi: 10.1186/1471-2148-13-140.
7
Disordered patterns in clustered Protein Data Bank and in eukaryotic and bacterial proteomes.蛋白质数据库中聚集的蛋白质和真核生物及细菌蛋白质组中的紊乱模式。
PLoS One. 2011;6(11):e27142. doi: 10.1371/journal.pone.0027142. Epub 2011 Nov 4.
8
The draft nuclear genome sequence and predicted mitochondrial proteome of Andalucia godoyi, a protist with the most gene-rich and bacteria-like mitochondrial genome.安地西亚原绿球藻(Andalucia godoyi)的核基因组草图序列和预测的线粒体蛋白质组,这是一种具有最丰富基因和类似细菌的线粒体基因组的原生生物。
BMC Biol. 2020 Mar 2;18(1):22. doi: 10.1186/s12915-020-0741-6.
9
An atlas of protein homo-oligomerization across domains of life.生命领域中蛋白质同源寡聚体的图谱。
Cell. 2024 Feb 15;187(4):999-1010.e15. doi: 10.1016/j.cell.2024.01.022. Epub 2024 Feb 6.
10
Promiscuous Domains in Eukaryotes and HAT Proteins in FUNGI Have Followed Different Evolutionary Paths.真核生物中的混杂结构域和真菌中的 HAT 蛋白遵循不同的进化途径。
J Mol Evol. 2022 Feb;90(1):124-138. doi: 10.1007/s00239-021-10046-w. Epub 2022 Jan 27.

引用本文的文献

1
Fine-Tuning Protein Language Models Unlocks the Potential of Underrepresented Viral Proteomes.微调蛋白质语言模型可释放未充分表征的病毒蛋白质组的潜力。
bioRxiv. 2025 Jun 11:2025.04.17.649224. doi: 10.1101/2025.04.17.649224.
2
Are Most Human-Specific Proteins Encoded by Long Noncoding RNAs?大多数人类特异性蛋白是否由长非编码 RNA 编码?
J Mol Evol. 2024 Aug;92(4):363-370. doi: 10.1007/s00239-024-10174-z. Epub 2024 Jun 25.

本文引用的文献

1
TimeTree 5: An Expanded Resource for Species Divergence Times.TimeTree 5:物种分化时间的扩展资源。
Mol Biol Evol. 2022 Aug 6;39(8). doi: 10.1093/molbev/msac174.
2
The impact of AlphaFold2 one year on.AlphaFold2发布一年后的影响。 (原英文表述不太准确,推测完整意思可能是这样,根据准确英文原文调整翻译会更准确)
Nat Methods. 2022 Jan;19(1):15-20. doi: 10.1038/s41592-021-01365-3.
3
Electron cryo-tomography structure of axonemal doublet microtubule from .电子冷冻断层扫描结构的轴丝双联微管从.
Life Sci Alliance. 2021 Dec 30;5(3). doi: 10.26508/lsa.202101225. Print 2022 Mar.
4
AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.AlphaFold 蛋白质结构数据库:用高精度模型极大地扩展蛋白质序列空间的结构覆盖范围。
Nucleic Acids Res. 2022 Jan 7;50(D1):D439-D444. doi: 10.1093/nar/gkab1061.
5
De novo identification of mammalian ciliary motility proteins using cryo-EM.使用冷冻电镜对哺乳动物纤毛运动蛋白进行从头鉴定。
Cell. 2021 Nov 11;184(23):5791-5806.e19. doi: 10.1016/j.cell.2021.10.007. Epub 2021 Oct 28.
6
Applying and improving AlphaFold at CASP14.应用和改进 AlphaFold 参加 CASP14。
Proteins. 2021 Dec;89(12):1711-1721. doi: 10.1002/prot.26257.
7
Highly accurate protein structure prediction for the human proteome.高精准度的人类蛋白质组蛋白结构预测。
Nature. 2021 Aug;596(7873):590-596. doi: 10.1038/s41586-021-03828-1. Epub 2021 Jul 22.
8
The ChinaMAP analytics of deep whole genome sequences in 10,588 individuals.中国人群深度全基因组序列的 ChinaMAP 分析。
Cell Res. 2020 Sep;30(9):717-731. doi: 10.1038/s41422-020-0322-9. Epub 2020 Apr 30.
9
Structure of the Decorated Ciliary Doublet Microtubule.有被装饰的纤毛二联微管的结构。
Cell. 2019 Oct 31;179(4):909-922.e12. doi: 10.1016/j.cell.2019.09.030. Epub 2019 Oct 24.
10
Anticancer Activity of Natural Compounds from Plant and Marine Environment.植物和海洋环境天然产物的抗癌活性。
Int J Mol Sci. 2018 Nov 9;19(11):3533. doi: 10.3390/ijms19113533.