Kontopodis Evangelos, Pierros Vasileios, Vorgias Constantinos E, Papassideri Issidora S, Stravopodis Dimitrios J, Tsangaris George Th
Proteomics Research Unit, Biomedical Research Foundation of the Academy of Athens (BRFAA), Athens 11527, Greece.
Section of Cell Biology and Biophysics, Department of Biology, School of Science, National and Kapodistrian University of Athens (NKUA), Athens 15701, Greece.
Comput Struct Biotechnol J. 2025 May 21;27:2123-2138. doi: 10.1016/j.csbj.2025.05.027. eCollection 2025.
Cellular and molecular uniqueness have recently gained eminent importance due to the large amount of data produced by the use of "omics" technologies. Although the uniqueness of peptides is a well-studied feature, an innovative and pioneering concept regarding the correlation of uniqueness, with the peptide amino acid sequence and the peptide length, is introduced and extensively analyzed for the first time in this work. We construct the human "Uniquome" by introducing the following novel peptide entities: "Core Unique Peptide" (CrUP) defined as the peptide whose sequence is accommodated, specifically and exclusively, only in one protein in a given proteome, and also bears the minimum length of an amino acid sequence; and, further, the "Composite Unique Peptide" (CmUP), the "Family Unique Peptides" (FUPs) and the "Universal Unique Peptides" (UUPs). We thoroughly analyze the human and 20 critical model organisms' proteomes. Our findings indicate that these novel peptides entities possess unique properties and crucial functions that have not been previously characterized, far beyond the identification of proteins from a single peptide, representing the "causative agents" for many other groups of peptides with important biological activities. Considering the complexity of proteomes, a systemic analysis of these new entities will enhance the potential for novel protein identification, functional prediction, and exploration of cellular and molecular uniqueness, setting new standards in medicine for advanced studies in deep proteomics/proteomes, and translational biology. Finally, our results suggest that, across species, the highly conserved sequences are not DNA nucleotides but CrUP entities.
由于使用“组学”技术产生了大量数据,细胞和分子独特性最近变得极为重要。尽管肽的独特性是一个已被充分研究的特征,但在本研究中首次引入并广泛分析了一个关于独特性与肽氨基酸序列及肽长度相关性的创新且前沿的概念。我们通过引入以下新型肽实体构建人类“独特肽组”:“核心独特肽”(CrUP),定义为其序列在给定蛋白质组中仅特异性且唯一地存在于一种蛋白质中,并且具有氨基酸序列最小长度的肽;此外,还有“复合独特肽”(CmUP)、“家族独特肽”(FUPs)和“通用独特肽”(UUPs)。我们全面分析了人类和20种关键模式生物的蛋白质组。我们的研究结果表明,这些新型肽实体具有以前未被表征的独特性质和关键功能,远远超出了从单个肽鉴定蛋白质的范畴,代表了许多其他具有重要生物活性的肽组的“致病因子”。考虑到蛋白质组的复杂性,对这些新实体进行系统分析将提高新型蛋白质鉴定、功能预测以及细胞和分子独特性探索的潜力,为深度蛋白质组学/蛋白质组和转化生物学的高级研究设定医学新标准。最后,我们的结果表明,跨物种来看,高度保守的序列不是DNA核苷酸,而是CrUP实体。