Jagadeesh Jaiganesh, Vembar Shruthi Sridhar
Institute of Bioinformatics and Applied Biotechnology, Bengaluru, Karnataka, India.
Sci Rep. 2024 Dec 5;14(1):30363. doi: 10.1038/s41598-024-79937-4.
The DNA/RNA-binding Alba domain is prevalent across all kingdoms of life. First discovered in archaea, this protein domain has evolved from RNA- to DNA-binding, with a concomitant expansion in the range of cellular processes that it regulates. Despite its widespread presence, the full extent of its sequence, structural, and functional diversity remains unexplored. In this study, we employed iterative searches in PSI-BLAST to identify 15,161 unique Alba domain-containing proteins from the NCBI non-redundant protein database. Sequence similarity network (SSN) analysis clustered them into 13 distinct subgroups, including the archaeal Alba and eukaryotic Rpp20/Pop7 and Rpp25/Pop6 groups, as well as novel fungal and Plasmodium-specific Albas. Sequence and structural conservation analysis of the subgroups indicated high preservation of the dimer interface, with Alba domains from unicellular eukaryotes notably exhibiting structural deviations towards their C-terminal end. Finally, phylogenetic analysis, while supporting SSN clustering, revealed the evolutionary branchpoint at which the eukaryotic Rpp20- and Rpp25-like clades emerged from archaeal Albas, and the subsequent taxonomic lineage-based divergence within each clade. Taken together, this comprehensive analysis enhances our understanding of the evolutionary history of Alba domain-containing proteins across diverse organisms.
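The clustering idea behind a sequence similarity network can be illustrated with a minimal sketch: sequences are nodes, an edge joins two sequences whose pairwise similarity score meets a cutoff, and each connected component is read as a subgroup. The abstract does not name the SSN tool, the scoring scheme, or the threshold used, so the function name `ssn_clusters`, the score values, the sequence IDs, and the cutoff of 50 below are all hypothetical and serve only as a toy example, not as the authors' pipeline.

```python
from collections import defaultdict


def ssn_clusters(pairwise_scores, threshold):
    """Group sequence IDs into connected components of a similarity network.

    pairwise_scores: dict mapping (id_a, id_b) -> similarity score
                     (higher means more similar; hypothetical units).
    threshold:       hypothetical cutoff; pairs scoring below it get no edge.
    """
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    for (a, b), score in pairwise_scores.items():
        # Register both nodes even when the edge is dropped,
        # so unconnected sequences still appear as singletons.
        find(a)
        find(b)
        if score >= threshold:
            union(a, b)

    groups = defaultdict(set)
    for node in list(parent):
        groups[find(node)].add(node)

    # Largest clusters first; ties broken alphabetically for stable output.
    return sorted((sorted(g) for g in groups.values()),
                  key=lambda g: (-len(g), g))


# Toy input: invented IDs and scores, not data from the study.
toy_scores = {
    ("arch1", "arch2"): 80,
    ("arch2", "arch3"): 75,
    ("rpp20_a", "rpp20_b"): 90,
    ("arch3", "rpp20_a"): 20,  # weak link, removed at threshold 50
}
clusters = ssn_clusters(toy_scores, threshold=50)
print(clusters)
```

Lowering the cutoff merges subgroups and raising it splits them, which is why the number of subgroups reported by an SSN analysis depends on the chosen threshold.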