Steczkiewicz Kamil, Kossakowski Aleksander, Janik Stanisław, Muszewska Anna
Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawinskiego 5A, 02-106 Warsaw, Poland.
Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Stefana Banacha 2, 02-097 Warsaw, Poland.
NAR Genom Bioinform. 2025 Feb 27;7(1):lqaf014. doi: 10.1093/nargab/lqaf014. eCollection 2025 Mar.
Reports on the diversity and occurrence of low-complexity regions (LCRs) in Eukaryota are limited. Some studies have provided a more extensive characterization of LCR-containing proteins in prokaryotes. There is a growing body of knowledge about the many biological functions attributable to LCRs. However, it is hard to determine to what extent the observed phenomena apply to fungi, since most studies of fungal LCRs have been limited to model yeasts. To fill this gap, we surveyed LCRs in proteins across all branches of the fungal tree of life. We show that both the abundance of LCRs and the abundance of proteins with LCRs are positively correlated with proteome size. We observed that most LCRs occur in proteins that contain protein domains but do not overlap with the domain regions. LCRs are associated with many duplicated protein domains. The frequency of particular amino acids in LCRs deviates from the background frequency, with a clear over-representation of amino acids with functional groups and a negative charge. Moreover, we discovered that each fungal lineage favors distinct LCR expansions. Early diverging fungal lineages differ in LCR abundance and composition, pointing to a different evolutionary trajectory for each fungal group.
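The comparison of amino acid frequencies in LCRs against a background frequency can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the pseudocount, and the toy sequences are all hypothetical, and the LCR segments are assumed to have been detected beforehand by some other tool.

```python
from collections import Counter
from math import log2

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def frequencies(sequences, pseudocount=1.0):
    """Relative frequency of each standard amino acid across the sequences.

    The pseudocount (an assumption of this sketch) keeps residues that
    never occur from producing a zero frequency.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(ch for ch in seq.upper() if ch in AMINO_ACIDS)
    total = sum(counts[a] for a in AMINO_ACIDS) + pseudocount * len(AMINO_ACIDS)
    return {a: (counts[a] + pseudocount) / total for a in AMINO_ACIDS}


def log2_enrichment(lcr_segments, background_sequences, pseudocount=1.0):
    """log2 ratio of LCR frequency to background frequency per amino acid.

    Positive values mean over-representation in LCRs, negative values
    mean under-representation.
    """
    lcr = frequencies(lcr_segments, pseudocount)
    background = frequencies(background_sequences, pseudocount)
    return {a: log2(lcr[a] / background[a]) for a in AMINO_ACIDS}


# Toy, made-up data for illustration only (not taken from the study).
toy_lcrs = ["DDDEDDEEDD", "SSSSTSSS"]
toy_background = ["MKVLAAGIVDESTRPLNQWFYHC", "MALWTRKDESGGHPLVIFNQYCA"]

scores = log2_enrichment(toy_lcrs, toy_background)
top = sorted(scores, key=scores.get, reverse=True)[:3]
print("Most enriched residues in the toy LCRs:", top)
```

In a real analysis the background would typically be the whole proteome or the non-LCR portion of the same proteins, and that choice changes the resulting scores.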