Yong Christina Seok Yien, Atheeqah-Hamzah Nur
Department of Biology, Faculty of Science, Universiti Putra Malaysia, Jalan UPM, 43400 Serdang, Selangor, Malaysia.
Trop Life Sci Res. 2024 Oct;35(3):121-148. doi: 10.21315/tlsr2024.35.3.6. Epub 2024 Oct 7.
Plants are rich in tandem repeat-containing proteins. It is postulated that the occurrence of tandem repeat gene families facilitates the adaptation and survival of plants in adverse environmental conditions. This study aimed to identify the tandem repeats in the transcriptome of a high-potential tropical horticultural plant, roselle (Hibiscus sabdariffa L.). A total of 92,974 annotated assembled transcripts were analysed using an in silico approach, and 6,541 transcripts encoding proteins that contain tandem repeats with lengths of 20-60 amino acid residues were identified. Domain analysis revealed a total of nine tandem repeat protein families in the transcriptome of roselle: Ankyrin repeats (ANK); Armadillo repeats (ARM); elongation factor-hand domain repeats (EF-hand); Huntingtin, elongation factor 3, protein phosphatase 2A, yeast kinase TOR1 repeats (HEAT); Kelch repeats (Kelch); leucine-rich repeats (LRR); pentatricopeptide repeats (PPR); tetratricopeptide repeats (TPR); and WD40 repeats (WD40). Functional annotation analysis further matched 6,236 transcripts to 1,045 known proteins that contain tandem repeats, including proteins implicated in plant development, protein-protein interaction, immunity and abiotic stress responses. The findings provide new insights into the occurrence of tandem repeats in the transcriptome and lay the foundation for elucidating the functional associations between tandem peptide repeats (TRs) and proteins in roselle. They also facilitate the identification of novel biotic and abiotic response-related tandem repeat genes that may be useful in breeding improved varieties.