Laboratório de Parasitologia, Instituto Butantan, São Paulo 05503-900, Brazil.
Center of Toxins, Immune Response and Cell Signaling (CeTICS), Instituto Butantan, São Paulo 05503-900, Brazil.
Genes (Basel). 2020 Oct 21;11(10):1235. doi: 10.3390/genes11101235.
is the etiological agent of Chagas disease, which affects millions of people in Latin America. No transcriptional control of gene expression has been demonstrated in this organism, and 50% of its genome consists of repetitive elements and members of multigenic families. In this study, we applied a novel bioinformatics approach to predict new repetitive elements in the genome sequence of . A new repetitive sequence measuring 241 nt was identified and found to be interspersed along the genome sequence from strains of different DTUs. This new repeat was mostly on intergenic regions, and upstream and downstream regions of the 241 nt repeat were enriched in surface protein genes. RNAseq analysis revealed that the repeat was part of processed mRNAs and was predominantly found in the 3' untranslated regions (UTRs) of genes of multigenic families encoding surface proteins. Moreover, we detected a correlation between the presence of the repeat in the 3'UTR of multigenic family genes and the level of differential expression of these genes when comparing epimastigote and trypomastigote transcriptomes. These data suggest that this sequence plays a role in the posttranscriptional regulation of the expression of multigenic families.
克氏锥虫是恰加斯病的病原体,该病影响拉丁美洲数百万人。该生物的基因表达没有转录控制,其基因组的 50%由重复元件和多基因家族成员组成。在这项研究中,我们应用了一种新的生物信息学方法来预测 的基因组序列中的新重复元件。鉴定出一种新的重复序列,长 241 个核苷酸,发现在来自不同 DTU 株的基因组序列中散布。这种新的重复序列主要位于基因间区,241 个核苷酸重复序列的上下游区域富含表面蛋白基因。RNAseq 分析表明,该重复序列是加工 mRNA 的一部分,主要存在于多基因家族编码表面蛋白的基因的 3'非翻译区 (UTR)。此外,当比较epimastigote 和 trypomastigote 转录组时,我们在多基因家族基因的 3'UTR 中检测到重复存在与这些基因差异表达水平之间存在相关性。这些数据表明,该序列在多基因家族表达的转录后调控中发挥作用。