Department of Epidemiology of Microbial Diseases, School of Public Health, Yale University, New Haven, CT, USA.
BMC Biol. 2014 Feb 19;12:14. doi: 10.1186/1741-7007-12-14.
Although technical advances in genomics and proteomics research have yielded a better understanding of the coding capacity of a genome, one major challenge remaining is the identification of all expressed proteins, especially those less than 100 amino acids in length. Such information can be particularly relevant to human pathogens, such as Trypanosoma brucei, the causative agent of African trypanosomiasis, since it will provide further insight into the parasite biology and life cycle.
Starting with 993 T. brucei transcripts, previously shown by RNA-Sequencing not to coincide with annotated coding sequences (CDS), homology searches revealed that 173 predicted short open reading frames in these transcripts are conserved across kinetoplastids with 13 also conserved in representative eukaryotes. Mining mass spectrometry data sets revealed 42 transcripts encoding at least one matching peptide. RNAi-induced down-regulation of these 42 transcripts revealed seven to be essential in insect-form trypanosomes with two also required for the bloodstream life cycle stage. To validate the specificity of the RNAi results, each lethal phenotype was rescued by co-expressing an RNAi-resistant construct of each corresponding CDS. These previously non-annotated essential small proteins localized to a variety of cell compartments, including the cell surface, mitochondria, nucleus and cytoplasm, inferring the diverse biological roles they are likely to play in T. brucei. We also provide evidence that one of these small proteins is required for replicating the kinetoplast (mitochondrial) DNA.
Our studies highlight the presence and significance of small proteins in a protist and expose potential new targets to block the survival of trypanosomes in the insect vector and/or the mammalian host.
尽管基因组学和蛋白质组学研究的技术进步使人们对基因组的编码能力有了更好的理解,但仍面临一个主要挑战,即识别所有表达的蛋白质,尤其是那些长度小于 100 个氨基酸的蛋白质。此类信息对于人类病原体(如引起非洲锥虫病的布氏锥虫)可能特别重要,因为它将进一步深入了解寄生虫的生物学和生命周期。
从之前通过 RNA 测序显示与注释编码序列 (CDS) 不重合的 993 个 T. brucei 转录本开始,同源搜索显示,这些转录本中预测的 173 个短开放阅读框在动质体中是保守的,其中 13 个在代表性真核生物中也是保守的。挖掘质谱数据集揭示了至少有一个匹配肽的 42 个转录本。这些 42 个转录本的 RNAi 诱导下调表明,在昆虫型锥虫中有 7 个是必需的,其中 2 个也需要血腔生命周期阶段。为了验证 RNAi 结果的特异性,将每个致死表型用相应 CDS 的 RNAi 抗性构建体共表达进行拯救。这些以前未注释的必需小蛋白定位于多种细胞区室,包括细胞表面、线粒体、核和细胞质,推断它们在 T. brucei 中可能发挥的多种生物学作用。我们还提供了证据表明,这些小蛋白之一对于复制动质体(线粒体)DNA 是必需的。
我们的研究强调了小型蛋白质在原生动物中的存在和重要性,并揭示了潜在的新靶点,以阻止锥虫在昆虫媒介和/或哺乳动物宿主中的存活。