Tsypin Lev M, Turkewitz Aaron P
Department of Molecular Genetics and Cell Biology, University of Chicago, Chicago IL, 60637.
SoftwareX. 2017;6:165-171. doi: 10.1016/j.softx.2017.06.006. Epub 2017 Aug 16.
Identifying co-regulated genes provides a useful approach for defining pathway-specific machinery in an organism. To be efficient, this approach relies on thorough genome annotation, a process much slower than genome sequencing. Tetrahymena thermophila, a unicellular eukaryote, has been a useful model organism and has a fully sequenced but sparsely annotated genome. One important resource for studying this organism has been an online transcriptomic database. We have developed an automated approach to gene annotation in the context of transcriptome data in T. thermophila, called the Co-regulation Data Harvester (CDH). Beginning with a gene of interest, the CDH identifies co-regulated genes by accessing the transcriptome database. It then identifies their closely related genes (orthologs) in other organisms by using reciprocal BLAST searches. Finally, it collates the annotations of those orthologs' functions, which provides the user with information to help predict the cellular role of the initial query. The CDH, which is freely available, represents a powerful new tool for analyzing cell biological pathways in Tetrahymena. Moreover, to the extent that genes and pathways are conserved between organisms, the inferences obtained via the CDH should be relevant, and can be explored, in many other systems.
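The ortholog step described in the abstract can be illustrated with a reciprocal-best-hit check, a common way to apply reciprocal BLAST searches. The following is a minimal sketch under stated assumptions, not the CDH's actual implementation: the abstract does not specify the acceptance criterion, so the best-hit rule, the function names, the gene identifiers, the scores, and the annotation text are all hypothetical, and the BLAST results are represented as in-memory tables rather than fetched from any database.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical BLAST hit table: query ID -> list of (subject ID, bit score).
Hits = Dict[str, List[Tuple[str, float]]]


def best_hit(hits: Hits, query: str) -> Optional[str]:
    """Return the subject with the highest bit score, or None if no hits."""
    candidates = hits.get(query, [])
    if not candidates:
        return None
    return max(candidates, key=lambda hit: hit[1])[0]


def reciprocal_best_hits(forward: Hits, reverse: Hits,
                         queries: List[str]) -> Dict[str, str]:
    """Keep a query/subject pair only if each is the other's best hit."""
    orthologs: Dict[str, str] = {}
    for query in queries:
        subject = best_hit(forward, query)
        if subject is not None and best_hit(reverse, subject) == query:
            orthologs[query] = subject
    return orthologs


def collate_annotations(orthologs: Dict[str, str],
                        annotations: Dict[str, str]) -> Dict[str, str]:
    """Map each query gene to the annotation of its putative ortholog."""
    return {query: annotations.get(subject, "unannotated")
            for query, subject in orthologs.items()}


# Made-up example data (assumption): two co-regulated query genes.
forward = {
    "geneA": [("orthX", 250.0), ("orthY", 90.0)],
    "geneB": [("orthY", 120.0)],
}
reverse = {
    "orthX": [("geneA", 240.0)],
    "orthY": [("geneA", 95.0), ("geneB", 80.0)],
}
annotations = {"orthX": "placeholder annotation for orthX"}

orthologs = reciprocal_best_hits(forward, reverse, ["geneA", "geneB"])
summary = collate_annotations(orthologs, annotations)
print(orthologs)
print(summary)
```

In this toy data, geneB is dropped because its best forward hit, orthY, points back to geneA rather than to geneB, which is the kind of one-directional match a reciprocal search is meant to filter out.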