Janies Daniel A, Witter Zach, Linchangco Gregorio V, Foltz David W, Miller Allison K, Kerr Alexander M, Jay Jeremy, Reid Robert W, Wray Gregory A
Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, 9201 University City Blvd, Charlotte, NC, 28223-0001, USA.
Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, 70803, USA.
BMC Bioinformatics. 2016 Jan 22;17:48. doi: 10.1186/s12859-016-0883-2.
One of our goals for the echinoderm tree of life project (http://echinotol.org) is to identify orthologs suitable for phylogenetic analysis from next-generation transcriptome data. The current dataset is the largest assembled for echinoderm phylogeny and transcriptomics. We used RNA-Seq to profile adult tissues from 42 echinoderm specimens from 24 orders and 37 families. In order to achieve sampling members of clades that span key evolutionary divergence, many of our exemplars were collected from deep and polar seas.
A small fraction of the transcriptome data we produced is being used for phylogenetic reconstruction. Thus to make a larger dataset available to researchers with a wide variety of interests, we made a web-based application, EchinoDB (http://echinodb.uncc.edu). EchinoDB is a repository of orthologous transcripts from echinoderms that is searchable via keywords and sequence similarity.
From transcripts we identified 749,397 clusters of orthologous loci. We have developed the information technology to manage and search the loci their annotations with respect to the Sea Urchin (Strongylocentrotus purpuratus) genome. Several users have already taken advantage of these data for spin-off projects in developmental biology, gene family studies, and neuroscience. We hope others will search EchinoDB to discover datasets relevant to a variety of additional questions in comparative biology.
我们生成的转录组数据中有一小部分正用于系统发育重建。因此,为了向具有广泛兴趣的研究人员提供更大的数据集,我们制作了一个基于网络的应用程序EchinoDB(http://echinodb.uncc.edu)。EchinoDB是一个棘皮动物直系同源转录本的储存库,可通过关键词和序列相似性进行搜索。
从转录本中我们识别出749397个直系同源基因座簇。我们已经开发了信息技术来管理和搜索这些基因座及其相对于海胆(紫海胆)基因组的注释。一些用户已经利用这些数据开展了发育生物学、基因家族研究和神经科学方面的衍生项目。我们希望其他人能搜索EchinoDB,以发现与比较生物学中各种其他问题相关的数据集。