Agricultural Biotechnology Research Center, Academia Sinica, Taipei, Taiwan.
Plant Cell Physiol. 2013 Feb;54(2):e11. doi: 10.1093/pcp/pct004. Epub 2013 Jan 16.
A specialized orchid database, named Orchidstra (URL: http://orchidstra.abrc.sinica.edu.tw), has been constructed to collect, annotate and share genomic information for orchid functional genomics studies. The Orchidaceae is a large family of Angiosperms that exhibits extraordinary biodiversity in terms of both the number of species and their distribution worldwide. Orchids exhibit many unique biological features; however, investigation of these traits is currently constrained due to the limited availability of genomic information. Transcriptome information for five orchid species and one commercial hybrid has been included in the Orchidstra database. Altogether, these comprise >380,000 non-redundant orchid transcript sequences, of which >110,000 are protein-coding genes. Sequences from the transcriptome shotgun assembly (TSA) were obtained either from output reads from next-generation sequencing technologies assembled into contigs, or from conventional cDNA library approaches. An annotation pipeline using Gene Ontology, KEGG and Pfam was built to assign gene descriptions and functional annotation to protein-coding genes. Deep sequencing of small RNA was also performed for Phalaenopsis aphrodite to search for microRNAs (miRNAs), extending the information archived for this species to miRNA annotation, precursors and putative target genes. The P. aphrodite transcriptome information was further used to design probes for an oligonucleotide microarray, and expression profiling analysis was carried out. The intensities of hybridized probes derived from microarray assays of various tissues were incorporated into the database as part of the functional evidence. In the future, the content of the Orchidstra database will be expanded with transcriptome data and genomic information from more orchid species.
一个名为 Orchidstra(网址:http://orchidstra.abrc.sinica.edu.tw)的专门兰花数据库已经建立,用于收集、注释和共享兰花功能基因组学研究的基因组信息。兰科是被子植物中一个大科,在物种数量和全球分布方面表现出非凡的生物多样性。兰花表现出许多独特的生物学特征;然而,由于基因组信息的有限可用性,目前对这些特征的研究受到限制。Orchidstra 数据库中包含五个兰花物种和一个商业杂交品种的转录组信息。这些信息总共包含超过 380,000 个非冗余的兰花转录序列,其中超过 110,000 个是蛋白质编码基因。转录组鸟枪法测序(TSA)的序列要么来自下一代测序技术的输出读取组装成 contigs,要么来自常规 cDNA 文库方法。使用基因本体论、KEGG 和 Pfam 构建了一个注释管道,为蛋白质编码基因分配基因描述和功能注释。还对蝴蝶兰进行了小 RNA 的深度测序,以搜索 microRNAs(miRNAs),将该物种的信息扩展到 miRNA 注释、前体和推定靶基因。蝴蝶兰的转录组信息进一步用于设计寡核苷酸微阵列的探针,并进行表达谱分析。各种组织的微阵列分析杂交探针的强度被纳入数据库,作为功能证据的一部分。将来,Orchidstra 数据库的内容将通过更多兰花物种的转录组数据和基因组信息进行扩展。