Lim Sim Lin, D'Agui Haylee M, Enright Neal J, He Tianhua
Department of Environment and Agriculture, Curtin University, Perth, WA 6845, Australia.
School of Veterinary and Life Sciences, Murdoch University, Perth, WA 6150, Australia.
Genomics Proteomics Bioinformatics. 2017 Feb;15(1):49-56. doi: 10.1016/j.gpb.2016.11.001. Epub 2017 Feb 2.
Banksia is a significant element of the vegetation of southwestern Australia, a biodiversity hotspot of global significance. In particular, Banksia hookeriana is a species of significant economic and ecological importance in the region. To support better conservation and management, we report an overview of the transcriptome of B. hookeriana using RNA-seq and de novo assembly. We generated a total of 202.7 million reads (18.91 billion nucleotides) from four leaf samples taken from four plants of B. hookeriana, and assembled 59,063 unigenes (average size = 1098 bp) through de novo transcriptome assembly. Among them, 39,686 unigenes were annotated against the Swiss-Prot, Clusters of Orthologous Groups (COG), and NCBI non-redundant (NR) protein databases. We showed that there was approximately one single nucleotide polymorphism (SNP) per 5.6-7.1 kb in the transcriptome, and the ratio of transitional to transversional polymorphisms was approximately 1.82. We compared the unigenes of B. hookeriana to those of Arabidopsis thaliana and Nelumbo nucifera through sequence homology, Gene Ontology (GO) annotation, and KEGG pathway analyses. The comparative analysis revealed that the unigenes of B. hookeriana were closely related to those of N. nucifera. B. hookeriana, N. nucifera, and A. thaliana shared similar GO annotations but differed in their distributions across KEGG pathways, indicating that B. hookeriana has adapted to dry Mediterranean-type shrublands by regulating the expression of specific genes. In total, 1927 potential simple sequence repeat (SSR) markers were discovered, which could be used in genotyping and genetic diversity studies of the Banksia genus. Our results provide a valuable sequence resource for further study of Banksia.
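The two SNP summary statistics quoted in the abstract (SNP density in kb per SNP, and the transition-to-transversion ratio) can be illustrated with a minimal Python sketch. This is not the authors' pipeline, which the abstract does not describe; the function names and the toy SNP list below are hypothetical and serve only to show how such figures are conventionally computed from a list of (reference, alternate) base pairs.

```python
# Illustrative sketch only: function names and example data are hypothetical,
# not taken from the study.

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def is_transition(ref, alt):
    """Return True if ref -> alt is a purine<->purine or pyrimidine<->pyrimidine change."""
    ref, alt = ref.upper(), alt.upper()
    if ref == alt:
        return False  # identical bases are not a substitution
    pair = {ref, alt}
    return pair <= PURINES or pair <= PYRIMIDINES


def ts_tv_ratio(snps):
    """Compute the transition/transversion ratio from (ref, alt) base pairs."""
    ts = tv = 0
    for ref, alt in snps:
        if ref.upper() == alt.upper():
            continue  # skip non-variant entries
        if is_transition(ref, alt):
            ts += 1
        else:
            tv += 1
    if tv == 0:
        raise ValueError("no transversions observed; ratio is undefined")
    return ts / tv


def kb_per_snp(total_bases, snp_count):
    """Average number of kilobases per SNP across the assayed sequence."""
    if snp_count == 0:
        raise ValueError("no SNPs observed; density is undefined")
    return total_bases / snp_count / 1000


# Toy data (hypothetical): three transitions and two transversions.
toy_snps = [("A", "G"), ("C", "T"), ("G", "A"), ("A", "C"), ("G", "T")]
ratio = ts_tv_ratio(toy_snps)
density = kb_per_snp(56_000, 10)
```

With the toy list, three of the five substitutions are transitions, so the ratio is 3/2; a value near 1.82, as reported in the abstract, would mean transitions outnumber transversions by a little under two to one.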