Liu Congcong, Dou Ying, Guan Xuelian, Fu Qiang, Zhang Ze, Hu Zenghui, Zheng Jian, Lu Yizeng, Li Wei
College of Landscape Architecture, Beijing University of Agriculture, Beijing, China.
Beijing Collaborative Innovation Center for Eco-environmental Improvement with Forestry and Fruit Trees, Beijing, China.
PLoS One. 2017 Jun 14;12(6):e0179219. doi: 10.1371/journal.pone.0179219. eCollection 2017.
Sorbus pohuashanensis is a tree species native to northern China that serves a variety of ecological purposes. It is often grown as an ornamental landscape tree for its attractive form, silver flowers in early summer, pinnate leaves in summer, and red leaves and fruits in autumn. However, development and further utilization of the species are hindered by a lack of comprehensive genetic information, which impedes research into its genetics and molecular biology. Recent advances in de novo transcriptome sequencing (RNA-seq) provide an effective means of obtaining genomic information from non-model species. Here, we applied RNA-seq to S. pohuashanensis leaves and obtained a total of 137,506 clean reads. Assembly yielded 96,213 unigenes with an average length of 770 bp. Using bioinformatics tools for functional analysis and alignment against the NCBI database, 64.5% of the unigenes could be annotated. Overall, 59,089 unigenes were annotated against the Nr (non-redundant protein) database, 35,225 against the GO (Gene Ontology) database, and 33,168 against the COG (Clusters of Orthologous Groups) database. Analysis with the KEGG (Kyoto Encyclopedia of Genes and Genomes) database indicated that 13,953 unigenes were involved in 322 metabolic pathways. Finally, simple sequence repeat (SSR) detection identified a total of 7,473 EST-SSRs distributed across 6,604 unigenes. Fifteen polymorphic SSRs were identified by screening and should be useful for future genetic research. These unigene sequences will provide important genetic resources for genetic improvement and for the investigation of biochemical processes in S. pohuashanensis.
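The SSR detection step mentioned above can be illustrated with a minimal sketch. This is not the tool or the parameters used in the study, which the abstract does not specify. The function name `find_ssrs` and the minimum repeat thresholds in `MIN_REPEATS` are assumptions chosen for illustration, and mononucleotide repeats are ignored for simplicity.

```python
import re

# Hypothetical thresholds: motif length -> minimum number of tandem repeats.
# These values are assumptions for illustration, not taken from the study.
MIN_REPEATS = {2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return a sorted list of (start, motif, repeat_count) for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # A motif of motif_len bases followed by at least (min_rep - 1) copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are themselves repeats of a shorter unit
            # (e.g. "ATAT" is really the dinucleotide "AT").
            if any(
                motif == motif[:k] * (motif_len // k)
                for k in range(1, motif_len)
                if motif_len % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return sorted(hits)


# Toy sequence containing (AT)7 and (AGC)5; not real unigene data.
example = "GGC" + "AT" * 7 + "CCGTA" + "AGC" * 5 + "TT"
print(find_ssrs(example))  # [(3, 'AT', 7), (22, 'AGC', 5)]
```

Applied to each assembled unigene in turn, a scan of this kind would give both the number of unigenes containing at least one SSR and the total SSR count, which are the two figures the abstract reports.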