Li Yuli, Wang Xiliang, Chen Tingting, Yao Fuwen, Li Cuiping, Tang Qingli, Sun Min, Sun Gaoyuan, Hu Songnian, Yu Jun, Song Shuhui
CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China.
Core Genomic Facility, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China.
PLoS One. 2015 May 4;10(5):e0125722. doi: 10.1371/journal.pone.0125722. eCollection 2015.
Cistanche deserticola is a completely non-photosynthetic parasitic plant of great medicinal value, distributed mainly in the deserts of Northwest China. Its dried fleshy stem is a crucial tonic in traditional Chinese medicine, used mainly to improve male sexual function and strengthen immunity, but few mechanistic studies have been conducted, partly because genomic and transcriptomic resources are lacking.
In this study, we performed deep transcriptome sequencing of the fleshy stem of C. deserticola, generating about 80 million reads by Illumina paired-end sequencing on the HiSeq2000 platform. Using the Trinity assembler, we obtained 95,787 transcript sequences ranging in length from 200 bp to 15,698 bp, with an average length of 950 bases and an N50 length of 1,519 bases. A total of 63,957 transcripts were identified as actively expressed (FPKM ≥ 0.5), of which 30,098 were annotated with gene descriptions or gene ontology terms by sequence similarity analyses against several public databases (Uniprot, NR and Nt at NCBI, and KEGG). Furthermore, we identified key enzyme genes involved in the biosynthesis of lignin and phenylethanoid glycosides (PhGs), which are known to be the primary active ingredients. Four genes encoding phenylalanine ammonia-lyase (PAL), the first key enzyme in lignin and PhG biosynthesis, were identified based on sequence comparison and phylogenetic analysis. Two biosynthesis pathways of PhGs were also proposed for the first time.
In all, we completed a global analysis of the C. deserticola fleshy stem transcriptome using RNA-seq technology. A collection of enzyme genes related to the biosynthesis of lignin and phenylethanoid glycosides was identified from the assembled and annotated transcripts, and the PAL gene family was also predicted. The sequence data from this study will provide a valuable resource for future research on phenylethanoid glycoside biosynthesis and for functional genomic studies in this important medicinal plant.
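The abstract reports two summary statistics that are easy to misread: the N50 length of the assembly and the FPKM ≥ 0.5 cut-off for active expression. The sketch below shows how such values are commonly computed, assuming the standard definitions of both quantities; the abstract does not say which tools or exact formulas the authors used. The function names `n50`, `fpkm`, and `expressed` are hypothetical, and the toy numbers are illustrative only, not data from the study.

```python
from typing import Dict, Iterable, List


def n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a set of transcript lengths.

    N50 is the length L such that transcripts of length >= L
    together contain at least half of all assembled bases.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("no transcript lengths given")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise AssertionError("unreachable: running sum always reaches half the total")


def fpkm(fragments: int, length_bp: int, total_mapped_fragments: int) -> float:
    """Fragments per kilobase of transcript per million mapped fragments.

    Assumes the standard definition; the study's exact pipeline is not
    described in the abstract.
    """
    return fragments * 1e9 / (length_bp * total_mapped_fragments)


def expressed(fpkm_by_transcript: Dict[str, float], threshold: float = 0.5) -> List[str]:
    """Keep transcript IDs whose FPKM meets the threshold (0.5 in the abstract)."""
    return [name for name, value in fpkm_by_transcript.items() if value >= threshold]


if __name__ == "__main__":
    # Toy values for illustration only.
    print(n50([200, 400, 600, 800, 1000]))
    print(fpkm(fragments=50, length_bp=1000, total_mapped_fragments=10_000_000))
    print(expressed({"t1": 0.4, "t2": 0.5, "t3": 12.0}))
```

Because N50 is weighted by base count, it is usually larger than the mean transcript length, which is consistent with the reported N50 of 1,519 bases against an average of 950 bases.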