Bose Mazumdar Aparupa, Chattopadhyay Sharmila
Organic and Medicinal Chemistry Division, Plant Biology Laboratory, Council for Scientific and Industrial Research-Indian Institute of Chemical Biology Kolkata, India.
Front Plant Sci. 2016 Jan 28;6:1199. doi: 10.3389/fpls.2015.01199. eCollection 2015.
Phyllanthus amarus Schum. and Thonn., a widely distributed annual medicinal herb has a long history of use in the traditional system of medicine for over 2000 years. However, the lack of genomic data for P. amarus, a non-model organism hinders research at the molecular level. In the present study, high-throughput sequencing technology has been employed to enhance better understanding of this herb and provide comprehensive genomic information for future work. Here P. amarus leaf transcriptome was sequenced using the Illumina Miseq platform. We assembled 85,927 non-redundant (nr) "unitranscript" sequences with an average length of 1548 bp, from 18,060,997 raw reads. Sequence similarity analyses and annotation of these unitranscripts were performed against databases like green plants nr protein database, Gene Ontology (GO), Clusters of Orthologous Groups (COG), PlnTFDB, KEGG databases. As a result, 69,394 GO terms, 583 enzyme codes (EC), 134 KEGG maps, and 59 Transcription Factor (TF) families were generated. Functional and comparative analyses of assembled unitranscripts were also performed with the most closely related species like Populus trichocarpa and Ricinus communis using TRAPID. KEGG analysis showed that a number of assembled unitranscripts were involved in secondary metabolites, mainly phenylpropanoid, flavonoid, terpenoids, alkaloids, and lignan biosynthetic pathways that have significant medicinal attributes. Further, Fragments Per Kilobase of transcript per Million mapped reads (FPKM) values of the identified secondary metabolite pathway genes were determined and Reverse Transcription PCR (RT-PCR) of a few of these genes were performed to validate the de novo assembled leaf transcriptome dataset. In addition 65,273 simple sequence repeats (SSRs) were also identified. To the best of our knowledge, this is the first transcriptomic dataset of P. amarus till date. Our study provides the largest genetic resource that will lead to drug development and pave the way in deciphering various secondary metabolite biosynthetic pathways in P. amarus, especially those conferring the medicinal attributes of this potent herb.
余甘子(Phyllanthus amarus Schum. and Thonn.)是一种广泛分布的一年生药用草本植物,在传统医学体系中有超过2000年的使用历史。然而,作为一种非模式生物,余甘子缺乏基因组数据,这阻碍了分子水平的研究。在本研究中,采用了高通量测序技术来更好地了解这种草本植物,并为未来的研究提供全面的基因组信息。在这里,使用Illumina Miseq平台对余甘子叶片转录组进行了测序。我们从18,060,997条原始 reads 中组装了85,927条非冗余(nr)“单转录本”序列,平均长度为1548 bp。对这些单转录本进行了序列相似性分析和注释,比对的数据库包括绿色植物nr蛋白质数据库、基因本体(GO)、直系同源簇(COG)、植物转录因子数据库(PlnTFDB)、KEGG数据库。结果,生成了69,394个GO术语、583个酶代码(EC)、134个KEGG图谱和59个转录因子(TF)家族。还使用TRAPID对组装的单转录本与最密切相关的物种如毛果杨(Populus trichocarpa)和蓖麻(Ricinus communis)进行了功能和比较分析。KEGG分析表明,许多组装的单转录本参与了次生代谢物的合成,主要是苯丙烷类、黄酮类、萜类、生物碱和木脂素生物合成途径,这些途径具有重要的药用特性。此外,确定了已鉴定的次生代谢物途径基因的每百万映射 reads 中每千碱基转录本的片段数(FPKM)值,并对其中一些基因进行了逆转录聚合酶链反应(RT-PCR),以验证从头组装的叶片转录组数据集。此外,还鉴定了65,273个简单序列重复(SSR)。据我们所知,这是迄今为止余甘子的第一个转录组数据集。我们的研究提供了最大的遗传资源,将有助于药物开发,并为解读余甘子中各种次生代谢物生物合成途径,特别是那些赋予这种强效草本植物药用特性的途径铺平道路。