ICAR-National Bureau of Plant Genetic Resources, Pusa, New Delhi 110012, India.
The Centre for DNA Fingerprinting and Diagnostics, Hyderabad 500039, India.
Genes (Basel). 2022 Aug 12;13(8):1433. doi: 10.3390/genes13081433.
Tinospora cordifolia, commonly known as "Giloe" in India, is a shrub belonging to the family Menispermaceae. It is an important medicinal plant known for its antipyretic, anti-inflammatory, antispasmodic, and antidiabetic properties and is used in the treatment of jaundice, gout, and rheumatism. Despite its economic importance, the limited availability of genomic resources hinders its judicious exploitation through molecular breeding or biotechnological approaches. In this study, we generated a meta-transcriptome assembly of 43,090 non-redundant transcripts by merging RNA-Seq data obtained from the Roche 454 GS-FLX and Illumina platforms, and we report the first transcriptome-based database of simple sequence repeats and transcription factors for this species, "TinoTranscriptDB" (Tinospora cordifolia Transcriptome Database). We successfully annotated 26,716 (62%) of the total transcripts against the National Center for Biotechnology Information non-redundant protein (NCBI-NR), Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), Swiss-Prot, and Pfam databases. The database contains information on 2,620 perfect simple sequence repeats (P-SSRs), with a relative abundance of 340.12 loci/Mb and a relative density of 6309.29 bp/Mb. Excluding mono-nucleotides, the most abundant SSR motifs were tri-nucleotides (54.31%), followed by di-nucleotides (37.51%), tetra-nucleotides (4.54%), penta-nucleotides (3.16%), and hexa-nucleotides (0.45%). We also identified 4,311 transcription factors (TFs) and categorized them into 55 sub-families. This database is expected to fill the gap in genomic resource availability for T. cordifolia and thus accelerate molecular breeding and related functional and applied studies aimed at the genetic improvement of T. cordifolia and related species.
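The SSR statistics quoted above (loci/Mb and bp/Mb) can be illustrated with a minimal Python sketch. It is not the pipeline used in the study: the function names `find_perfect_ssrs` and `ssr_summary`, the regex-based search, and the minimum repeat thresholds in `MIN_REPEATS` are all assumptions made for illustration, and mono-nucleotide repeats are simply left out. Relative abundance is taken here to mean SSR loci per megabase of transcript sequence, and relative density to mean SSR base pairs per megabase.

```python
import re
from collections import Counter

# Hypothetical minimum repeat counts per motif length (assumed, not from the source).
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_perfect_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, repeat_count) for each perfect SSR found in seq."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # A k-mer followed by at least n-1 exact copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves repeats of a shorter unit,
            # e.g. "ATAT" (really a di-nucleotide) or "AAA" (a mono-nucleotide).
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return hits


def ssr_summary(transcripts):
    """Count SSR loci and normalise them per megabase of input sequence."""
    total_bp = sum(len(s) for s in transcripts)
    loci = 0
    ssr_bp = 0
    by_motif_length = Counter()
    for s in transcripts:
        for _, motif, count in find_perfect_ssrs(s):
            loci += 1
            ssr_bp += len(motif) * count
            by_motif_length[len(motif)] += 1
    megabases = total_bp / 1e6
    return {
        "loci": loci,
        "abundance": loci / megabases,   # loci per Mb
        "density": ssr_bp / megabases,   # SSR bp per Mb
        "by_motif_length": dict(by_motif_length),
    }


if __name__ == "__main__":
    # Two toy transcripts: one (AT)6 locus and one (AGC)5 locus.
    toy = ["GGC" + "AT" * 6 + "CCGTA", "TT" + "AGC" * 5 + "GA"]
    print(ssr_summary(toy))
```

If the reported values follow these same definitions, dividing 2,620 loci by 340.12 loci/Mb suggests roughly 7.7 Mb of assembled transcript sequence, though the abstract does not state this figure directly.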