Backiyarani Suthanthiram, Chandrasekar Arumugam, Uma Subbaraya, Saraswathi Marimuthu Somasundaram
ICAR-National Research Centre for Banana, Thogamalai Road, Thayanur Post, Tiruchirapalli 620 102, Tamil Nadu, India.
J Biosci. 2019 Mar;44(1).
Transcriptome datasets that can support accelerated molecular breeding in banana remain limited. Illumina HiSeq technology was used to determine differential gene expression between contrasting cultivars, under challenged and unchallenged conditions, for three stresses (leaf spot, root lesion nematode and moisture deficit). An average of 34.72 million reads was assembled into ~47,629 contigs, and ~5,466 simple sequence repeats (SSRs) were identified from each library. GO annotation and KEGG pathway analysis were carried out for all the transcripts, and SSRs and SNPs were also detected. Based on this information, a database, MusatransSSRDB, has been developed. The database currently holds 32,800 SSRs together with the putative function of each SSR-containing gene, its metabolic pathway and its expression profile under the various stress conditions. It reports polymorphic SSRs (2,830 SSRs) between the contrasting cultivars for each stress and within a stress. Polymorphic SSRs specific to genes that are differentially expressed under challenged conditions can also be retrieved for each stress. Results are retrieved by navigating the tabs for cultivars, stress and polymorphism. The database was developed using HTML, Java and PHP; the datasets are stored in a MySQL database and are publicly accessible. This information allows banana breeders to select SSR primers according to specific objectives. Together with other genomics databases, MusatransSSRDB will facilitate genetic dissection of, and breeding for, complex traits in banana. The database is thus a step towards saving cost, time, manpower and other resources.
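The abstract does not say which tool or thresholds were used to identify SSRs in the assembled contigs. As an illustration only, a minimal Python sketch of perfect-SSR detection is given below; the function name `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are assumptions chosen for the example, not the authors' settings.

```python
import re

# Assumed thresholds (not from the source): minimum number of repeats
# required for each motif length, from di- to hexanucleotide.
MIN_REPEATS = {2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return perfect SSRs in seq as (motif, repeat_count, start, end) tuples.

    start and end are 0-based, with end exclusive.
    """
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # A motif of unit_len bases followed by at least min_rep - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit
            # (e.g. "AA" or "ATAT"), so each SSR is reported once.
            if any(
                motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            repeat_count = len(match.group(0)) // unit_len
            hits.append((motif, repeat_count, match.start(), match.end()))
    return hits


# Hypothetical contig fragment containing (AT)7 and (GAA)5.
contig = "GGC" + "AT" * 7 + "CCGTT" + "GAA" * 5 + "TC"
for motif, count, start, end in find_ssrs(contig):
    print(f"({motif}){count} at {start}-{end}")
```

Running such a scan on the contigs of two contrasting cultivars and comparing repeat counts at the same locus is one way candidate polymorphic SSRs could be flagged, although the source does not describe how polymorphism was actually called.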