De Novo Assembly and Annotation of the Juvenile Tuber Transcriptome of a Gastrodia elata Hybrid by RNA Sequencing: Detection of SSR Markers.

Affiliations

College of Life and Health Science, Kaili University, Kaili City, 556011, Guizhou Province, People's Republic of China.

State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, South China Agricultural University, Guangzhou, 510642, China.

Publication information

Biochem Genet. 2020 Dec;58(6):914-934. doi: 10.1007/s10528-020-09983-w. Epub 2020 Jul 6.

Abstract

Gastrodia elata is a traditional Chinese herbal medicine with good therapeutic effects on various nervous and cerebrovascular diseases. In the present study, we generated 20,611,556 raw reads from the young tuber transcriptome of a G. elata hybrid (Gastrodia elata Bl. f. elata × Gastrodia elata Bl. f. pilifera) using the Illumina HiSeq™ 4000 sequencing platform. Bioinformatics analysis yielded 20,237,474 clean reads (2,529,684,250 bp in total), which were assembled de novo into 34,323 unigenes with an average length of 695.19 bp. Of these, 24,698 (71.96%) unigenes were annotated in at least one of the Nr, Swiss-Prot, COG, and KEGG databases. A total of 4,236 (12.34%) unigenes were identified as candidate transcription factors, and 2,007 (5.85%) unigenes were found to contain at least one simple sequence repeat (SSR). Among these SSRs, the AG/CT repeat motif was the most frequent, with 498 occurrences (21.67%). This study will enhance our understanding of the molecular mechanisms of physiological metabolism, growth, and development in G. elata; in particular, the abundant SSR markers will offer plenty of alternative tools for further studies in molecular genetics, molecular breeding, and association analysis.
