Advanced Radiation Technology Institute, Korea Atomic Energy Research Institute, Jeongup, 56212, Korea.
Department of Horticulture, College of Industrial Sciences, Kongju National University, Yesan, Chungnam, 32439, Korea.
Sci Rep. 2021 Oct 26;11(1):21094. doi: 10.1038/s41598-021-00506-0.
Faba bean (Vicia faba L.), a globally important grain legume providing a stable source of dietary protein, was one of the earliest plant cytogenetic models. However, the lack of draft genome annotations and unclear structural information on mRNA transcripts have impeded its genetic improvement. To address this, we sequenced faba bean leaf transcriptome using the PacBio single-molecule long-read isoform sequencing platform. We identified 28,569 nonredundant unigenes, ranging from 108 to 9669 bp, with a total length of 94.5 Mb. Many unigenes (3597, 12.5%) had 2-20 isoforms, indicating a highly complex transcriptome. Approximately 96.5% of the unigenes matched sequences in public databases. The predicted proteins and transcription factors included NB-ARC, Myb_domain, C3H, bHLH, and heat shock proteins, implying that this genome has an abundance of stress resistance genes. To validate our results, we selected WCOR413-15785, DHN2-12403, DHN2-14197, DHN2-14797, COR15-14478, and HVA22-15 unigenes from the ICE-CBF-COR pathway to analyze their expression patterns in cold-treated samples via qRT-PCR. The expression of dehydrin-related genes was induced by cold stress. The assembled data provide the first insights into the deep sequencing of full-length RNA from faba bean at the single-molecule level. This study provides an important foundation to improve gene modeling and protein prediction.
蚕豆(Vicia faba L.)是一种全球重要的粮食豆类作物,为膳食蛋白质提供了稳定的来源,它曾是植物细胞遗传学的早期模式生物之一。然而,缺乏基因组草图注释和 mRNA 转录本的结构信息,阻碍了其遗传改良。为了解决这个问题,我们使用 PacBio 单分子长读长异构体测序平台对蚕豆叶片转录组进行了测序。我们鉴定了 28569 个非冗余的 unigenes,长度从 108bp 到 9669bp,总长 94.5Mb。许多 unigenes(3597 个,占 12.5%)具有 2-20 个异构体,表明转录组具有高度复杂性。大约 96.5%的 unigenes与公共数据库中的序列匹配。预测的蛋白质和转录因子包括 NB-ARC、Myb 结构域、C3H、bHLH 和热休克蛋白,这表明该基因组中含有丰富的抗逆基因。为了验证我们的结果,我们选择了 ICE-CBF-COR 途径中的 WCOR413-15785、DHN2-12403、DHN2-14197、DHN2-14797、COR15-14478 和 HVA22-15 这 6 个 unigenes,通过 qRT-PCR 分析它们在冷处理样品中的表达模式。脱水素相关基因的表达受冷胁迫诱导。组装的数据首次提供了蚕豆全长 RNA 单分子水平深度测序的深入见解。本研究为基因建模和蛋白质预测的改进提供了重要基础。