Development of an expressed gene catalogue and molecular markers from the de novo assembly of short sequence reads of the lentil (Lens culinaris Medik.) transcriptome.

Affiliation

National Institute of Plant Genome Research, New Delhi, India.

Publication information

Plant Biotechnol J. 2013 Sep;11(7):894-905. doi: 10.1111/pbi.12082. Epub 2013 Jun 13.

DOI:10.1111/pbi.12082

Abstract

Genomic resources such as ESTs, molecular markers and linkage maps are essential for crop improvement. However, these resources are still limited in important legumes such as lentil (Lens culinaris Medik.), which is valued worldwide as a rich source of dietary protein. In this study, the de novo transcriptome assembly of 119,855,798 short reads, generated by Illumina paired-end sequencing, was performed using various assembly programs. This resulted in 42,196 nonredundant high-quality transcripts with an average length of 810 bases, an N50 value of 1,432 and an average expression per transcript of 26.21 reads per kilobase per million (RPKM). Similarity search with the unigenes and protein sequences of other plants resulted in maximum similarity with soybean. A total of 20,009 nonredundant transcripts showed similarity with the UniProtKB database and, of these, 18,064 transcripts were grouped into three main GO categories, that is, biological process (15,126), molecular function (15,505) and cellular component (9,434). Annotated transcripts were mapped to 289 predicted Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, and 8,893 transcripts were classified into 24 functional categories based on Clusters of Orthologous Groups (COG) of proteins. Mining the data set for the presence of SSRs resulted in 8,722 SSRs with a frequency of one SSR per 3.92 kb. From these, 5,673 SSR primer pairs were designed, and a subset of these was utilized for diversity analysis. This study, which provides a large data set of annotated transcripts and gene-based SSR markers, would serve as a foundation for various applications in lentil breeding and genetics.
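The summary statistics quoted in the abstract (N50, RPKM and SSR frequency) follow standard definitions, and a minimal Python sketch of them may help when reading the numbers. The function names below are illustrative and are not taken from the paper, and the total assembly length is only approximated here as transcript count times the rounded average length, which is an assumption rather than a figure the abstract reports.

```python
def n50(lengths):
    """Return the N50: the length L such that contigs of length >= L
    together contain at least half of all assembled bases."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    raise ValueError("lengths must not be empty")


def rpkm(mapped_reads, transcript_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    return mapped_reads * 1e9 / (transcript_length_bp * total_mapped_reads)


def ssr_spacing_kb(total_bases, ssr_count):
    """Average number of kilobases of sequence per SSR."""
    return total_bases / ssr_count / 1000


if __name__ == "__main__":
    # Toy contig lengths, not data from the study.
    print(n50([2, 2, 2, 3, 3, 4, 8, 8]))

    # Assumption: total length ~ 42,196 transcripts x 810 bases on average.
    approx_total_bases = 42_196 * 810
    print(round(ssr_spacing_kb(approx_total_bases, 8_722), 2))
```

Under that approximation, the spacing works out to roughly 3.92 kb per SSR, which is consistent with the frequency stated in the abstract.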

Similar articles

De novo assembly and characterization of bark transcriptome using Illumina sequencing and development of EST-SSR markers in rubber tree (Hevea brasiliensis Muell. Arg.).

BMC Genomics. 2012 May 18;13:192. doi: 10.1186/1471-2164-13-192.

Transcriptome sequencing of lentil based on second-generation technology permits large-scale unigene assembly and SSR marker discovery.

BMC Genomics. 2011 May 25;12:265. doi: 10.1186/1471-2164-12-265.

De novo sequencing analysis of the Rosa roxburghii fruit transcriptome reveals putative ascorbate biosynthetic genes and EST-SSR markers.

Gene. 2015 Apr 25;561(1):54-62. doi: 10.1016/j.gene.2015.02.054. Epub 2015 Feb 19.

De novo Assembly, Characterization of Immature Seed Transcriptome and Development of Genic-SSR Markers in Black Gram [Vigna mungo (L.) Hepper].

PLoS One. 2015 Jun 4;10(6):e0128748. doi: 10.1371/journal.pone.0128748. eCollection 2015.

Illumina-based de novo transcriptome sequencing and analysis of Amanita exitialis basidiocarps.

Gene. 2013 Dec 10;532(1):63-71. doi: 10.1016/j.gene.2013.09.014. Epub 2013 Sep 17.

De Novo Transcriptome Assembly of the Chinese Swamp Buffalo by RNA Sequencing and SSR Marker Discovery.

PLoS One. 2016 Jan 14;11(1):e0147132. doi: 10.1371/journal.pone.0147132. eCollection 2016.

De Novo Assembly and Annotation of the Chinese Chive (Allium tuberosum Rottler ex Spr.) Transcriptome Using the Illumina Platform.

PLoS One. 2015 Jul 23;10(7):e0133312. doi: 10.1371/journal.pone.0133312. eCollection 2015.

De novo assembly and characterization of root transcriptome using Illumina paired-end sequencing and development of cSSR markers in sweet potato (Ipomoea batatas).

BMC Genomics. 2010 Dec 24;11:726. doi: 10.1186/1471-2164-11-726.

Analysis of the Dendrobium officinale transcriptome reveals putative alkaloid biosynthetic genes and genetic markers.

Gene. 2013 Sep 15;527(1):131-8. doi: 10.1016/j.gene.2013.05.073. Epub 2013 Jun 10.

Cited by

Modern Plant Breeding Techniques in Crop Improvement and Genetic Diversity: From Molecular Markers and Gene Editing to Artificial Intelligence-A Critical Review.

Plants (Basel). 2024 Sep 24;13(19):2676. doi: 10.3390/plants13192676.

Development and Validation of Gene-Based SSR Markers in the Genus .

Scientifica (Cairo). 2023 Oct 30;2023:6624354. doi: 10.1155/2023/6624354. eCollection 2023.

Next-Generation-Sequencing-Based Simple Sequence Repeat (SSR) Marker Development and Linkage Mapping in Lentil ( L.).

Life (Basel). 2023 Jul 18;13(7):1579. doi: 10.3390/life13071579.

Genome-wide discovery of di-nucleotide SSR markers based on whole genome re-sequencing data of Cicer arietinum L. and Cicer reticulatum Ladiz.

Sci Rep. 2023 Jun 26;13(1):10351. doi: 10.1038/s41598-023-37268-w.

The Prospects of gene introgression from crop wild relatives into cultivated lentil for climate change mitigation.

Front Plant Sci. 2023 Mar 10;14:1127239. doi: 10.3389/fpls.2023.1127239. eCollection 2023.

Omics Path to Increasing Productivity in Less-Studied Crops Under Changing Climate-Lentil a Case Study.

Front Plant Sci. 2022 May 9;13:813985. doi: 10.3389/fpls.2022.813985. eCollection 2022.

Insights into the Host-Pathogen Interaction Pathways through RNA-Seq Analysis of Medik. in Response to Infection.

Genes (Basel). 2021 Dec 29;13(1):90. doi: 10.3390/genes13010090.

Genomics Associated Interventions for Heat Stress Tolerance in Cool Season Adapted Grain Legumes.

Int J Mol Sci. 2021 Dec 30;23(1):399. doi: 10.3390/ijms23010399.

Identification of Pueraria spp. through DNA barcoding and comparative transcriptomics.

BMC Plant Biol. 2022 Jan 3;22(1):10. doi: 10.1186/s12870-021-03383-x.

Toward understanding of the methoxylated flavonoid biosynthesis pathway in Dracocephalum kotschyi Boiss.

Sci Rep. 2021 Oct 1;11(1):19549. doi: 10.1038/s41598-021-99066-6.
