• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用短读 RNA-Seq 数据优化从头构建的普通小麦转录组组装。

Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data.

机构信息

Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Zhongguancun, Beijing, People's Republic of China.

出版信息

BMC Genomics. 2012 Aug 14;13:392. doi: 10.1186/1471-2164-13-392.

DOI:10.1186/1471-2164-13-392
PMID:22891638
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3485621/
Abstract

BACKGROUND

Rapid advances in next-generation sequencing methods have provided new opportunities for transcriptome sequencing (RNA-Seq). The unprecedented sequencing depth provided by RNA-Seq makes it a powerful and cost-efficient method for transcriptome study, and it has been widely used in model organisms and non-model organisms to identify and quantify RNA. For non-model organisms lacking well-defined genomes, de novo assembly is typically required for downstream RNA-Seq analyses, including SNP discovery and identification of genes differentially expressed by phenotypes. Although RNA-Seq has been successfully used to sequence many non-model organisms, the results of de novo assembly from short reads can still be improved by using recent bioinformatic developments.

RESULTS

In this study, we used 212.6 million pair-end reads, which accounted for 16.2 Gb, to assemble the hexaploid wheat transcriptome. Two state-of-the-art assemblers, Trinity and Trans-ABySS, which use the single and multiple k-mer methods, respectively, were used, and the whole de novo assembly process was divided into the following four steps: pre-assembly, merging different samples, removal of redundancy and scaffolding. We documented every detail of these steps and how these steps influenced assembly performance to gain insight into transcriptome assembly from short reads. After optimization, the assembled transcripts were comparable to Sanger-derived ESTs in terms of both continuity and accuracy. We also provided considerable new wheat transcript data to the community.

CONCLUSIONS

It is feasible to assemble the hexaploid wheat transcriptome from short reads. Special attention should be paid to dealing with multiple samples to balance the spectrum of expression levels and redundancy. To obtain an accurate overview of RNA profiling, removal of redundancy may be crucial in de novo assembly.

摘要

背景

下一代测序方法的快速发展为转录组测序(RNA-Seq)提供了新的机会。RNA-Seq 提供的前所未有的测序深度使其成为转录组研究的强大且经济高效的方法,已广泛应用于模式生物和非模式生物,用于鉴定和定量 RNA。对于缺乏明确基因组的非模式生物,通常需要进行从头组装,以便进行下游 RNA-Seq 分析,包括 SNP 发现和表型差异表达基因的鉴定。尽管 RNA-Seq 已成功用于许多非模式生物的测序,但通过使用最新的生物信息学进展,仍可以改善来自短读长的从头组装结果。

结果

本研究使用了 2.126 亿对末端读长,总计 16.2GB,组装了六倍体小麦转录组。使用了两种最先进的组装器,Trinity 和 Trans-ABySS,分别使用单和多 k-mer 方法,整个从头组装过程分为以下四个步骤:预组装、合并不同的样本、去除冗余和支架搭建。我们记录了这些步骤的每一个细节以及这些步骤如何影响组装性能,以深入了解来自短读长的转录组组装。经过优化,组装的转录本在连续性和准确性方面与 Sanger 衍生的 EST 相当。我们还为社区提供了大量新的小麦转录数据。

结论

从短读长组装六倍体小麦转录组是可行的。应特别注意处理多个样本,以平衡表达水平和冗余的分布。为了获得 RNA 谱的准确概述,在从头组装中去除冗余可能至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54e9/3485621/b089d2ea717f/1471-2164-13-392-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54e9/3485621/32e9a404df1a/1471-2164-13-392-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54e9/3485621/f5f5a8e472a9/1471-2164-13-392-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54e9/3485621/2873506d7a3c/1471-2164-13-392-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54e9/3485621/b089d2ea717f/1471-2164-13-392-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54e9/3485621/32e9a404df1a/1471-2164-13-392-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54e9/3485621/f5f5a8e472a9/1471-2164-13-392-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54e9/3485621/2873506d7a3c/1471-2164-13-392-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54e9/3485621/b089d2ea717f/1471-2164-13-392-4.jpg

相似文献

1
Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data.利用短读 RNA-Seq 数据优化从头构建的普通小麦转录组组装。
BMC Genomics. 2012 Aug 14;13:392. doi: 10.1186/1471-2164-13-392.
2
Optimizing de novo transcriptome assembly from short-read RNA-Seq data: a comparative study.优化从头转录组组装从短读 RNA-Seq 数据:一项比较研究。
BMC Bioinformatics. 2011 Dec 14;12 Suppl 14(Suppl 14):S2. doi: 10.1186/1471-2105-12-S14-S2.
3
Optimizing de novo assembly of short-read RNA-seq data for phylogenomics.优化短读 RNA-seq 数据的从头组装用于系统发生基因组学。
BMC Genomics. 2013 May 14;14:328. doi: 10.1186/1471-2164-14-328.
4
De Novo Plant Transcriptome Assembly and Annotation Using Illumina RNA-Seq Reads.利用Illumina RNA测序读数进行从头植物转录组组装与注释
Methods Mol Biol. 2019;1933:265-275. doi: 10.1007/978-1-4939-9045-0_16.
5
Evaluation of assembly strategies using RNA-seq data associated with grain development of wheat (Triticum aestivum L.).利用与小麦(普通小麦)籽粒发育相关的RNA测序数据评估组装策略。
PLoS One. 2013 Dec 12;8(12):e83530. doi: 10.1371/journal.pone.0083530. eCollection 2013.
6
A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing.利用全长异构体测序和短读长测序的从头组装对高度多倍体甘蔗基因组的复杂转录组进行的一项调查。
BMC Genomics. 2017 May 22;18(1):395. doi: 10.1186/s12864-017-3757-8.
7
Comparative performance of transcriptome assembly methods for non-model organisms.非模式生物转录组组装方法的比较性能
BMC Genomics. 2016 Jul 27;17:523. doi: 10.1186/s12864-016-2923-8.
8
Optimizing Hybrid de Novo Transcriptome Assembly and Extending Genomic Resources for Giant Freshwater Prawns (Macrobrachium rosenbergii): The Identification of Genes and Markers Associated with Reproduction.优化罗氏沼虾(Macrobrachium rosenbergii)的混合从头转录组组装并扩展基因组资源:与繁殖相关的基因和标记的鉴定
Int J Mol Sci. 2016 May 7;17(5):690. doi: 10.3390/ijms17050690.
9
De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers.从头转录组组装:短读 RNA-Seq 组装器的全面跨物种比较。
Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz039.
10
Comprehensive evaluation of de novo transcriptome assembly programs and their effects on differential gene expression analysis.从头转录组组装程序的综合评估及其对差异基因表达分析的影响。
Bioinformatics. 2017 Feb 1;33(3):327-333. doi: 10.1093/bioinformatics/btw625.

引用本文的文献

1
RNA-Seq transcriptome profiling of immature grain wheat is a technique for understanding comparative modeling of baking quality.小麦幼胚 RNA-Seq 转录组分析是一种用于了解烘焙品质比较建模的技术。
Sci Rep. 2024 May 13;14(1):10940. doi: 10.1038/s41598-024-61528-y.
2
Elucidating the Mesocarp Drupe Transcriptome of Açai ( Mart.): An Amazonian Tree Palm Producer of Bioactive Compounds.阐明亚马逊树棕榈产生活性化合物的中果皮核果转录组。
Int J Mol Sci. 2023 May 26;24(11):9315. doi: 10.3390/ijms24119315.
3
QTL cluster analysis and marker development for kernel traits based on DArT markers in spring bread wheat ( L.).

本文引用的文献

1
De novo assembly and characterisation of the transcriptome during seed development, and generation of genic-SSR markers in peanut (Arachis hypogaea L.).从头组装和鉴定花生种子发育过程中的转录组,并生成花生基因 SSR 标记。
BMC Genomics. 2012 Mar 12;13:90. doi: 10.1186/1471-2164-13-90.
2
Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels.绿洲:跨越表达水平动态范围的稳健从头 RNA-seq 组装。
Bioinformatics. 2012 Apr 15;28(8):1086-92. doi: 10.1093/bioinformatics/bts094. Epub 2012 Feb 24.
3
Combining SNP discovery from next-generation sequencing data with bulked segregant analysis (BSA) to fine-map genes in polyploid wheat.
基于DArT标记的春小麦籽粒性状QTL聚类分析及标记开发
Front Plant Sci. 2023 Feb 10;14:1072233. doi: 10.3389/fpls.2023.1072233. eCollection 2023.
4
Transcriptome and Proteome Co-Profiling Offers an Understanding of Pre-Harvest Sprouting (PHS) Molecular Mechanisms in Wheat ().转录组和蛋白质组联合分析有助于理解小麦收获前发芽(PHS)的分子机制()。
Plants (Basel). 2022 Oct 22;11(21):2807. doi: 10.3390/plants11212807.
5
Conservation and Diversity in Gibberellin-Mediated Transcriptional Responses Among Host Plants Forming Distinct Arbuscular Mycorrhizal Morphotypes.形成不同丛枝菌根形态型的宿主植物中赤霉素介导的转录反应的保守性与多样性
Front Plant Sci. 2021 Dec 16;12:795695. doi: 10.3389/fpls.2021.795695. eCollection 2021.
6
Relationship between the Phenylpropanoid Pathway and Dwarfism of Based on RNA-Seq and iTRAQ.基于 RNA-Seq 和 iTRAQ 的苯丙烷代谢途径与矮化的关系。
Int J Mol Sci. 2021 Sep 3;22(17):9568. doi: 10.3390/ijms22179568.
7
Identification of Genes in Hexaploid Wheat ( L.) by RNA-Seq and Paralog Analyses.利用 RNA-Seq 和旁系同源分析鉴定六倍体小麦( L.)中的基因。
Int J Mol Sci. 2021 Aug 24;22(17):9146. doi: 10.3390/ijms22179146.
8
Development and application of the Faba_bean_130K targeted next-generation sequencing SNP genotyping platform based on transcriptome sequencing.基于转录组测序的菜豆 130K 靶向二代测序 SNP 基因分型平台的开发与应用。
Theor Appl Genet. 2021 Oct;134(10):3195-3207. doi: 10.1007/s00122-021-03885-0. Epub 2021 Jun 12.
9
De novo transcriptome analysis of Lantana camara L. revealed candidate genes involved in phenylpropanoid biosynthesis pathway.山黄麻从头转录组分析揭示了苯丙烷生物合成途径相关的候选基因。
Sci Rep. 2020 Aug 13;10(1):13726. doi: 10.1038/s41598-020-70635-5.
10
Interaction between serine carboxypeptidase-like protein TtGS5 and Annexin D1 in developing seeds of Triticum timopheevi.小麦冰草中丝氨酸羧肽酶样蛋白 TtGS5 与 Annexin D1 在发育种子中的相互作用。
J Appl Genet. 2020 May;61(2):151-162. doi: 10.1007/s13353-020-00539-7. Epub 2020 Jan 22.
利用下一代测序数据中的 SNP 发现与 bulked segregant analysis (BSA) 相结合,对多倍体小麦中的基因进行精细定位。
BMC Plant Biol. 2012 Jan 26;12:14. doi: 10.1186/1471-2229-12-14.
4
Transcriptomic analysis of Chinese bayberry (Myrica rubra) fruit development and ripening using RNA-Seq.基于 RNA-Seq 的杨梅(Myrica rubra)果实发育和成熟的转录组分析。
BMC Genomics. 2012 Jan 13;13:19. doi: 10.1186/1471-2164-13-19.
5
Transcriptome analysis of H2O2-treated wheat seedlings reveals a H2O2-responsive fatty acid desaturase gene participating in powdery mildew resistance.过氧化氢处理的小麦幼苗转录组分析揭示了一个过氧化氢响应的脂肪酸去饱和酶基因参与白粉病抗性。
PLoS One. 2011;6(12):e28810. doi: 10.1371/journal.pone.0028810. Epub 2011 Dec 12.
6
RNA-seq in grain unveils fate of neo- and paleopolyploidization events in bread wheat (Triticum aestivum L.).谷物 RNA 测序揭示了普通小麦(Triticum aestivum L.)中新和古多倍体化事件的命运。
Genome Biol. 2011 Dec 2;12(12):R119. doi: 10.1186/gb-2011-12-12-r119.
7
The Pfam protein families database.Pfam 蛋白质家族数据库。
Nucleic Acids Res. 2012 Jan;40(Database issue):D290-301. doi: 10.1093/nar/gkr1065. Epub 2011 Nov 29.
8
Cell walls of developing wheat starchy endosperm: comparison of composition and RNA-Seq transcriptome.发育中小麦粉质胚乳细胞壁的组成和 RNA-Seq 转录组比较。
Plant Physiol. 2012 Feb;158(2):612-27. doi: 10.1104/pp.111.189191. Epub 2011 Nov 28.
9
Effect of the down-regulation of the high Grain Protein Content (GPC) genes on the wheat transcriptome during monocarpic senescence.下调高谷蛋白含量(GPC)基因对小麦单株衰老过程中转录组的影响。
BMC Genomics. 2011 Oct 7;12:492. doi: 10.1186/1471-2164-12-492.
10
De novo sequence assembly and characterization of the floral transcriptome in cross- and self-fertilizing plants.从头组装和鉴定杂交与自交植物的花转录组。
BMC Genomics. 2011 Jun 7;12:298. doi: 10.1186/1471-2164-12-298.