RNA sequencing read depth requirement for optimal transcriptome coverage in Hevea brasiliensis.

Author information

Chow Keng-See, Ghazali Ahmad-Kamal, Hoh Chee-Choong, Mohd-Zainuddin Zainorlina

Affiliation

Biotechnology Unit, Malaysian Rubber Board, Rubber Research Institute of Malaysia, Experiment Station, Kuala Lumpur 47000, Sungai Buloh, Selangor, Malaysia.

Publication information

BMC Res Notes. 2014 Feb 1;7:69. doi: 10.1186/1756-0500-7-69.

DOI:10.1186/1756-0500-7-69

PMID:24484543

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3926681/

Abstract

BACKGROUND

One concern in assembling de novo transcriptomes is determining how many read sequences are required to ensure comprehensive coverage of the genes expressed in a particular sample. In this report, we describe the use of Illumina paired-end RNA-Seq (PE RNA-Seq) reads from Hevea brasiliensis (rubber tree) bark to devise a transcript mapping approach for estimating the read amount needed for deep transcriptome coverage.

FINDINGS

We optimized the assembly of a Hevea bark transcriptome from 16 Gb of Illumina PE RNA-Seq reads using the Oases assembler across a range of k-mer sizes. We then assessed assembly quality using transcript N50 length and transcript mapping statistics relative to (a) known Hevea cDNAs with complete open reading frames, (b) a set of core eukaryotic genes and (c) Hevea genome scaffolds. This was followed by a systematic transcript mapping process in which sub-assemblies from a series of incremental amounts of bark transcripts were aligned to transcripts from the entire bark transcriptome assembly. The exercise served to relate read amounts to the transcript mapping level, the latter being an indicator of the coverage of gene transcripts expressed in the sample. As the read amount (data size) increased toward 16 Gb, the number of transcripts mapped to the entire bark assembly approached saturation. A colour matrix was subsequently generated to illustrate the sequencing depth required for a given degree of coverage of total sample transcripts.
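The two quantities used above, transcript N50 length and the transcript mapping level per data size, can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the toy transcript identifiers, and the definition of mapping level as "fraction of full-assembly transcripts hit by at least one sub-assembly transcript" are assumptions made for illustration, and the alignment step itself is assumed to have been run separately.

```python
from typing import Dict, Iterable, List, Set, Tuple


def n50(lengths: Iterable[int]) -> int:
    """Return the N50: the length L such that transcripts of length >= L
    together contain at least half of all assembled bases."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    return 0  # empty input


def mapped_fraction(full_ids: Set[str], hit_ids: Set[str]) -> float:
    """Fraction of full-assembly transcripts hit by a sub-assembly
    (assumed definition of the 'transcript mapping level')."""
    if not full_ids:
        return 0.0
    return len(full_ids & hit_ids) / len(full_ids)


def saturation_curve(
    full_ids: Set[str], hits_by_datasize: Dict[float, Set[str]]
) -> List[Tuple[float, float]]:
    """Return (data size in Gb, mapped fraction) pairs, smallest size first."""
    return [
        (size, mapped_fraction(full_ids, hits))
        for size, hits in sorted(hits_by_datasize.items())
    ]


# Toy input (hypothetical identifiers, not data from the study).
full_assembly = {"t1", "t2", "t3", "t4"}
hits = {
    1.0: {"t1"},
    4.0: {"t1", "t2", "t3"},
    16.0: {"t1", "t2", "t3", "t4"},
}
curve = saturation_curve(full_assembly, hits)
toy_n50 = n50([100, 200, 300, 400])
```

In this sketch the curve flattens once additional reads stop adding newly mapped transcripts, which is the saturation behaviour the text describes.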

CONCLUSIONS

We devised a procedure, the "transcript mapping saturation test", to estimate the amount of RNA-Seq reads needed for deep coverage of transcriptomes. For Hevea de novo assembly, we propose generating 5 to 8 Gb of reads, with which around 90% transcript coverage could be achieved using optimized k-mers and transcript N50 length. The principle behind this methodology may also be applied to other non-model plants, or to reads from other second-generation sequencing platforms.
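Once a saturation curve is available, choosing a sequencing depth reduces to finding the smallest data size that reaches a target coverage. The sketch below assumes the curve is a list of (data size in Gb, coverage fraction) pairs; the function name and the example values are hypothetical and are not results from the study.

```python
from typing import List, Optional, Tuple


def min_datasize_for_coverage(
    curve: List[Tuple[float, float]], target: float = 0.90
) -> Optional[float]:
    """Return the smallest data size (Gb) whose coverage meets the target,
    or None if no tested data size reaches it."""
    for size, coverage in sorted(curve):
        if coverage >= target:
            return size
    return None


# Illustrative values only (made up for this example).
example_curve = [(1.0, 0.40), (2.0, 0.60), (4.0, 0.80), (8.0, 0.92), (16.0, 1.00)]
recommended = min_datasize_for_coverage(example_curve, target=0.90)
unreachable = min_datasize_for_coverage(example_curve[:3], target=0.90)
```

Returning `None` rather than the largest tested size keeps the "target not reached" case explicit, so a caller cannot mistake an under-sequenced series for a recommendation.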

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e840/3926681/667d664b614d/1756-0500-7-69-1.jpg
