Synthetic spike-in standards for RNA-seq experiments.

Author information

Section of Developmental Genomics, Laboratory of Cellular and Developmental Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892, USA.

Publication information

Genome Res. 2011 Sep;21(9):1543-51. doi: 10.1101/gr.121095.111. Epub 2011 Aug 4.

DOI:10.1101/gr.121095.111

PMID:21816910

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC3166838/

Abstract

High-throughput sequencing of cDNA (RNA-seq) is a widely deployed transcriptome profiling and annotation technique, but questions about the performance of different protocols and platforms remain. We used a newly developed pool of 96 synthetic RNAs with various lengths, and GC content covering a 2^20 concentration range as spike-in controls to measure sensitivity, accuracy, and biases in RNA-seq experiments as well as to derive standard curves for quantifying the abundance of transcripts. We observed linearity between read density and RNA input over the entire detection range and excellent agreement between replicates, but we observed significantly larger imprecision than expected under pure Poisson sampling errors. We use the control RNAs to directly measure reproducible protocol-dependent biases due to GC content and transcript length as well as stereotypic heterogeneity in coverage across transcripts correlated with position relative to RNA termini and priming sequence bias. These effects lead to biased quantification for short transcripts and individual exons, which is a serious problem for measurements of isoform abundances, but that can partially be corrected using appropriate models of bias. By using the control RNAs, we derive limits for the discovery and detection of rare transcripts in RNA-seq experiments. By using data collected as part of the model organism and human Encyclopedia of DNA Elements projects (ENCODE and modENCODE), we demonstrate that external RNA controls are a useful resource for evaluating sensitivity and accuracy of RNA-seq experiments for transcriptome discovery and quantification. These quality metrics facilitate comparable analysis across different samples, protocols, and platforms.
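
The abstract's idea of a standard curve can be illustrated with a minimal sketch. It assumes, as the reported linearity suggests, that read density is roughly proportional to spike-in input, so a straight line fitted in log2-log2 space can be inverted to estimate the abundance of an unknown transcript. The function names and the numbers below are hypothetical and are not taken from the paper.

```python
import math


def fit_standard_curve(concentrations, read_densities):
    """Ordinary least-squares fit of log2(density) = slope * log2(conc) + intercept."""
    xs = [math.log2(c) for c in concentrations]
    ys = [math.log2(d) for d in read_densities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def estimate_concentration(read_density, slope, intercept):
    """Invert the fitted line to map an observed read density back to an input amount."""
    return 2 ** ((math.log2(read_density) - intercept) / slope)


# Hypothetical, noise-free spike-in data spanning a 2^20 range (arbitrary units).
spike_conc = [2 ** k for k in range(0, 21, 4)]
spike_density = [0.5 * c for c in spike_conc]

slope, intercept = fit_standard_curve(spike_conc, spike_density)
estimated = estimate_concentration(64.0, slope, intercept)
```

With real data the points scatter around the line, and a slope close to 1 in log-log space is what proportionality between read density and input would look like.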

Similar articles

Synthetic spike-in standards for RNA-seq experiments.

Genome Res. 2011 Sep;21(9):1543-51. doi: 10.1101/gr.121095.111. Epub 2011 Aug 4.

Evaluation of the External RNA Controls Consortium (ERCC) reference material using a modified Latin square design.

BMC Biotechnol. 2016 Jun 24;16(1):54. doi: 10.1186/s12896-016-0281-x.

Normalization of human RNA-seq experiments using chimpanzee RNA as a spike-in standard.

Sci Rep. 2016 Aug 24;6:31923. doi: 10.1038/srep31923.

Using Synthetic Mouse Spike-In Transcripts to Evaluate RNA-Seq Analysis Tools.

PLoS One. 2016 Apr 21;11(4):e0153782. doi: 10.1371/journal.pone.0153782. eCollection 2016.

mRNA enrichment protocols determine the quantification characteristics of external RNA spike-in controls in RNA-Seq studies.

Sci China Life Sci. 2013 Feb;56(2):134-42. doi: 10.1007/s11427-013-4437-9. Epub 2013 Feb 8.

RNA-seq Sample Preparation Kits Strongly Affect Transcriptome Profiles of a Gas-Fermenting Bacterium.

Microbiol Spectr. 2022 Aug 31;10(4):e0230322. doi: 10.1128/spectrum.02303-22. Epub 2022 Jul 27.

Reproducibility of high-throughput mRNA and small RNA sequencing across laboratories.

Nat Biotechnol. 2013 Nov;31(11):1015-22. doi: 10.1038/nbt.2702. Epub 2013 Sep 15.

Quality control of RNA-seq experiments.

Methods Mol Biol. 2015;1269:137-46. doi: 10.1007/978-1-4939-2291-8_8.

Blind spots of quantitative RNA-seq: the limits for assessing abundance, differential expression, and isoform switching.

BMC Bioinformatics. 2013 Dec 24;14:370. doi: 10.1186/1471-2105-14-370.

Transcriptome profiling of mouse samples using nanopore sequencing of cDNA and RNA molecules.

Sci Rep. 2019 Oct 17;9(1):14908. doi: 10.1038/s41598-019-51470-9.

Cited by

The RNA degradation enzyme RNase E is essential for early flagellar assembly in .

PNAS Nexus. 2025 Aug 18;4(9):pgaf269. doi: 10.1093/pnasnexus/pgaf269. eCollection 2025 Sep.

Explicit Scale Simulation for analysis of RNA-sequencing count data with ALDEx2.

NAR Genom Bioinform. 2025 Aug 19;7(3):lqaf108. doi: 10.1093/nargab/lqaf108. eCollection 2025 Sep.

Singletrome enhances detection of long noncoding RNAs in single cell transcriptomes.

Sci Rep. 2025 Aug 12;15(1):29542. doi: 10.1038/s41598-025-13528-9.

The infection of mycovirus down regulates to weaken the pathogenicity of the f. sp. .

Front Plant Sci. 2025 Jul 22;16:1598183. doi: 10.3389/fpls.2025.1598183. eCollection 2025.

Integration of Bulk RNA-seq Pipeline Metrics for Assessing Low-Quality Samples.

Res Sq. 2025 Jul 3:rs.3.rs-6976695. doi: 10.21203/rs.3.rs-6976695/v1.

Development of EST-SSR markers based on transcriptome for genetic analysis in .

PeerJ. 2025 Jun 17;13:e19560. doi: 10.7717/peerj.19560. eCollection 2025.

Efficient profiling of total RNA in single cells with STORM-seq.

bioRxiv. 2025 May 20:2022.03.14.484332. doi: 10.1101/2022.03.14.484332.

Advances in ribosome profiling technologies.

Biochem Soc Trans. 2025 Jun 30;53(3):555-564. doi: 10.1042/BST20253061.

Revealing the Diversity of the Mycobiome in Different Phases of Ticks: ITS Gene-Based Analysis.

Transbound Emerg Dis. 2024 Jan 8;2024:8814592. doi: 10.1155/2024/8814592. eCollection 2024.

A systematic benchmark of Nanopore long-read RNA sequencing for transcript-level analysis in human cell lines.

Nat Methods. 2025 Apr;22(4):801-812. doi: 10.1038/s41592-025-02623-4. Epub 2025 Mar 13.

References

The developmental transcriptome of Drosophila melanogaster.

Nature. 2011 Mar 24;471(7339):473-9. doi: 10.1038/nature09715. Epub 2010 Dec 22.

Integrative analysis of the Caenorhabditis elegans genome by the modENCODE project.

Science. 2010 Dec 24;330(6012):1775-87. doi: 10.1126/science.1196914. Epub 2010 Dec 22.

Identification of functional elements and regulatory circuits by Drosophila modENCODE.

Science. 2010 Dec 24;330(6012):1787-97. doi: 10.1126/science.1198374. Epub 2010 Dec 22.

Deep annotation of Drosophila melanogaster microRNAs yields insights into their processing, modification, and emergence.

Genome Res. 2011 Feb;21(2):203-15. doi: 10.1101/gr.116657.110. Epub 2010 Dec 22.

Isoform abundance inference provides a more accurate estimation of gene expression levels in RNA-seq.

J Bioinform Comput Biol. 2010 Dec;8 Suppl 1:177-92. doi: 10.1142/s0219720010005178.

Evaluation of external RNA controls for the standardisation of gene expression biomarker measurements.

BMC Genomics. 2010 Nov 24;11:662. doi: 10.1186/1471-2164-11-662.

Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays.

BMC Genomics. 2010 Jun 17;11:383. doi: 10.1186/1471-2164-11-383.

Most "dark matter" transcripts are associated with known genes.

PLoS Biol. 2010 May 18;8(5):e1000371. doi: 10.1371/journal.pbio.1000371.

Modeling non-uniformity in short-read rates in RNA-Seq data.

Genome Biol. 2010;11(5):R50. doi: 10.1186/gb-2010-11-5-r50. Epub 2010 May 11.

Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.

Nat Biotechnol. 2010 May;28(5):511-5. doi: 10.1038/nbt.1621. Epub 2010 May 2.
