• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关于在 RNA 测序实验中利用 RNA 样本池化来优化成本和统计功效的研究。

On the utility of RNA sample pooling to optimize cost and statistical power in RNA sequencing experiments.

机构信息

Department of Data Analysis and Mathematical Modeling, Ghent University, Ghent, 9000, Belgium.

Department of Biomolecular Medicine, Ghent University, Ghent, 9000, Belgium.

出版信息

BMC Genomics. 2020 Apr 19;21(1):312. doi: 10.1186/s12864-020-6721-y.

DOI:10.1186/s12864-020-6721-y
PMID:32306892
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7168886/
Abstract

BACKGROUND

In gene expression studies, RNA sample pooling is sometimes considered because of budget constraints or lack of sufficient input material. Using microarray technology, RNA sample pooling strategies have been reported to optimize both the cost of data generation as well as the statistical power for differential gene expression (DGE) analysis. For RNA sequencing, with its different quantitative output in terms of counts and tunable dynamic range, the adequacy and empirical validation of RNA sample pooling strategies have not yet been evaluated. In this study, we comprehensively assessed the utility of pooling strategies in RNA-seq experiments using empirical and simulated RNA-seq datasets.

RESULT

The data generating model in pooled experiments is defined mathematically to evaluate the mean and variability of gene expression estimates. The model is further used to examine the trade-off between the statistical power of testing for DGE and the data generating costs. Empirical assessment of pooling strategies is done through analysis of RNA-seq datasets under various pooling and non-pooling experimental settings. Simulation study is also used to rank experimental scenarios with respect to the rate of false and true discoveries in DGE analysis. The results demonstrate that pooling strategies in RNA-seq studies can be both cost-effective and powerful when the number of pools, pool size and sequencing depth are optimally defined.

CONCLUSION

For high within-group gene expression variability, small RNA sample pools are effective to reduce the variability and compensate for the loss of the number of replicates. Unlike the typical cost-saving strategies, such as reducing sequencing depth or number of RNA samples (replicates), an adequate pooling strategy is effective in maintaining the power of testing DGE for genes with low to medium abundance levels, along with a substantial reduction of the total cost of the experiment. In general, pooling RNA samples or pooling RNA samples in conjunction with moderate reduction of the sequencing depth can be good options to optimize the cost and maintain the power.

摘要

背景

在基因表达研究中,由于预算限制或缺乏足够的输入材料,有时会考虑对 RNA 样本进行汇集。使用微阵列技术,已经报道了 RNA 样本汇集策略,以优化数据生成的成本以及差异基因表达(DGE)分析的统计功效。对于 RNA 测序,由于其在计数方面具有不同的定量输出和可调节的动态范围,因此尚未评估 RNA 样本汇集策略的充分性和经验验证。在这项研究中,我们使用经验和模拟的 RNA-seq 数据集全面评估了汇集策略在 RNA-seq 实验中的效用。

结果

汇集实验中的数据生成模型是通过数学定义的,用于评估基因表达估计值的均值和变异性。该模型进一步用于检查 DGE 测试的统计功效与数据生成成本之间的权衡。通过在各种汇集和非汇集实验设置下分析 RNA-seq 数据集来进行汇集策略的经验评估。还使用模拟研究来根据 DGE 分析中假阳性和真阳性发现的比率对实验情况进行排名。结果表明,当最佳定义了池的数量、池大小和测序深度时,RNA-seq 研究中的汇集策略既具有成本效益又具有功效。

结论

对于高组内基因表达变异性,小的 RNA 样本池可有效降低变异性并弥补重复次数的减少。与典型的节省成本策略(例如降低测序深度或 RNA 样本数量(重复))不同,适当的汇集策略可有效地维持对低到中等丰度水平的基因进行 DGE 测试的功效,同时大大降低实验的总成本。一般而言,汇集 RNA 样本或与适度降低测序深度相结合,都是优化成本和维持功效的不错选择。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/7168886/db4233a3e22b/12864_2020_6721_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/7168886/276d24565de1/12864_2020_6721_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/7168886/04b97f56556a/12864_2020_6721_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/7168886/0a669a1082dc/12864_2020_6721_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/7168886/2abe2409466e/12864_2020_6721_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/7168886/fd62aae18987/12864_2020_6721_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/7168886/db4233a3e22b/12864_2020_6721_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/7168886/276d24565de1/12864_2020_6721_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/7168886/04b97f56556a/12864_2020_6721_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/7168886/0a669a1082dc/12864_2020_6721_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/7168886/2abe2409466e/12864_2020_6721_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/7168886/fd62aae18987/12864_2020_6721_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/7168886/db4233a3e22b/12864_2020_6721_Fig6_HTML.jpg

相似文献

1
On the utility of RNA sample pooling to optimize cost and statistical power in RNA sequencing experiments.关于在 RNA 测序实验中利用 RNA 样本池化来优化成本和统计功效的研究。
BMC Genomics. 2020 Apr 19;21(1):312. doi: 10.1186/s12864-020-6721-y.
2
Statistical implications of pooling RNA samples for microarray experiments.用于微阵列实验的RNA样本合并的统计学意义。
BMC Bioinformatics. 2003 Jun 24;4:26. doi: 10.1186/1471-2105-4-26.
3
Power analysis and sample size estimation for RNA-Seq differential expression.RNA测序差异表达的功效分析与样本量估计
RNA. 2014 Nov;20(11):1684-96. doi: 10.1261/rna.046011.114. Epub 2014 Sep 22.
4
Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing.使用 RNA 测序检测差异表达的高效实验设计和分析策略。
BMC Genomics. 2012 Sep 17;13:484. doi: 10.1186/1471-2164-13-484.
5
Experimental validation of methods for differential gene expression analysis and sample pooling in RNA-seq.RNA测序中差异基因表达分析和样本合并方法的实验验证
BMC Genomics. 2015 Jul 25;16(1):548. doi: 10.1186/s12864-015-1767-y.
6
Effects of pooling mRNA in microarray class comparisons.在微阵列类别比较中汇集mRNA的效果。
Bioinformatics. 2004 Dec 12;20(18):3318-25. doi: 10.1093/bioinformatics/bth391. Epub 2004 Jul 9.
7
LPEseq: Local-Pooled-Error Test for RNA Sequencing Experiments with a Small Number of Replicates.LPEseq:针对少量重复样本的RNA测序实验的局部合并误差检验
PLoS One. 2016 Aug 17;11(8):e0159182. doi: 10.1371/journal.pone.0159182. eCollection 2016.
8
Sample size calculations for the differential expression analysis of RNA-seq data using a negative binomial regression model.使用负二项回归模型对RNA测序数据进行差异表达分析的样本量计算。
Stat Appl Genet Mol Biol. 2019 Jan 22;18(1):/j/sagmb.2019.18.issue-1/sagmb-2018-0021/sagmb-2018-0021.xml. doi: 10.1515/sagmb-2018-0021.
9
3'Pool-seq: an optimized cost-efficient and scalable method of whole-transcriptome gene expression profiling.3'Pool-seq:一种优化的、具有成本效益且可扩展的全转录组基因表达谱分析方法。
BMC Genomics. 2020 Jan 20;21(1):64. doi: 10.1186/s12864-020-6478-3.
10
Effect of pooling samples on the efficiency of comparative studies using microarrays.样本合并对使用微阵列的比较研究效率的影响。
Bioinformatics. 2005 Dec 15;21(24):4378-83. doi: 10.1093/bioinformatics/bti717. Epub 2005 Oct 18.

引用本文的文献

1
Adaptive immune changes in colorectal cancer: a focus on T and B cell activation genes.结直肠癌中的适应性免疫变化:聚焦于T和B细胞活化基因。
Discov Oncol. 2025 Jun 8;16(1):1032. doi: 10.1007/s12672-025-02794-8.
2
Transcriptome map and genome annotation of flax line 3896.亚麻品系3896的转录组图谱与基因组注释
Front Plant Sci. 2025 May 16;16:1520832. doi: 10.3389/fpls.2025.1520832. eCollection 2025.
3
N-methyladenosine RNA base modification regulates NKG2D-dependent and cytotoxic genes expression in natural killer cells.N-甲基腺苷RNA碱基修饰调节自然杀伤细胞中NKG2D依赖性和细胞毒性基因的表达。

本文引用的文献

1
SPsimSeq: semi-parametric simulation of bulk and single-cell RNA-sequencing data.SPsimSeq:批量和单细胞 RNA-seq 数据的半参数模拟。
Bioinformatics. 2020 May 1;36(10):3276-3278. doi: 10.1093/bioinformatics/btaa105.
2
Differential gene expression analysis tools exhibit substandard performance for long non-coding RNA-sequencing data.差异基因表达分析工具在长链非编码 RNA-seq 数据上的表现不佳。
Genome Biol. 2018 Jul 24;19(1):96. doi: 10.1186/s13059-018-1466-5.
3
MultiQC: summarize analysis results for multiple tools and samples in a single report.
BMC Med Genomics. 2025 May 19;18(1):91. doi: 10.1186/s12920-025-02147-y.
4
Social context prevents heat hormetic effects against mutagens during fish development.社会环境会阻碍鱼类发育过程中热应激对诱变剂的影响。
FEBS Lett. 2025 Apr 23;599(15):2107-28. doi: 10.1002/1873-3468.70047.
5
Experimental reporting of fish transcriptomic responses in environmental toxicology and ecotoxicology.环境毒理学和生态毒理学中鱼类转录组反应的实验报告
Environ Toxicol Chem. 2025 Sep 1;44(9):2580-2598. doi: 10.1093/etojnl/vgae077.
6
Infiltration-RNAseq Reveals Enhanced Defense Responses in Leaves Overexpressing the Banana Gene .浸润RNA测序揭示了过表达香蕉基因的叶片中增强的防御反应。
Plants (Basel). 2025 Feb 6;14(3):483. doi: 10.3390/plants14030483.
7
Thermal variation influences the transcriptome of the major malaria vector Anopheles stephensi.温度变化影响主要疟疾媒介斯氏按蚊的转录组。
Commun Biol. 2025 Jan 22;8(1):112. doi: 10.1038/s42003-025-07477-2.
8
Multi-omic insights into molecular mechanism and therapeutic targets in spinocerebellar ataxia type 7.对7型脊髓小脑共济失调分子机制和治疗靶点的多组学见解
Mol Ther Nucleic Acids. 2024 Dec 5;36(1):102414. doi: 10.1016/j.omtn.2024.102414. eCollection 2025 Mar 11.
9
Esterase-Mediated Pyrethroid Resistance in Populations of an Invasive Malaria Vector from Ethiopia.埃塞俄比亚一种侵入性疟疾媒介种群中酯酶介导的拟除虫菊酯抗性
Genes (Basel). 2024 Dec 15;15(12):1603. doi: 10.3390/genes15121603.
10
Aneuploidy-driven gene expression profiling in human blastocysts: insights from RNA-Seq analysis.人囊胚中由非整倍性驱动的基因表达谱分析:来自RNA测序分析的见解
J Assist Reprod Genet. 2025 Mar;42(3):885-896. doi: 10.1007/s10815-024-03369-2. Epub 2025 Jan 3.
MultiQC:在一份报告中汇总多个工具和样本的分析结果。
Bioinformatics. 2016 Oct 1;32(19):3047-8. doi: 10.1093/bioinformatics/btw354. Epub 2016 Jun 16.
4
How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use?RNA测序实验需要多少生物学重复,以及应该使用哪种差异表达工具?
RNA. 2016 Jun;22(6):839-51. doi: 10.1261/rna.053959.115. Epub 2016 Mar 28.
5
Experimental validation of methods for differential gene expression analysis and sample pooling in RNA-seq.RNA测序中差异基因表达分析和样本合并方法的实验验证
BMC Genomics. 2015 Jul 25;16(1):548. doi: 10.1186/s12864-015-1767-y.
6
Comparison of RNA-seq and microarray-based models for clinical endpoint prediction.用于临床终点预测的RNA测序和基于微阵列模型的比较。
Genome Biol. 2015 Jun 25;16(1):133. doi: 10.1186/s13059-015-0694-1.
7
limma powers differential expression analyses for RNA-sequencing and microarray studies.limma为RNA测序和微阵列研究提供差异表达分析的动力。
Nucleic Acids Res. 2015 Apr 20;43(7):e47. doi: 10.1093/nar/gkv007. Epub 2015 Jan 20.
8
voom: Precision weights unlock linear model analysis tools for RNA-seq read counts.voom:精确权重为RNA测序读数计数解锁线性模型分析工具。
Genome Biol. 2014 Feb 3;15(2):R29. doi: 10.1186/gb-2014-15-2-r29.
9
Sample size calculation for comparing two negative binomial rates.比较两个负二项分布率的样本量计算。
Stat Med. 2014 Feb 10;33(3):376-87. doi: 10.1002/sim.5947. Epub 2013 Aug 23.
10
Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data.RNA测序数据差异基因表达分析方法的综合评估
Genome Biol. 2013;14(9):R95. doi: 10.1186/gb-2013-14-9-r95.