• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重复测量 RNA 测序实验中用于多次自由度检验方法的比较。

A comparison of methods for multiple degree of freedom testing in repeated measures RNA-sequencing experiments.

机构信息

Department of Biostatistics and Informatics, University of Colorado, Anschutz Medical Campus, Aurora, CO, USA.

Center for Genes, Environment and Health, National Jewish Health, 1400 Jackson St, Denver, 80206, CO, USA.

出版信息

BMC Med Res Methodol. 2022 May 28;22(1):153. doi: 10.1186/s12874-022-01615-8.

DOI:10.1186/s12874-022-01615-8
PMID:35643435
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9148455/
Abstract

BACKGROUND

As the cost of RNA-sequencing decreases, complex study designs, including paired, longitudinal, and other correlated designs, become increasingly feasible. These studies often include multiple hypotheses and thus multiple degree of freedom tests, or tests that evaluate multiple hypotheses jointly, are often useful for filtering the gene list to a set of interesting features for further exploration while controlling the false discovery rate. Though there are several methods which have been proposed for analyzing correlated RNA-sequencing data, there has been little research evaluating and comparing the performance of multiple degree of freedom tests across methods.

METHODS

We evaluated 11 different methods for modelling correlated RNA-sequencing data by performing a simulation study to compare the false discovery rate, power, and model convergence rate across several hypothesis tests and sample size scenarios. We also applied each method to a real longitudinal RNA-sequencing dataset.

RESULTS

Linear mixed modelling using transformed data had the best false discovery rate control while maintaining relatively high power. However, this method had high model non-convergence, particularly at small sample sizes. No method had high power at the lowest sample size. We found a mix of conservative and anti-conservative behavior across the other methods, which was influenced by the sample size and the hypothesis being evaluated. The patterns observed in the simulation study were largely replicated in the analysis of a longitudinal study including data from intensive care unit patients experiencing cardiogenic or septic shock.

CONCLUSIONS

Multiple degree of freedom testing is a valuable tool in longitudinal and other correlated RNA-sequencing experiments. Of the methods that we investigated, linear mixed modelling had the best overall combination of power and false discovery rate control. Other methods may also be appropriate in some scenarios.

摘要

背景

随着 RNA 测序成本的降低,包括配对、纵向和其他相关设计在内的复杂研究设计变得越来越可行。这些研究通常包含多个假设,因此多个自由度检验,或联合评估多个假设的检验,通常有助于将基因列表过滤到一组有趣的特征,以便进一步探索,同时控制假发现率。虽然已经提出了几种用于分析相关 RNA 测序数据的方法,但很少有研究评估和比较多种自由度检验在不同方法中的性能。

方法

我们通过进行模拟研究来评估 11 种不同的方法,以比较几种假设检验和样本量场景下的假发现率、功效和模型收敛率。我们还将每种方法应用于真实的纵向 RNA 测序数据集。

结果

使用转换后数据的线性混合建模具有最佳的假发现率控制,同时保持相对较高的功效。然而,该方法的模型非收敛性较高,特别是在样本量较小的情况下。在最小样本量下,没有一种方法具有高功效。我们发现,在其他方法中存在保守和反保守行为的混合,这受样本量和正在评估的假设的影响。在模拟研究中观察到的模式在对包括经历心源性或感染性休克的重症监护病房患者数据的纵向研究的分析中得到了很大程度的复制。

结论

在纵向和其他相关 RNA 测序实验中,多个自由度检验是一种很有价值的工具。在所研究的方法中,线性混合建模具有最佳的功效和假发现率控制的整体组合。其他方法在某些情况下也可能适用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ceff/9148455/7c6bcd318c14/12874_2022_1615_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ceff/9148455/095186f36de3/12874_2022_1615_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ceff/9148455/52f1d0aacdab/12874_2022_1615_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ceff/9148455/7c6bcd318c14/12874_2022_1615_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ceff/9148455/095186f36de3/12874_2022_1615_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ceff/9148455/52f1d0aacdab/12874_2022_1615_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ceff/9148455/7c6bcd318c14/12874_2022_1615_Fig3_HTML.jpg

相似文献

1
A comparison of methods for multiple degree of freedom testing in repeated measures RNA-sequencing experiments.重复测量 RNA 测序实验中用于多次自由度检验方法的比较。
BMC Med Res Methodol. 2022 May 28;22(1):153. doi: 10.1186/s12874-022-01615-8.
2
MCMSeq: Bayesian hierarchical modeling of clustered and repeated measures RNA sequencing experiments.MCMSeq:用于聚类和重复测量 RNA 测序实验的贝叶斯层次模型。
BMC Bioinformatics. 2020 Aug 28;21(1):375. doi: 10.1186/s12859-020-03715-y.
3
Sample size calculation while controlling false discovery rate for differential expression analysis with RNA-sequencing experiments.在RNA测序实验的差异表达分析中控制错误发现率时的样本量计算。
BMC Bioinformatics. 2016 Mar 31;17:146. doi: 10.1186/s12859-016-0994-9.
4
Impact of adaptive filtering on power and false discovery rate in RNA-seq experiments.自适应滤波对 RNA-seq 实验中功率和假发现率的影响。
BMC Bioinformatics. 2022 Sep 24;23(1):388. doi: 10.1186/s12859-022-04928-z.
5
An evaluation of RNA-seq differential analysis methods.RNA-seq 差异分析方法评估。
PLoS One. 2022 Sep 16;17(9):e0264246. doi: 10.1371/journal.pone.0264246. eCollection 2022.
6
Power analysis and sample size estimation for RNA-Seq differential expression.RNA测序差异表达的功效分析与样本量估计
RNA. 2014 Nov;20(11):1684-96. doi: 10.1261/rna.046011.114. Epub 2014 Sep 22.
7
LPEseq: Local-Pooled-Error Test for RNA Sequencing Experiments with a Small Number of Replicates.LPEseq:针对少量重复样本的RNA测序实验的局部合并误差检验
PLoS One. 2016 Aug 17;11(8):e0159182. doi: 10.1371/journal.pone.0159182. eCollection 2016.
8
PROPER: comprehensive power evaluation for differential expression using RNA-seq.正确的:使用RNA测序对差异表达进行综合能力评估。
Bioinformatics. 2015 Jan 15;31(2):233-41. doi: 10.1093/bioinformatics/btu640. Epub 2014 Oct 1.
9
On the utility of RNA sample pooling to optimize cost and statistical power in RNA sequencing experiments.关于在 RNA 测序实验中利用 RNA 样本池化来优化成本和统计功效的研究。
BMC Genomics. 2020 Apr 19;21(1):312. doi: 10.1186/s12864-020-6721-y.
10
False discovery rate control in two-stage designs.两阶段设计中的假发现率控制。
BMC Bioinformatics. 2012 May 6;13:81. doi: 10.1186/1471-2105-13-81.

引用本文的文献

1
lmerSeq: an R package for analyzing transformed RNA-Seq data with linear mixed effects models.lmerSeq:一个用于使用线性混合效应模型分析转换后的RNA测序数据的R包。
BMC Bioinformatics. 2022 Nov 16;23(1):489. doi: 10.1186/s12859-022-05019-9.

本文引用的文献

1
A practical solution to pseudoreplication bias in single-cell studies.单细胞研究中拟似重复偏倚的实用解决方案。
Nat Commun. 2021 Feb 2;12(1):738. doi: 10.1038/s41467-021-21038-1.
2
Negative Binomial mixed models estimated with the maximum likelihood method can be used for longitudinal RNAseq data.可以使用最大似然法估计负二项混合模型,用于纵向 RNAseq 数据。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa264.
3
MCMSeq: Bayesian hierarchical modeling of clustered and repeated measures RNA sequencing experiments.MCMSeq:用于聚类和重复测量 RNA 测序实验的贝叶斯层次模型。
BMC Bioinformatics. 2020 Aug 28;21(1):375. doi: 10.1186/s12859-020-03715-y.
4
rmRNAseq: differential expression analysis for repeated-measures RNA-seq data.rmRNAseq:重复测量 RNA-seq 数据的差异表达分析。
Bioinformatics. 2020 Aug 15;36(16):4432-4439. doi: 10.1093/bioinformatics/btaa525.
5
A longitudinal study highlights shared aspects of the transcriptomic response to cardiogenic and septic shock.一项纵向研究强调了心源性和感染性休克转录组反应的共同方面。
Crit Care. 2019 Dec 19;23(1):414. doi: 10.1186/s13054-019-2670-8.
6
PairedFB: a full hierarchical Bayesian model for paired RNA-seq data with heterogeneous treatment effects.配对 FB:一种具有异质处理效应的配对 RNA-seq 数据的全层次贝叶斯模型。
Bioinformatics. 2019 Mar 1;35(5):787-797. doi: 10.1093/bioinformatics/bty731.
7
Longitudinal transcriptomic characterization of the immune response to acute hepatitis C virus infection in patients with spontaneous viral clearance.自发性病毒清除的急性丙型肝炎病毒感染患者免疫反应的纵向转录组学特征。
PLoS Pathog. 2018 Sep 17;14(9):e1007290. doi: 10.1371/journal.ppat.1007290. eCollection 2018 Sep.
8
Negative Binomial Mixed Models for Analyzing Longitudinal Microbiome Data.用于分析纵向微生物组数据的负二项混合模型
Front Microbiol. 2018 Jul 26;9:1683. doi: 10.3389/fmicb.2018.01683. eCollection 2018.
9
A modular transcriptional signature identifies phenotypic heterogeneity of human tuberculosis infection.一种模块化转录特征可识别人类结核病感染的表型异质性。
Nat Commun. 2018 Jun 19;9(1):2308. doi: 10.1038/s41467-018-04579-w.
10
stageR: a general stage-wise method for controlling the gene-level false discovery rate in differential expression and differential transcript usage.stageR:一种用于控制差异表达和差异转录本使用中基因水平错误发现率的通用逐阶段方法。
Genome Biol. 2017 Aug 7;18(1):151. doi: 10.1186/s13059-017-1277-0.