• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于广义线性混合效应模型的 RNA-Seq 差异表达研究的功效分析。

Power analysis for RNA-Seq differential expression studies using generalized linear mixed effects models.

机构信息

Center for Biostatistics, Department of Biomedical Informatics, The Ohio State University, 1800 Cannon Dr., Columbus, 43210, OH, USA.

出版信息

BMC Bioinformatics. 2020 May 19;21(1):198. doi: 10.1186/s12859-020-3541-7.

DOI:10.1186/s12859-020-3541-7
PMID:32429934
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7236949/
Abstract

BACKGROUND

Power analysis becomes an inevitable step in experimental design of current biomedical research. Complex designs allowing diverse correlation structures are commonly used in RNA-Seq experiments. However, the field currently lacks statistical methods to calculate sample size and estimate power for RNA-Seq differential expression studies using such designs. To fill the gap, simulation based methods have a great advantage by providing numerical solutions, since theoretical distributions of test statistics are typically unavailable for such designs.

RESULTS

In this paper, we propose a novel simulation based procedure for power estimation of differential expression with the employment of generalized linear mixed effects models for correlated expression data. We also propose a new procedure for power estimation of differential expression with the use of a bivariate negative binomial distribution for paired designs. We compare the performance of both the likelihood ratio test and Wald test under a variety of simulation scenarios with the proposed procedures. The simulated distribution was used to estimate the null distribution of test statistics in order to achieve the desired false positive control and was compared to the asymptotic Chi-square distribution. In addition, we applied the procedure for paired designs to the TCGA breast cancer data set.

CONCLUSIONS

In summary, we provide a framework for power estimation of RNA-Seq differential expression under complex experimental designs. Simulation results demonstrate that both the proposed procedures properly control the false positive rate at the nominal level.

摘要

背景

在当前的生物医学研究实验设计中,功效分析成为一个必不可少的步骤。在 RNA-Seq 实验中,通常会使用允许各种相关结构的复杂设计。然而,目前该领域缺乏用于此类设计的 RNA-Seq 差异表达研究的统计方法来计算样本量和估计功效。为了弥补这一空白,基于模拟的方法具有很大的优势,因为对于这种设计,测试统计量的理论分布通常是不可用的,所以可以提供数值解。

结果

在本文中,我们提出了一种新的模拟方法,用于基于广义线性混合效应模型对相关表达数据进行差异表达功效估计。我们还提出了一种新的基于双变量负二项式分布的配对设计差异表达功效估计方法。我们比较了在不同模拟场景下,这两种方法(似然比检验和 Wald 检验)与所提出的方法的性能。模拟分布用于估计测试统计量的零分布,以实现所需的假阳性控制,并与渐近卡方分布进行了比较。此外,我们还将配对设计的方法应用于 TCGA 乳腺癌数据集。

结论

总之,我们提供了一种在复杂实验设计下进行 RNA-Seq 差异表达功效估计的框架。模拟结果表明,这两种方法都能在名义水平上正确控制假阳性率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/93b76cdb9ca3/12859_2020_3541_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/1fe34927df6e/12859_2020_3541_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/52e559a4a4ca/12859_2020_3541_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/d8aa3a8fbf26/12859_2020_3541_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/17aeebe8ab44/12859_2020_3541_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/816e0895b43c/12859_2020_3541_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/ecf3b80798c9/12859_2020_3541_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/93b76cdb9ca3/12859_2020_3541_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/1fe34927df6e/12859_2020_3541_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/52e559a4a4ca/12859_2020_3541_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/d8aa3a8fbf26/12859_2020_3541_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/17aeebe8ab44/12859_2020_3541_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/816e0895b43c/12859_2020_3541_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/ecf3b80798c9/12859_2020_3541_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1705/7236949/93b76cdb9ca3/12859_2020_3541_Fig7_HTML.jpg

相似文献

1
Power analysis for RNA-Seq differential expression studies using generalized linear mixed effects models.基于广义线性混合效应模型的 RNA-Seq 差异表达研究的功效分析。
BMC Bioinformatics. 2020 May 19;21(1):198. doi: 10.1186/s12859-020-3541-7.
2
Power analysis for RNA-Seq differential expression studies.RNA测序差异表达研究的功效分析。
BMC Bioinformatics. 2017 May 3;18(1):234. doi: 10.1186/s12859-017-1648-2.
3
Sample size calculations for the differential expression analysis of RNA-seq data using a negative binomial regression model.使用负二项回归模型对RNA测序数据进行差异表达分析的样本量计算。
Stat Appl Genet Mol Biol. 2019 Jan 22;18(1):/j/sagmb.2019.18.issue-1/sagmb-2018-0021/sagmb-2018-0021.xml. doi: 10.1515/sagmb-2018-0021.
4
Power analysis and sample size estimation for RNA-Seq differential expression.RNA测序差异表达的功效分析与样本量估计
RNA. 2014 Nov;20(11):1684-96. doi: 10.1261/rna.046011.114. Epub 2014 Sep 22.
5
Sample size calculation while controlling false discovery rate for differential expression analysis with RNA-sequencing experiments.在RNA测序实验的差异表达分析中控制错误发现率时的样本量计算。
BMC Bioinformatics. 2016 Mar 31;17:146. doi: 10.1186/s12859-016-0994-9.
6
An evaluation of RNA-seq differential analysis methods.RNA-seq 差异分析方法评估。
PLoS One. 2022 Sep 16;17(9):e0264246. doi: 10.1371/journal.pone.0264246. eCollection 2022.
7
Large scale maximum average power multiple inference on time-course count data with application to RNA-seq analysis.大规模最大平均功率多重推断在时间序列计数数据中的应用及在 RNA-seq 分析中的应用。
Biometrics. 2020 Mar;76(1):9-22. doi: 10.1111/biom.13144. Epub 2019 Nov 14.
8
Sample size calculation based on exact test for assessing differential expression analysis in RNA-seq data.基于确切检验的 RNA-seq 数据差异表达分析样本量计算。
BMC Bioinformatics. 2013 Dec 6;14:357. doi: 10.1186/1471-2105-14-357.
9
A simulation study for comparing testing statistics in response-adaptive randomization.一种用于比较响应自适应随机化中检验统计量的仿真研究。
BMC Med Res Methodol. 2010 Jun 5;10:48. doi: 10.1186/1471-2288-10-48.
10
MCMSeq: Bayesian hierarchical modeling of clustered and repeated measures RNA sequencing experiments.MCMSeq:用于聚类和重复测量 RNA 测序实验的贝叶斯层次模型。
BMC Bioinformatics. 2020 Aug 28;21(1):375. doi: 10.1186/s12859-020-03715-y.

引用本文的文献

1
Differentially Expressed Genes, miRNAs and Network Models: A Strategy to Shed Light on Molecular Interactions Driving HNSCC Tumorigenesis.差异表达基因、miRNA与网络模型:揭示驱动头颈部鳞状细胞癌发生的分子相互作用的策略
Cancers (Basel). 2023 Sep 4;15(17):4420. doi: 10.3390/cancers15174420.
2
Randomized quantile residuals for diagnosing zero-inflated generalized linear mixed models with applications to microbiome count data.随机分位数残差在诊断零膨胀广义线性混合模型中的应用——以微生物组计数数据为例。
BMC Bioinformatics. 2021 Nov 25;22(1):564. doi: 10.1186/s12859-021-04371-6.
3
Differential expression analysis using a model-based gene clustering algorithm for RNA-seq data.

本文引用的文献

1
BALLI: Bartlett-adjusted likelihood-based linear model approach for identifying differentially expressed genes with RNA-seq data.BALLI:基于 Bartlett 调整似然比的线性模型方法,用于鉴定 RNA-seq 数据中差异表达的基因。
BMC Genomics. 2019 Jul 2;20(1):540. doi: 10.1186/s12864-019-5851-6.
2
Epigenetic control of innate and adaptive immune memory.先天和适应性免疫记忆的表观遗传控制。
Nat Immunol. 2018 Sep;19(9):963-972. doi: 10.1038/s41590-018-0176-1. Epub 2018 Aug 6.
3
B cell activation and plasma cell differentiation are inhibited by de novo DNA methylation.
基于模型的基因聚类算法在 RNA-seq 数据中的差异表达分析。
BMC Bioinformatics. 2021 Oct 20;22(1):511. doi: 10.1186/s12859-021-04438-4.
4
Dream: powerful differential expression analysis for repeated measures designs.梦境:重复测量设计的强大差异表达分析。
Bioinformatics. 2021 Apr 19;37(2):192-201. doi: 10.1093/bioinformatics/btaa687.
B 细胞活化和浆细胞分化受从头 DNA 甲基化抑制。
Nat Commun. 2018 May 15;9(1):1900. doi: 10.1038/s41467-018-04234-4.
4
Cellular Differentiation of Human Monocytes Is Regulated by Time-Dependent Interleukin-4 Signaling and the Transcriptional Regulator NCOR2.人类单核细胞的细胞分化受时间依赖性白细胞介素-4信号和转录调节因子NCOR2调控。
Immunity. 2017 Dec 19;47(6):1051-1066.e12. doi: 10.1016/j.immuni.2017.11.024.
5
A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles.下一代连接图谱:L1000平台及首批100万个图谱
Cell. 2017 Nov 30;171(6):1437-1452.e17. doi: 10.1016/j.cell.2017.10.049.
6
Meta-signature of human endometrial receptivity: a meta-analysis and validation study of transcriptomic biomarkers.人类子宫内膜容受性的特征:转录组生物标志物的荟萃分析和验证研究。
Sci Rep. 2017 Aug 30;7(1):10077. doi: 10.1038/s41598-017-10098-3.
7
Power analysis for RNA-Seq differential expression studies.RNA测序差异表达研究的功效分析。
BMC Bioinformatics. 2017 May 3;18(1):234. doi: 10.1186/s12859-017-1648-2.
8
Differential expression analysis for RNAseq using Poisson mixed models.使用泊松混合模型对RNA测序数据进行差异表达分析。
Nucleic Acids Res. 2017 Jun 20;45(11):e106. doi: 10.1093/nar/gkx204.
9
Sample size calculation while controlling false discovery rate for differential expression analysis with RNA-sequencing experiments.在RNA测序实验的差异表达分析中控制错误发现率时的样本量计算。
BMC Bioinformatics. 2016 Mar 31;17:146. doi: 10.1186/s12859-016-0994-9.
10
What if we ignore the random effects when analyzing RNA-seq data in a multifactor experiment.在多因素实验中分析RNA测序数据时,如果我们忽略随机效应会怎样?
Stat Appl Genet Mol Biol. 2016 Apr;15(2):87-105. doi: 10.1515/sagmb-2015-0011.