• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

混合群体 RNA-Seq 的 ANOVA 样差异表达 (ALDEx) 分析。

ANOVA-like differential expression (ALDEx) analysis for mixed population RNA-Seq.

机构信息

YouKaryote Genomics, London, Ontario, Canada.

出版信息

PLoS One. 2013 Jul 2;8(7):e67019. doi: 10.1371/journal.pone.0067019. Print 2013.

DOI:10.1371/journal.pone.0067019
PMID:23843979
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3699591/
Abstract

Experimental variance is a major challenge when dealing with high-throughput sequencing data. This variance has several sources: sampling replication, technical replication, variability within biological conditions, and variability between biological conditions. The high per-sample cost of RNA-Seq often precludes the large number of experiments needed to partition observed variance into these categories as per standard ANOVA models. We show that the partitioning of within-condition to between-condition variation cannot reasonably be ignored, whether in single-organism RNA-Seq or in Meta-RNA-Seq experiments, and further find that commonly-used RNA-Seq analysis tools, as described in the literature, do not enforce the constraint that the sum of relative expression levels must be one, and thus report expression levels that are systematically distorted. These two factors lead to misleading inferences if not properly accommodated. As it is usually only the biological between-condition and within-condition differences that are of interest, we developed ALDEx, an ANOVA-like differential expression procedure, to identify genes with greater between- to within-condition differences. We show that the presence of differential expression and the magnitude of these comparative differences can be reasonably estimated with even very small sample sizes.

摘要

当处理高通量测序数据时,实验方差是一个主要的挑战。这种方差有几个来源:采样复制、技术复制、生物条件内的可变性以及生物条件之间的可变性。RNA-Seq 的每个样本成本很高,通常无法按照标准的 ANOVA 模型将观察到的方差分成这些类别。我们表明,无论在单个生物体的 RNA-Seq 还是在 Meta-RNA-Seq 实验中,都不能合理地忽略条件内的方差分配到条件之间的变化,进一步发现文献中描述的常用 RNA-Seq 分析工具没有强制实施相对表达水平之和必须为一的约束,因此报告的表达水平会受到系统扭曲。如果不妥善处理这两个因素,就会导致误导性的推断。由于通常只有生物学上的条件之间和条件内的差异是感兴趣的,因此我们开发了 ALDEx,一种类似于 ANOVA 的差异表达程序,用于识别具有更大条件之间与条件内差异的基因。我们表明,即使在非常小的样本量下,也可以合理地估计差异表达的存在及其比较差异的幅度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/8264b2bb4895/pone.0067019.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/c7095d8bdadf/pone.0067019.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/8d33d94f9cca/pone.0067019.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/5ed8df7380d1/pone.0067019.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/baa616308b5f/pone.0067019.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/c4bf1548b4da/pone.0067019.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/72e2e11381a3/pone.0067019.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/2f563ab5c706/pone.0067019.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/c9b3832852bf/pone.0067019.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/8264b2bb4895/pone.0067019.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/c7095d8bdadf/pone.0067019.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/8d33d94f9cca/pone.0067019.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/5ed8df7380d1/pone.0067019.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/baa616308b5f/pone.0067019.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/c4bf1548b4da/pone.0067019.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/72e2e11381a3/pone.0067019.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/2f563ab5c706/pone.0067019.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/c9b3832852bf/pone.0067019.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cea9/3699591/8264b2bb4895/pone.0067019.g009.jpg

相似文献

1
ANOVA-like differential expression (ALDEx) analysis for mixed population RNA-Seq.混合群体 RNA-Seq 的 ANOVA 样差异表达 (ALDEx) 分析。
PLoS One. 2013 Jul 2;8(7):e67019. doi: 10.1371/journal.pone.0067019. Print 2013.
2
Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data.RNA测序数据差异基因表达分析方法的综合评估
Genome Biol. 2013;14(9):R95. doi: 10.1186/gb-2013-14-9-r95.
3
What if we ignore the random effects when analyzing RNA-seq data in a multifactor experiment.在多因素实验中分析RNA测序数据时,如果我们忽略随机效应会怎样?
Stat Appl Genet Mol Biol. 2016 Apr;15(2):87-105. doi: 10.1515/sagmb-2015-0011.
4
Removing technical variability in RNA-seq data using conditional quantile normalization.使用条件分位数归一化去除 RNA-seq 数据中的技术变异性。
Biostatistics. 2012 Apr;13(2):204-16. doi: 10.1093/biostatistics/kxr054. Epub 2012 Jan 27.
5
How does normalization impact RNA-seq disease diagnosis?归一化如何影响 RNA-seq 疾病诊断?
J Biomed Inform. 2018 Sep;85:80-92. doi: 10.1016/j.jbi.2018.07.016. Epub 2018 Jul 21.
6
A flexible count data model to fit the wide diversity of expression profiles arising from extensively replicated RNA-seq experiments.一种灵活的计数数据模型,可适用于广泛复制的 RNA-seq 实验所产生的广泛多样化的表达谱。
BMC Bioinformatics. 2013 Aug 21;14:254. doi: 10.1186/1471-2105-14-254.
7
Computational methods for transcriptome annotation and quantification using RNA-seq.基于 RNA-seq 的转录组注释和定量的计算方法。
Nat Methods. 2011 Jun;8(6):469-77. doi: 10.1038/nmeth.1613. Epub 2011 May 27.
8
mRNA enrichment protocols determine the quantification characteristics of external RNA spike-in controls in RNA-Seq studies.mRNA 富集方案决定了 RNA-Seq 研究中外源 RNA Spike-in 对照品定量的特点。
Sci China Life Sci. 2013 Feb;56(2):134-42. doi: 10.1007/s11427-013-4437-9. Epub 2013 Feb 8.
9
Sample size calculations for the differential expression analysis of RNA-seq data using a negative binomial regression model.使用负二项回归模型对RNA测序数据进行差异表达分析的样本量计算。
Stat Appl Genet Mol Biol. 2019 Jan 22;18(1):/j/sagmb.2019.18.issue-1/sagmb-2018-0021/sagmb-2018-0021.xml. doi: 10.1515/sagmb-2018-0021.
10
Do count-based differential expression methods perform poorly when genes are expressed in only one condition?当基因仅在一种条件下表达时,基于计数的差异表达方法表现不佳吗?
Genome Biol. 2015 Oct 8;16:222. doi: 10.1186/s13059-015-0781-3.

引用本文的文献

1
Identification of nasopharyngeal microbial dysbiosis in COVID-19 patients by 16S rRNA gene sequencing.通过16S rRNA基因测序鉴定COVID-19患者鼻咽部微生物群落失调
Front Microbiol. 2025 Aug 29;16:1631198. doi: 10.3389/fmicb.2025.1631198. eCollection 2025.
2
Microbial Community Associations With Listeria monocytogenes in Food Processing Environments: A Systematic Review and Meta-Analysis.食品加工环境中与单核细胞增生李斯特菌相关的微生物群落:系统评价与荟萃分析
Compr Rev Food Sci Food Saf. 2025 Sep;24(5):e70277. doi: 10.1111/1541-4337.70277.
3
Blueberry Consumption in Early Life and Its Effects on Allergy, Immune Biomarkers, and Their Association with the Gut Microbiome.

本文引用的文献

1
Comparative meta-RNA-seq of the vaginal microbiota and differential expression by Lactobacillus iners in health and dysbiosis.比较阴道微生物组的宏 RNA 测序和健康与失调状态下非优势乳杆菌的差异表达。
Microbiome. 2013 Apr 12;1(1):12. doi: 10.1186/2049-2618-1-12.
2
Empirical Bayesian analysis of paired high-throughput sequencing data with a beta-binomial distribution.基于贝塔二项式分布的高通量测序数据配对的经验贝叶斯分析。
BMC Bioinformatics. 2013 Apr 23;14:135. doi: 10.1186/1471-2105-14-135.
3
Hypothesis testing and power calculations for taxonomic-based human microbiome data.
早期生活中蓝莓的摄入及其对过敏、免疫生物标志物的影响以及它们与肠道微生物群的关联。
Nutrients. 2025 Aug 28;17(17):2795. doi: 10.3390/nu17172795.
4
The cervicovaginal microbiome of pregnant people living with HIV on antiretroviral therapy in the Democratic Republic of Congo: A Pilot Study and Global Meta-analysis.刚果民主共和国接受抗逆转录病毒治疗的感染艾滋病毒孕妇的宫颈阴道微生物群:一项试点研究和全球荟萃分析。
bioRxiv. 2025 Aug 21:2025.08.18.670785. doi: 10.1101/2025.08.18.670785.
5
Altered gut microbial networks and metabolic pathways in multiple system atrophy: a comparative 16S rRNA study.多系统萎缩中肠道微生物网络和代谢途径的改变:一项比较性16S rRNA研究
Front Neurosci. 2025 Aug 13;19:1623165. doi: 10.3389/fnins.2025.1623165. eCollection 2025.
6
Lubiprostone in chronic kidney disease: Insights into mitochondrial function and polyamines from a randomized phase 2 clinical trial.鲁比前列酮用于慢性肾脏病:一项随机2期临床试验对线粒体功能和多胺的见解
Sci Adv. 2025 Aug 29;11(35):eadw3934. doi: 10.1126/sciadv.adw3934.
7
Heat stress and soil thermal gradients shape root-associated fungal community recruitment.热胁迫和土壤热梯度影响根系相关真菌群落的招募。
Front Microbiol. 2025 Aug 12;16:1334648. doi: 10.3389/fmicb.2025.1334648. eCollection 2025.
8
Prebiotic Xylo-Oligosaccharides Modulate the Gut Microbiome to Improve Innate Immunity and Gut Barrier Function and Enhance Performance in Piglets Experiencing Post-Weaning Diarrhoea.益生元木寡糖调节仔猪肠道微生物群,以改善先天免疫和肠道屏障功能,并提高断奶后腹泻仔猪的性能。
Microorganisms. 2025 Jul 28;13(8):1760. doi: 10.3390/microorganisms13081760.
9
Clinical and microbiological effectiveness of limosilactobacillus reuteri in supportive periodontal therapy: randomized clinical trial.罗伊氏乳杆菌在支持性牙周治疗中的临床及微生物学疗效:随机临床试验
Clin Oral Investig. 2025 Aug 26;29(9):422. doi: 10.1007/s00784-025-06508-w.
10
Subspecific variation in gut microbiota of North American bison in a sympatric setting reveals differentially abundant taxa.同域环境下北美野牛肠道微生物群的亚种变异揭示了丰度有差异的分类群。
Anim Microbiome. 2025 Aug 21;7(1):89. doi: 10.1186/s42523-025-00451-7.
基于分类的人类微生物组数据的假设检验和功效计算。
PLoS One. 2012;7(12):e52078. doi: 10.1371/journal.pone.0052078. Epub 2012 Dec 20.
4
Inferring correlation networks from genomic survey data.从基因组普查数据推断关联网络。
PLoS Comput Biol. 2012;8(9):e1002687. doi: 10.1371/journal.pcbi.1002687. Epub 2012 Sep 20.
5
A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae.基于 RNA-Seq 的转录组分析从读段到差异基因表达的全面比较,以及与微阵列的交叉比较:以酿酒酵母为例的研究。
Nucleic Acids Res. 2012 Nov 1;40(20):10084-97. doi: 10.1093/nar/gks804. Epub 2012 Sep 10.
6
Dirichlet multinomial mixtures: generative models for microbial metagenomics.狄利克雷多项混合模型:微生物宏基因组学的生成模型。
PLoS One. 2012;7(2):e30126. doi: 10.1371/journal.pone.0030126. Epub 2012 Feb 3.
7
A comparison of statistical methods for detecting differentially expressed genes from RNA-seq data.基于 RNA-seq 数据的差异表达基因检测的统计学方法比较。
Am J Bot. 2012 Feb;99(2):248-56. doi: 10.3732/ajb.1100340. Epub 2012 Jan 20.
8
The real cost of sequencing: higher than you think!测序的真实成本:比你想象的更高!
Genome Biol. 2011 Aug 25;12(8):125. doi: 10.1186/gb-2011-12-8-125.
9
TopHat-Fusion: an algorithm for discovery of novel fusion transcripts.TopHat-Fusion:一种用于发现新型融合转录本的算法。
Genome Biol. 2011 Aug 11;12(8):R72. doi: 10.1186/gb-2011-12-8-r72.
10
RNA-seq: technical variability and sampling.RNA-seq:技术变异性和采样。
BMC Genomics. 2011 Jun 6;12:293. doi: 10.1186/1471-2164-12-293.