• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

整合均值和方差异质性以识别差异表达基因。

Integrating mean and variance heterogeneities to identify differentially expressed genes.

作者信息

Ouyang Weiwei, An Qiang, Zhao Jinying, Qin Huaizhen

机构信息

Department of Global Biostatistics and Data Science, Tulane University School of Public Health and Tropical Medicine, 1440 Canal Street, Suite 2001, New Orleans, LA, 70112, USA.

Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, 1440 Canal Street, New Orleans, LA, 70112, USA.

出版信息

BMC Bioinformatics. 2016 Dec 6;17(1):497. doi: 10.1186/s12859-016-1393-y.

DOI:10.1186/s12859-016-1393-y
PMID:27923367
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5139036/
Abstract

BACKGROUND

In functional genomics studies, tests on mean heterogeneity have been widely employed to identify differentially expressed genes with distinct mean expression levels under different experimental conditions. Variance heterogeneity (aka, the difference between condition-specific variances) of gene expression levels is simply neglected or calibrated for as an impediment. The mean heterogeneity in the expression level of a gene reflects one aspect of its distribution alteration; and variance heterogeneity induced by condition change may reflect another aspect. Change in condition may alter both mean and some higher-order characteristics of the distributions of expression levels of susceptible genes.

RESULTS

In this report, we put forth a conception of mean-variance differentially expressed (MVDE) genes, whose expression means and variances are sensitive to the change in experimental condition. We mathematically proved the null independence of existent mean heterogeneity tests and variance heterogeneity tests. Based on the independence, we proposed an integrative mean-variance test (IMVT) to combine gene-wise mean heterogeneity and variance heterogeneity induced by condition change. The IMVT outperformed its competitors under comprehensive simulations of normality and Laplace settings. For moderate samples, the IMVT well controlled type I error rates, and so did existent mean heterogeneity test (i.e., the Welch t test (WT), the moderated Welch t test (MWT)) and the procedure of separate tests on mean and variance heterogeneities (SMVT), but the likelihood ratio test (LRT) severely inflated type I error rates. In presence of variance heterogeneity, the IMVT appeared noticeably more powerful than all the valid mean heterogeneity tests. Application to the gene profiles of peripheral circulating B raised solid evidence of informative variance heterogeneity. After adjusting for background data structure, the IMVT replicated previous discoveries and identified novel experiment-wide significant MVDE genes.

CONCLUSIONS

Our results indicate tremendous potential gain of integrating informative variance heterogeneity after adjusting for global confounders and background data structure. The proposed informative integration test better summarizes the impacts of condition change on expression distributions of susceptible genes than do the existent competitors. Therefore, particular attention should be paid to explicitly exploit the variance heterogeneity induced by condition change in functional genomics analysis.

摘要

背景

在功能基因组学研究中,对均值异质性的检验已被广泛用于识别在不同实验条件下具有不同平均表达水平的差异表达基因。基因表达水平的方差异质性(即特定条件方差之间的差异)却被简单地忽略或作为一种干扰因素进行校准。基因表达水平的均值异质性反映了其分布变化的一个方面;而条件变化引起的方差异质性可能反映了另一个方面。条件的改变可能会改变易感基因表达水平分布的均值和一些高阶特征。

结果

在本报告中,我们提出了均值 - 方差差异表达(MVDE)基因的概念,其表达均值和方差对实验条件的变化敏感。我们从数学上证明了现有均值异质性检验和方差异质性检验的零独立性。基于这种独立性,我们提出了一种综合均值 - 方差检验(IMVT),以结合由条件变化引起的基因层面的均值异质性和方差异质性。在正态性和拉普拉斯设置的综合模拟下,IMVT优于其竞争对手。对于中等样本量,IMVT能很好地控制I型错误率,现有的均值异质性检验(即 Welch t检验(WT)、适度 Welch t检验(MWT))以及均值和方差异质性的单独检验程序(SMVT)也是如此,但似然比检验(LRT)严重夸大了I型错误率。在存在方差异质性的情况下,IMVT明显比所有有效的均值异质性检验更具功效。将其应用于外周循环B细胞的基因谱,有力地证明了信息性方差异质性的存在。在调整背景数据结构后,IMVT重复了之前发现的结果,并识别出了全实验范围内新的显著MVDE基因。

结论

我们的结果表明,在调整全局混杂因素和背景数据结构后,整合信息性方差异质性具有巨大的潜在收益。所提出的信息性整合检验比现有的竞争对手能更好地总结条件变化对易感基因表达分布的影响。因此,在功能基因组学分析中应特别注意明确利用由条件变化引起的方差异质性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/727aa2fe2b68/12859_2016_1393_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/27fdabf6053d/12859_2016_1393_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/1da67f7b434c/12859_2016_1393_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/a0627049dcbd/12859_2016_1393_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/5fc0b9378e9a/12859_2016_1393_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/3252c61ef0d8/12859_2016_1393_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/99a54ef3046c/12859_2016_1393_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/69db3e798974/12859_2016_1393_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/19ed9dfee80d/12859_2016_1393_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/1b8943659425/12859_2016_1393_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/56ed9e7f56c7/12859_2016_1393_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/727aa2fe2b68/12859_2016_1393_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/27fdabf6053d/12859_2016_1393_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/1da67f7b434c/12859_2016_1393_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/a0627049dcbd/12859_2016_1393_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/5fc0b9378e9a/12859_2016_1393_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/3252c61ef0d8/12859_2016_1393_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/99a54ef3046c/12859_2016_1393_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/69db3e798974/12859_2016_1393_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/19ed9dfee80d/12859_2016_1393_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/1b8943659425/12859_2016_1393_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/56ed9e7f56c7/12859_2016_1393_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f284/5139036/727aa2fe2b68/12859_2016_1393_Fig11_HTML.jpg

相似文献

1
Integrating mean and variance heterogeneities to identify differentially expressed genes.整合均值和方差异质性以识别差异表达基因。
BMC Bioinformatics. 2016 Dec 6;17(1):497. doi: 10.1186/s12859-016-1393-y.
2
Unequal group variances in microarray data analyses.微阵列数据分析中组方差不相等。
Bioinformatics. 2008 May 1;24(9):1168-74. doi: 10.1093/bioinformatics/btn100. Epub 2008 Mar 14.
3
Arrow plot: a new graphical tool for selecting up and down regulated genes and genes differentially expressed on sample subgroups.箭头图:一种新的图形工具,用于选择上调和下调的基因以及在样本亚组中差异表达的基因。
BMC Bioinformatics. 2012 Jun 26;13:147. doi: 10.1186/1471-2105-13-147.
4
BALLI: Bartlett-adjusted likelihood-based linear model approach for identifying differentially expressed genes with RNA-seq data.BALLI:基于 Bartlett 调整似然比的线性模型方法,用于鉴定 RNA-seq 数据中差异表达的基因。
BMC Genomics. 2019 Jul 2;20(1):540. doi: 10.1186/s12864-019-5851-6.
5
Detecting differentially expressed genes in heterogeneous diseases using control-only analysis of variance.使用仅对照方差分析检测异质疾病中的差异表达基因。
Ann Epidemiol. 2012 Aug;22(8):598-602. doi: 10.1016/j.annepidem.2012.04.017. Epub 2012 May 31.
6
CHESS (CgHExpreSS): a comprehensive analysis tool for the analysis of genomic alterations and their effects on the expression profile of the genome.CHESS(CgHExpreSS):一种全面的分析工具,用于分析基因组改变及其对基因组表达谱的影响。
BMC Bioinformatics. 2009 Dec 16;10:424. doi: 10.1186/1471-2105-10-424.
7
The Global Error Assessment (GEA) model for the selection of differentially expressed genes in microarray data.用于在微阵列数据中选择差异表达基因的全局误差评估(GEA)模型。
Bioinformatics. 2004 Nov 1;20(16):2726-37. doi: 10.1093/bioinformatics/bth319. Epub 2004 May 14.
8
Performance of five two-sample location tests for skewed distributions with unequal variances.五种用于具有不等方差的偏态分布的两样本位置检验的性能。
Contemp Clin Trials. 2009 Sep;30(5):490-6. doi: 10.1016/j.cct.2009.06.007. Epub 2009 Jul 2.
9
EVE (external variance estimation) increases statistical power for detecting differentially expressed genes.EVE(外部方差估计)提高了检测差异表达基因的统计功效。
Plant J. 2007 Nov;52(3):561-9. doi: 10.1111/j.1365-313X.2007.03227.x. Epub 2007 Aug 3.
10
A versatile omnibus test for detecting mean and variance heterogeneity.一种通用的综合检验方法,用于检测均值和方差异质性。
Genet Epidemiol. 2014 Jan;38(1):51-59. doi: 10.1002/gepi.21778.

引用本文的文献

1
Stochastic expression of invasion genes in Plasmodium falciparum schizonts.疟原虫裂殖子中入侵基因的随机表达。
Nat Commun. 2022 May 30;13(1):3004. doi: 10.1038/s41467-022-30605-z.
2
Identification of differentially distributed gene expression and distinct sets of cancer-related genes identified by changes in mean and variability.通过均值和变异性变化鉴定差异分布的基因表达以及不同的癌症相关基因集。
NAR Genom Bioinform. 2022 Jan 14;4(1):lqab124. doi: 10.1093/nargab/lqab124. eCollection 2022 Mar.
3
Erratum to: Integrating mean and variance heterogeneities to identify differentially expressed genes.

本文引用的文献

1
Fluticasone furoate is more effective than mometasone furoate in restoring tobacco smoke inhibited SOCS-3 expression in airway epithelial cells.氟替卡松糠酸酯在恢复气道上皮细胞中受烟草烟雾抑制的 SOCS-3 表达方面比糠酸莫米松更有效。
Int Immunopharmacol. 2014 Mar;19(1):153-60. doi: 10.1016/j.intimp.2013.12.029. Epub 2014 Jan 13.
2
STAT3 modulates cigarette smoke-induced inflammation and protease expression.STAT3 调节香烟烟雾引起的炎症和蛋白酶表达。
Front Physiol. 2013 Oct 1;4:267. doi: 10.3389/fphys.2013.00267. eCollection 2013.
3
DNA methylation in nasal epithelial cells from smokers: identification of ULBP3-related effects.
《整合均值和方差异质性以识别差异表达基因》勘误
BMC Bioinformatics. 2017 Feb 3;18(1):89. doi: 10.1186/s12859-017-1507-1.
吸烟者鼻上皮细胞中的 DNA 甲基化:ULBP3 相关作用的鉴定。
Am J Physiol Lung Cell Mol Physiol. 2013 Sep 15;305(6):L432-8. doi: 10.1152/ajplung.00116.2013. Epub 2013 Jul 5.
4
The details in the distributions: why and how to study phenotypic variability.分布中的细节:为什么以及如何研究表型可变性。
Curr Opin Biotechnol. 2013 Aug;24(4):752-9. doi: 10.1016/j.copbio.2013.03.010. Epub 2013 Apr 6.
5
Genetic variants contribute to gene expression variability in humans.遗传变异导致人类基因表达的变异性。
Genetics. 2013 Jan;193(1):95-108. doi: 10.1534/genetics.112.146779. Epub 2012 Nov 12.
6
Inheritance beyond plain heritability: variance-controlling genes in Arabidopsis thaliana.超越单纯遗传力的遗传:拟南芥中的方差控制基因。
PLoS Genet. 2012;8(8):e1002839. doi: 10.1371/journal.pgen.1002839. Epub 2012 Aug 2.
7
Using control genes to correct for unwanted variation in microarray data.利用对照基因纠正微阵列数据中的非期望变异。
Biostatistics. 2012 Jul;13(3):539-52. doi: 10.1093/biostatistics/kxr034. Epub 2011 Nov 17.
8
Detecting major genetic loci controlling phenotypic variability in experimental crosses.检测控制实验杂交中表型变异的主要遗传基因座。
Genetics. 2011 Jun;188(2):435-47. doi: 10.1534/genetics.111.127068. Epub 2011 Apr 5.
9
Should we abandon the t-test in the analysis of gene expression microarray data: a comparison of variance modeling strategies.我们是否应该放弃基因表达微阵列数据分析中的 t 检验:方差建模策略的比较。
PLoS One. 2010 Sep 3;5(9):e12336. doi: 10.1371/journal.pone.0012336.
10
Genetic variation and antioxidant response gene expression in the bronchial airway epithelium of smokers at risk for lung cancer.吸烟者肺癌风险支气管气道上皮细胞中的遗传变异和抗氧化反应基因表达。
PLoS One. 2010 Aug 3;5(8):e11934. doi: 10.1371/journal.pone.0011934.