• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

稳健多变量方差分析及其在从寡核苷酸阵列中检测差异表达基因方面的应用

Robustified MANOVA with applications in detecting differentially expressed genes from oligonucleotide arrays.

作者信息

Xu Jin, Cui Xinping

机构信息

Department of Statistics, East China Normal University, Shanghai 200241, China.

出版信息

Bioinformatics. 2008 Apr 15;24(8):1056-62. doi: 10.1093/bioinformatics/btn053. Epub 2008 Mar 3.

DOI:10.1093/bioinformatics/btn053
PMID:18316342
Abstract

MOTIVATION

Oligonucleotide arrays such as Affymetrix GeneChips use multiple probes, or a probe set, to measure the abundance of mRNA of every gene of interest. Some analysis methods attempt to summarize the multiple observations into one single score before conducting further analysis such as detecting differentially expressed genes (DEG), clustering and classification. However, there is a risk of losing a significant amount of information and consequently reaching inaccurate or even incorrect conclusions during this data reduction.

RESULTS

We developed a novel statistical method called robustified multivariate analysis of variance (MANOVA) based on the traditional MANOVA model and permutation test to detect DEG for both one-way and two-way cases. It can be extended to detect some special patterns of gene expression through profile analysis across k (>or=2) populations. The method utilizes probe-level data and requires no assumptions about the distribution of the dataset. We also propose a method of estimating the null distribution using quantile normalization in contrast to the 'pooling' method (Section 3.1). Monte Carlo simulation and real data analysis are conducted to demonstrate the performance of the proposed method comparing with the 'pooling' method and the usual Analysis of Variance (ANOVA) test based on the summarized scores. It is found that the new method successfully detects DEG under desired false discovery rate and is more powerful than the competing method especially when the number of groups is small.

AVAILABILITY

The package of robustified MANOVA can be downloaded from http://faculty.ucr.edu/~xpcui/software

摘要

动机

诸如Affymetrix基因芯片之类的寡核苷酸阵列使用多个探针或一个探针集来测量每个感兴趣基因的mRNA丰度。一些分析方法试图在进行进一步分析(如检测差异表达基因(DEG)、聚类和分类)之前,将多个观测值汇总为一个单一分数。然而,在这种数据简化过程中,存在丢失大量信息的风险,从而在得出结论时可能不准确甚至错误。

结果

我们基于传统的多变量方差分析(MANOVA)模型和置换检验,开发了一种名为稳健多变量方差分析(MANOVA)的新型统计方法,用于检测单向和双向情况下的DEG。它可以通过跨k(≥2)个群体的轮廓分析扩展到检测基因表达的一些特殊模式。该方法利用探针水平的数据,并且不需要对数据集的分布做任何假设。与“合并”方法(3.1节)相比,我们还提出了一种使用分位数归一化估计零分布的方法。进行了蒙特卡罗模拟和实际数据分析,以证明所提出的方法与“合并”方法以及基于汇总分数的常规方差分析(ANOVA)检验相比的性能。结果发现,新方法能够在期望的错误发现率下成功检测DEG,并且比竞争方法更强大,尤其是在组数较少时。

可用性

稳健MANOVA软件包可从http://faculty.ucr.edu/~xpcui/software下载

相似文献

1
Robustified MANOVA with applications in detecting differentially expressed genes from oligonucleotide arrays.稳健多变量方差分析及其在从寡核苷酸阵列中检测差异表达基因方面的应用
Bioinformatics. 2008 Apr 15;24(8):1056-62. doi: 10.1093/bioinformatics/btn053. Epub 2008 Mar 3.
2
Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.基于疾病谱数据中错误发现率的七种生成Affymetrix表达分数方法的比较。
BMC Bioinformatics. 2005 Feb 10;6:26. doi: 10.1186/1471-2105-6-26.
3
Leveraging two-way probe-level block design for identifying differential gene expression with high-density oligonucleotide arrays.利用双向探针水平块设计通过高密度寡核苷酸阵列鉴定差异基因表达。
BMC Bioinformatics. 2004 Apr 20;5:42. doi: 10.1186/1471-2105-5-42.
4
A new outlier removal approach for cDNA microarray normalization.一种用于cDNA微阵列标准化的新离群值去除方法。
Biotechniques. 2009 Aug;47(2):691-2, 694-700. doi: 10.2144/000113195.
5
Identification of differentially expressed gene categories in microarray studies using nonparametric multivariate analysis.使用非参数多变量分析在微阵列研究中鉴定差异表达的基因类别。
Bioinformatics. 2008 Jan 15;24(2):192-201. doi: 10.1093/bioinformatics/btm583. Epub 2007 Nov 27.
6
Quick calculation for sample size while controlling false discovery rate with application to microarray analysis.在控制错误发现率的同时进行样本量的快速计算及其在微阵列分析中的应用。
Bioinformatics. 2007 Mar 15;23(6):739-46. doi: 10.1093/bioinformatics/btl664. Epub 2007 Jan 19.
7
Gene expression and isoform variation analysis using Affymetrix Exon Arrays.使用Affymetrix外显子芯片进行基因表达和异构体变异分析。
BMC Genomics. 2008 Nov 7;9:529. doi: 10.1186/1471-2164-9-529.
8
Probe rank approaches for gene selection in oligonucleotide arrays with a small number of replicates.探索少量重复样本的寡核苷酸阵列中基因选择的排序方法。
Bioinformatics. 2005 Jun 15;21(12):2861-6. doi: 10.1093/bioinformatics/bti413. Epub 2005 Apr 6.
9
Multidimensional local false discovery rate for microarray studies.微阵列研究的多维局部错误发现率
Bioinformatics. 2006 Mar 1;22(5):556-65. doi: 10.1093/bioinformatics/btk013. Epub 2005 Dec 20.
10
Combining multiple microarrays in the presence of controlling variables.在存在控制变量的情况下合并多个微阵列。
Bioinformatics. 2006 Jul 15;22(14):1682-9. doi: 10.1093/bioinformatics/btl183. Epub 2006 May 16.

引用本文的文献

1
Robust tests for multivariate repeated measures with small samples.小样本多元重复测量的稳健检验。
J Appl Stat. 2022 Nov 13;51(3):555-580. doi: 10.1080/02664763.2022.2142537. eCollection 2024.
2
Immune activation underlies a sustained clinical response to Yttrium-90 radioembolisation in hepatocellular carcinoma.免疫激活是钇-90 放射性栓塞治疗肝细胞癌持续临床应答的基础。
Gut. 2019 Feb;68(2):335-346. doi: 10.1136/gutjnl-2017-315485. Epub 2018 Feb 13.
3
POLYPHEMUS: R package for comparative analysis of RNA polymerase II ChIP-seq profiles by non-linear normalization.
多利弗莫斯:用于通过非线性归一化比较 RNA 聚合酶 II ChIP-seq 图谱的 R 包。
Nucleic Acids Res. 2012 Feb;40(4):e30. doi: 10.1093/nar/gkr1205. Epub 2011 Dec 7.
4
Expression analysis of flavonoid biosynthesis genes during Arabidopsis thaliana silique and seed development with a primary focus on the proanthocyanidin biosynthetic pathway.拟南芥角果和种子发育过程中类黄酮生物合成基因的表达分析,主要聚焦于原花青素生物合成途径。
BMC Res Notes. 2010 Oct 7;3:255. doi: 10.1186/1756-0500-3-255.
5
A comparison of probe-level and probeset models for small-sample gene expression data.探针水平和探针集模型在小样本基因表达数据中的比较。
BMC Bioinformatics. 2010 May 26;11:281. doi: 10.1186/1471-2105-11-281.
6
Ranking differentially expressed genes from Affymetrix gene expression data: methods with reproducibility, sensitivity, and specificity.对Affymetrix基因表达数据中的差异表达基因进行排名:具有可重复性、敏感性和特异性的方法。
Algorithms Mol Biol. 2009 Apr 22;4:7. doi: 10.1186/1748-7188-4-7.