
Multivariate hierarchical Bayesian model for differential gene expression analysis in microarray experiments.

Authors

Zhao Hongya, Chan Kwok-Leung, Cheng Lee-Ming, Yan Hong

Affiliation

Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong.

Publication

BMC Bioinformatics. 2008;9 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-9-S1-S9.

DOI: 10.1186/1471-2105-9-S1-S9
PMID: 18315862
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2259410/
Abstract

BACKGROUND

Identification of differentially expressed genes is a typical objective when analyzing gene expression data. Recently, Bayesian hierarchical models have become increasingly popular for this type of problem. These models perform well in accommodating the noise, variability and low replication of microarray data. However, they ignore the correlation between the different fluorescent signals measured from a gene spot, which can adversely affect the data analysis step. In fact, the intensities of the two signals are significantly correlated across samples, and the larger the log-transformed intensities are, the smaller the correlation is.

RESULTS

Motivated by the complicated error relations in microarray data, we propose a multivariate hierarchical Bayesian framework for data analysis in replicated microarray experiments. Gene expression data are modelled by a multivariate normal distribution, parameterized by the corresponding mean vectors and covariance matrices, with a conjugate prior distribution. Within the Bayesian framework, a generalized likelihood ratio test (GLRT) is also developed to infer the gene expression patterns. Simulation studies show that the proposed approach has better operating characteristics and a lower false discovery rate (FDR) than existing methods, especially when the correlation coefficient is large. The approach is illustrated with two examples of microarray analysis. The proposed method successfully detects significant genes closely related to the experimental states, as verified by biological information.
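
As a rough illustration of the conjugate updating step mentioned above, the following sketch assumes the prior on the mean vector and covariance matrix is normal-inverse-Wishart, the standard conjugate prior for a multivariate normal. The abstract does not name the prior or its hyperparameters, so the function `niw_posterior`, the parameters `mu0`, `kappa0`, `nu0`, `Psi0`, and the toy data are all hypothetical. This is not the authors' implementation, and it does not include the GLRT.

```python
import numpy as np

def niw_posterior(X, mu0, kappa0, nu0, Psi0):
    """Posterior hyperparameters of a normal-inverse-Wishart prior.

    X    : (n, d) array, one row per replicate (assumed layout)
    mu0  : prior mean vector, shape (d,)
    kappa0, nu0 : prior pseudo-counts for the mean and the covariance
    Psi0 : prior scale matrix, shape (d, d)
    """
    X = np.asarray(X, dtype=float)
    n, _ = X.shape
    xbar = X.mean(axis=0)
    centered = X - xbar
    S = centered.T @ centered                      # scatter matrix about the sample mean
    kappa_n = kappa0 + n
    nu_n = nu0 + n
    mu_n = (kappa0 * mu0 + n * xbar) / kappa_n     # shrinks the sample mean toward mu0
    diff = (xbar - mu0).reshape(-1, 1)
    Psi_n = Psi0 + S + (kappa0 * n / kappa_n) * (diff @ diff.T)
    return mu_n, kappa_n, nu_n, Psi_n

# Hypothetical log-intensities for one gene: rows are replicates,
# columns are the two fluorescent channels.
X = np.array([[1.0, 2.0], [3.0, 4.0]])
mu_n, kappa_n, nu_n, Psi_n = niw_posterior(
    X, mu0=np.zeros(2), kappa0=1.0, nu0=4.0, Psi0=np.eye(2)
)

# Posterior mean of the covariance matrix (defined when nu_n > d + 1).
Sigma_mean = Psi_n / (nu_n - X.shape[1] - 1)
```

The off-diagonal entry of `Sigma_mean` carries the between-channel covariance that a univariate model would leave out.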

CONCLUSIONS

The multivariate Bayesian model, compatible with the dependence between mean and variance in the univariate Bayesian model, relaxes the constant coefficient of variation assumption between measurements by adding a covariance structure. Because it fits microarray data well, the model significantly improves the identification of differentially expressed genes.


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddda/2259410/fe9d216ea206/1471-2105-9-S1-S9-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddda/2259410/35ccba4c74ca/1471-2105-9-S1-S9-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddda/2259410/4e1d340f781a/1471-2105-9-S1-S9-2.jpg
