• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在经验贝叶斯方法中纳入生物信息作为先验信息来分析微阵列数据。

Incorporating biological information as a prior in an empirical bayes approach to analyzing microarray data.

作者信息

Pan Wei

机构信息

University of Minnesota, USA.

出版信息

Stat Appl Genet Mol Biol. 2005;4:Article12. doi: 10.2202/1544-6115.1124. Epub 2005 May 25.

DOI:10.2202/1544-6115.1124
PMID:16646829
Abstract

Currently the practice of using existing biological knowledge in analyzing high throughput genomic and proteomic data is mainly for the purpose of validations. Here we take a different approach of incorporating biological knowledge into statistical analysis to improve statistical power and efficiency. Specifically, we consider how to fuse biological information into a mixture model to analyze microarray data. In contrast to a standard mixture model where it is assumed that all the genes come from the same (marginal) distribution, including an equal prior probability of having an event, such as having differential expression or being bound by a transcription factor (TF), our proposed mixture model allows the genes in different groups to have different distributions while the grouping of the genes reflects biological information. Using a list of about 800 putative cell cycle-regulated genes as prior biological knowledge, we analyze a genome-wide location data to detect binding sites of TF Fkh1. We find that our proposal improves over the standard approach, resulting in reduced false discovery rates (FDR), and hence it is a useful alternative to the current practice.

摘要

目前,在分析高通量基因组和蛋白质组数据时运用现有生物学知识的做法主要是为了进行验证。在此,我们采用一种不同的方法,即将生物学知识纳入统计分析,以提高统计效力和效率。具体而言,我们考虑如何将生物学信息融入混合模型来分析微阵列数据。与标准混合模型不同,标准混合模型假定所有基因都来自相同的(边际)分布,包括发生某事件(如具有差异表达或被转录因子(TF)结合)的相等先验概率,而我们提出的混合模型允许不同组中的基因具有不同分布,同时基因的分组反映生物学信息。利用一份约800个推定的细胞周期调控基因列表作为先验生物学知识,我们分析了全基因组定位数据以检测TF Fkh1的结合位点。我们发现我们的方法比标准方法有所改进,从而降低了错误发现率(FDR),因此它是当前做法的一种有用替代方法。

相似文献

1
Incorporating biological information as a prior in an empirical bayes approach to analyzing microarray data.在经验贝叶斯方法中纳入生物信息作为先验信息来分析微阵列数据。
Stat Appl Genet Mol Biol. 2005;4:Article12. doi: 10.2202/1544-6115.1124. Epub 2005 May 25.
2
Incorporating prior information via shrinkage: a combined analysis of genome-wide location data and gene expression data.通过收缩法纳入先验信息:全基因组定位数据与基因表达数据的联合分析
Stat Med. 2007 May 10;26(10):2258-75. doi: 10.1002/sim.2703.
3
Functional genomics and proteomics in the clinical neurosciences: data mining and bioinformatics.临床神经科学中的功能基因组学和蛋白质组学:数据挖掘与生物信息学
Prog Brain Res. 2006;158:83-108. doi: 10.1016/S0079-6123(06)58004-5.
4
Penalized and weighted K-means for clustering with scattered objects and prior information in high-throughput biological data.用于高通量生物数据中具有分散对象和先验信息的聚类的惩罚加权K均值算法
Bioinformatics. 2007 Sep 1;23(17):2247-55. doi: 10.1093/bioinformatics/btm320. Epub 2007 Jun 27.
5
Reconstructing gene regulatory networks with bayesian networks by combining expression data with multiple sources of prior knowledge.通过将表达数据与多种先验知识来源相结合,利用贝叶斯网络重建基因调控网络。
Stat Appl Genet Mol Biol. 2007;6:Article15. doi: 10.2202/1544-6115.1282. Epub 2007 May 29.
6
Empirical bayes estimation of a sparse vector of gene expression changes.
Stat Appl Genet Mol Biol. 2005;4:Article22. doi: 10.2202/1544-6115.1132. Epub 2005 Sep 6.
7
Genome-wide co-expression based prediction of differential expressions.基于全基因组共表达的差异表达预测。
Bioinformatics. 2008 Mar 1;24(5):666-73. doi: 10.1093/bioinformatics/btm507. Epub 2007 Nov 15.
8
Statistical methods in integrative analysis for gene regulatory modules.基因调控模块综合分析中的统计方法
Stat Appl Genet Mol Biol. 2008;7(1):Article 28. doi: 10.2202/1544-6115.1369. Epub 2008 Oct 10.
9
Identification of differentially expressed genes and false discovery rate in microarray studies.微阵列研究中差异表达基因的鉴定及错误发现率
Curr Opin Lipidol. 2007 Apr;18(2):187-93. doi: 10.1097/MOL.0b013e3280895d6f.
10
An equilibrium partitioning model connecting gene expression and cis-motif content.一个连接基因表达与顺式基序含量的平衡分配模型。
Bioinformatics. 2006 Jul 15;22(14):e368-74. doi: 10.1093/bioinformatics/btl253.

引用本文的文献

1
Novel application of multi-stimuli network inference to synovial fibroblasts of rheumatoid arthritis patients.新型多刺激网络推断在类风湿关节炎患者滑膜成纤维细胞中的应用。
BMC Med Genomics. 2014 Jul 3;7:40. doi: 10.1186/1755-8794-7-40.
2
Batch correction of microarray data substantially improves the identification of genes differentially expressed in rheumatoid arthritis and osteoarthritis.批量校正微阵列数据可显著提高类风湿关节炎和骨关节炎中差异表达基因的识别。
BMC Med Genomics. 2012 Jun 8;5:23. doi: 10.1186/1755-8794-5-23.
3
Use of pathway information in molecular epidemiology.
在分子流行病学中使用途径信息。
Hum Genomics. 2009 Oct;4(1):21-42. doi: 10.1186/1479-7364-4-1-21.
4
Statistical methods for integrating multiple types of high-throughput data.整合多种类型高通量数据的统计方法。
Methods Mol Biol. 2010;620:511-29. doi: 10.1007/978-1-60761-580-4_19.
5
Methodological Issues in Multistage Genome-wide Association Studies.多阶段全基因组关联研究中的方法学问题
Stat Sci. 2009 Nov 1;24(4):414-429. doi: 10.1214/09-sts288.
6
A novel method incorporating gene ontology information for unsupervised clustering and feature selection.一种结合基因本体信息用于无监督聚类和特征选择的新方法。
PLoS One. 2008;3(12):e3860. doi: 10.1371/journal.pone.0003860. Epub 2008 Dec 4.
7
Global analysis of exon creation versus loss and the role of alternative splicing in 17 vertebrate genomes.17种脊椎动物基因组中外显子产生与丢失的全局分析及可变剪接的作用
RNA. 2007 May;13(5):661-70. doi: 10.1261/rna.325107. Epub 2007 Mar 16.