• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过收缩法纳入先验信息:全基因组定位数据与基因表达数据的联合分析

Incorporating prior information via shrinkage: a combined analysis of genome-wide location data and gene expression data.

作者信息

Xie Yang, Pan Wei, Jeong Kyeong S, Khodursky Arkady

机构信息

Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA.

出版信息

Stat Med. 2007 May 10;26(10):2258-75. doi: 10.1002/sim.2703.

DOI:10.1002/sim.2703
PMID:16958153
Abstract

Transcriptional control is a critical step in regulation of gene expression. Understanding such a control on a genomic level involves deciphering the mechanisms and structures of regulatory programmes and networks. A difficulty arises due to the weak signal and high noise in various sources of data while most current approaches are limited to analysis of a single source of data. A natural alternative is to improve statistical efficiency and power by a combined analysis of multiple sources of data. Here we propose a shrinkage method to combine genome-wide location data and gene expression data to detect the binding sites or target genes of a transcription factor. Specifically, a prior 'non-target' gene list is generated by analysing the expression data, and then this information is incorporated into the subsequent binding data analysis via a shrinkage method. There is a Bayesian justification for this shrinkage method. Both simulated and real data were used to evaluate the proposed method and compare it with analysing binding data alone. In simulation studies, the proposed method gives higher sensitivity and lower false discovery rate (FDR) in detecting the target genes. In real data example, the proposed method can reduce the estimated FDR and increase the power to detect the previously known target genes of a broad transcription regulator, leucine responsive regulatory protein (Lrp) in Escherichia coli. This method can also be used to incorporate other information, such as gene ontology (GO), to microarray data analysis to detect differentially expressed genes.

摘要

转录调控是基因表达调控中的关键步骤。在基因组水平上理解这种调控涉及解读调控程序和网络的机制与结构。由于各种数据来源中信号微弱且噪声高,同时大多数现有方法局限于单一数据源的分析,因此出现了困难。一种自然的替代方法是通过对多个数据源进行联合分析来提高统计效率和功效。在此,我们提出一种收缩方法,将全基因组定位数据和基因表达数据相结合,以检测转录因子的结合位点或靶基因。具体而言,通过分析表达数据生成一个先验的“非靶标”基因列表,然后通过收缩方法将此信息纳入后续的结合数据分析中。这种收缩方法有贝叶斯理论依据。使用模拟数据和真实数据来评估所提出的方法,并将其与单独分析结合数据进行比较。在模拟研究中,所提出的方法在检测靶基因时具有更高的灵敏度和更低的错误发现率(FDR)。在真实数据示例中,所提出的方法可以降低估计的FDR,并提高检测大肠杆菌中广泛转录调节因子亮氨酸响应调节蛋白(Lrp)先前已知靶基因的能力。该方法还可用于将其他信息(如基因本体论(GO))纳入微阵列数据分析,以检测差异表达基因。

相似文献

1
Incorporating prior information via shrinkage: a combined analysis of genome-wide location data and gene expression data.通过收缩法纳入先验信息:全基因组定位数据与基因表达数据的联合分析
Stat Med. 2007 May 10;26(10):2258-75. doi: 10.1002/sim.2703.
2
Incorporating biological information as a prior in an empirical bayes approach to analyzing microarray data.在经验贝叶斯方法中纳入生物信息作为先验信息来分析微阵列数据。
Stat Appl Genet Mol Biol. 2005;4:Article12. doi: 10.2202/1544-6115.1124. Epub 2005 May 25.
3
Comparison of false discovery rate methods in identifying genes with differential expression.在识别差异表达基因方面错误发现率方法的比较。
Genomics. 2005 Oct;86(4):495-503. doi: 10.1016/j.ygeno.2005.06.007.
4
Consensus and Meta-analysis regulatory networks for combining multiple microarray gene expression datasets.用于整合多个微阵列基因表达数据集的共识与荟萃分析调控网络。
J Biomed Inform. 2008 Dec;41(6):914-26. doi: 10.1016/j.jbi.2008.01.011. Epub 2008 Feb 6.
5
Reconstructing gene regulatory networks with bayesian networks by combining expression data with multiple sources of prior knowledge.通过将表达数据与多种先验知识来源相结合,利用贝叶斯网络重建基因调控网络。
Stat Appl Genet Mol Biol. 2007;6:Article15. doi: 10.2202/1544-6115.1282. Epub 2007 May 29.
6
Statistical methods in integrative analysis for gene regulatory modules.基因调控模块综合分析中的统计方法
Stat Appl Genet Mol Biol. 2008;7(1):Article 28. doi: 10.2202/1544-6115.1369. Epub 2008 Oct 10.
7
A Gibbs sampler for the identification of gene expression and network connectivity consistency.一种用于识别基因表达和网络连通性一致性的吉布斯采样器。
Bioinformatics. 2006 Dec 15;22(24):3040-6. doi: 10.1093/bioinformatics/btl541. Epub 2006 Oct 23.
8
Analysis of gene networks for drug target discovery and validation.用于药物靶点发现与验证的基因网络分析。
Methods Mol Biol. 2007;360:33-56. doi: 10.1385/1-59745-165-7:33.
9
A Bayesian determination of threshold for identifying differentially expressed genes in microarray experiments.微阵列实验中用于鉴定差异表达基因的阈值的贝叶斯确定方法。
Stat Med. 2006 Sep 30;25(18):3174-89. doi: 10.1002/sim.2422.
10
Genome-wide co-expression based prediction of differential expressions.基于全基因组共表达的差异表达预测。
Bioinformatics. 2008 Mar 1;24(5):666-73. doi: 10.1093/bioinformatics/btm507. Epub 2007 Nov 15.

引用本文的文献

1
Use of pathway information in molecular epidemiology.在分子流行病学中使用途径信息。
Hum Genomics. 2009 Oct;4(1):21-42. doi: 10.1186/1479-7364-4-1-21.
2
Statistical methods for integrating multiple types of high-throughput data.整合多种类型高通量数据的统计方法。
Methods Mol Biol. 2010;620:511-29. doi: 10.1007/978-1-60761-580-4_19.
3
A Bayesian approach to joint modeling of protein-DNA binding, gene expression and sequence data.一种联合建模蛋白质-DNA 结合、基因表达和序列数据的贝叶斯方法。
Stat Med. 2010 Feb 20;29(4):489-503. doi: 10.1002/sim.3815.
4
Comparison of linear discriminant analysis methods for the classification of cancer based on gene expression data.基于基因表达数据的癌症分类的线性判别分析方法比较。
J Exp Clin Cancer Res. 2009 Dec 10;28(1):149. doi: 10.1186/1756-9966-28-149.