Suppr超能文献

贝叶斯综合表达数据分析模型:RhoG 案例研究

Bayesian integrated modeling of expression data: a case study on RhoG.

机构信息

Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland.

出版信息

BMC Bioinformatics. 2010 Jun 1;11:295. doi: 10.1186/1471-2105-11-295.

Abstract

BACKGROUND

DNA microarrays provide an efficient method for measuring activity of genes in parallel and even covering all the known transcripts of an organism on a single array. This has to be balanced against that analyzing data emerging from microarrays involves several consecutive steps, and each of them is a potential source of errors. Errors tend to accumulate when moving from the lower level towards the higher level analyses because of the sequential nature. Eliminating such errors does not seem feasible without completely changing the technologies, but one should nevertheless try to meet the goal of being able to realistically assess degree of the uncertainties that are involved when drawing the final conclusions from such analyses.

RESULTS

We present a Bayesian hierarchical model for finding differentially expressed genes between two experimental conditions, proposing an integrated statistical approach where correcting signal saturation, systematic array effects, dye effects, and finding differentially expressed genes, are all modeled jointly. The integration allows all these components, and also the associated errors, to be considered simultaneously. The inference is based on full posterior distribution of gene expression indices and on quantities derived from them rather than on point estimates. The model was applied and tested on two different datasets.

CONCLUSIONS

The method presents a way of integrating various steps of microarray analysis into a single joint analysis, and thereby enables extracting information on differential expression in a manner, which properly accounts for various sources of potential error in the process.

摘要

背景

DNA 微阵列提供了一种高效的方法来并行测量基因的活性,甚至可以在单个阵列上覆盖生物体的所有已知转录本。这必须与分析微阵列中出现的数据相平衡,因为分析涉及几个连续的步骤,每个步骤都是潜在误差的来源。由于顺序性质,从较低水平向较高水平分析时,误差往往会累积。如果不彻底改变技术,似乎不可能消除这些错误,但人们仍然应该努力实现能够真实评估从这些分析中得出最终结论时所涉及的不确定性程度的目标。

结果

我们提出了一种用于在两种实验条件之间找到差异表达基因的贝叶斯层次模型,提出了一种集成的统计方法,其中信号饱和、系统阵列效应、染料效应以及寻找差异表达基因的校正都被联合建模。这种集成允许同时考虑所有这些组件以及相关的误差。推理基于基因表达指数的完整后验分布,以及从中得出的数量,而不是基于点估计。该模型已应用于两个不同的数据集并进行了测试。

结论

该方法提供了一种将微阵列分析的各个步骤集成到单个联合分析中的方法,从而能够以适当考虑到该过程中各种潜在误差源的方式提取差异表达信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd6c/2894040/9123faf20417/1471-2105-11-295-1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验