Bayesian structural equation modeling in multiple omics data with application to circadian genes.

Author information

Early Clinical Development Oncology Statistics, Pfizer Inc., San Diego, CA 92121, USA.

Department of Statistics.

Publication information

Bioinformatics. 2020 Jul 1;36(13):3951-3958. doi: 10.1093/bioinformatics/btaa286.

Abstract

MOTIVATION

Integrating different data sources is valuable because it can reveal functional roles of genomic expression that remain hidden in a single-source analysis, and several studies have shown that analyses of multi-platform data are more powerful. To this end, we consider the omics profiles of circadian genes, namely copy number changes and RNA-sequencing data, together with the subjects' survival response. We develop a Bayesian structural equation model, coupled with linear regressions and a log-normal accelerated failure-time regression, that integrates information across these two platforms to predict subject survival. We place conjugate priors on the regression parameters and derive a Gibbs sampler from their conditional distributions.
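As a rough illustration of the sampling idea described above, the following Python sketch runs a Gibbs sampler on a deliberately simplified two-equation model. It is not the authors' model or the semmcmc implementation. It assumes one gene, no censoring, no intercept in the expression equation, normal priors on the coefficients, and inverse-gamma priors on the noise variances. The function names (`sample_coef`, `sample_variance`, `gibbs`) and all default hyperparameters are hypothetical.

```python
import math
import random


def sample_coef(rng, x, r, sigma2, prior_var):
    """Draw scalar b from its conditional posterior in r_i = b * x_i + noise,
    with noise ~ N(0, sigma2) and a conjugate prior b ~ N(0, prior_var)."""
    precision = sum(xi * xi for xi in x) / sigma2 + 1.0 / prior_var
    mean = (sum(xi * ri for xi, ri in zip(x, r)) / sigma2) / precision
    return rng.gauss(mean, math.sqrt(1.0 / precision))


def sample_variance(rng, resid, a0=2.0, b0=1.0):
    """Draw a noise variance from its inverse-gamma conditional posterior,
    given residuals and an InvGamma(a0, b0) prior (assumed hyperparameters)."""
    shape = a0 + len(resid) / 2.0
    rate = b0 + sum(e * e for e in resid) / 2.0
    # If G ~ Gamma(shape, scale=1/rate), then 1/G ~ InvGamma(shape, rate).
    return 1.0 / rng.gammavariate(shape, 1.0 / rate)


def gibbs(cn, expr, log_t, n_iter=2000, burn_in=500, prior_var=100.0, seed=0):
    """Toy sampler for the assumed model
         expr_i  = alpha * cn_i + e_i            (copy number -> expression)
         log_t_i = b0 + b1 * expr_i + eps_i      (log-normal AFT, uncensored)
    Returns posterior means of (alpha, b0, b1)."""
    rng = random.Random(seed)
    ones = [1.0] * len(cn)
    alpha, b0, b1, s2_m, s2_t = 0.0, 0.0, 0.0, 1.0, 1.0
    draws = []
    for it in range(n_iter):
        # Expression equation: update the slope, then the noise variance.
        alpha = sample_coef(rng, cn, expr, s2_m, prior_var)
        s2_m = sample_variance(rng, [m - alpha * c for m, c in zip(expr, cn)])
        # Survival equation: update each coefficient given the other.
        b0 = sample_coef(rng, ones, [t - b1 * m for t, m in zip(log_t, expr)],
                         s2_t, prior_var)
        b1 = sample_coef(rng, expr, [t - b0 for t in log_t], s2_t, prior_var)
        s2_t = sample_variance(
            rng, [t - b0 - b1 * m for t, m in zip(log_t, expr)])
        if it >= burn_in:
            draws.append((alpha, b0, b1))
    k = len(draws)
    return tuple(sum(d[j] for d in draws) / k for j in range(3))
```

Calling `gibbs(cn, expr, log_t)` on three equal-length lists of floats returns the posterior means of the three coefficients. Because expression is fully observed in this toy version, the two equations could be fitted separately. A model with censored survival times or latent components, as a real analysis would likely need, would require additional sampling steps that are not shown here.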

RESULTS

Our extensive simulation study shows that the integrative model fits the data better than its closest competitor. Analyses of glioblastoma and breast cancer data from The Cancer Genome Atlas (TCGA), the largest genomics and transcriptomics database, support our findings.

AVAILABILITY AND IMPLEMENTATION

The method is implemented in an R package available at https://github.com/MAITYA02/semmcmc.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
