• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

均值和协方差模型的变量选择与联合估计及其在eQTL数据中的应用

Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data.

作者信息

Lee JungJun, Kim SungHwan, Jhong Jae-Hwan, Koo Ja-Yong

机构信息

Department of Statistics, Korea University, Seoul 02841, Republic of Korea.

Department of Applied Statistics, Konkuk University, Seoul 05029, Republic of Korea.

出版信息

Comput Math Methods Med. 2018 Jun 25;2018:4626307. doi: 10.1155/2018/4626307. eCollection 2018.

DOI:10.1155/2018/4626307
PMID:30046352
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6036858/
Abstract

In genomic data analysis, it is commonplace that underlying regulatory relationship over multiple genes is hardly ascertained due to unknown genetic complexity and epigenetic regulations. In this paper, we consider a joint mean and constant covariance model (JMCCM) that elucidates conditional dependent structures of genes with controlling for potential genotype perturbations. To this end, the modified Cholesky decomposition is utilized to parametrize entries of a precision matrix. The JMCCM maximizes the likelihood function to estimate parameters involved in the model. We also develop a variable selection algorithm that selects explanatory variables and Cholesky factors by exploiting the combination of the GCV and BIC as benchmarks, together with Rao and Wald statistics. Importantly, we notice that sparse estimation of a precision matrix (or equivalently gene network) is effectively achieved via the proposed variable selection scheme and contributes to exploring significant hub genes shown to be concordant to biological evidence. In simulation studies, we confirm that our model selection efficiently identifies the true underlying networks. With an application to miRNA and SNPs data from yeast (a.k.a. eQTL data), we demonstrate that constructed gene networks reproduce validated biological and clinical knowledge with regard to various pathways including the cell cycle pathway.

摘要

在基因组数据分析中,由于未知的遗传复杂性和表观遗传调控,很难确定多个基因之间潜在的调控关系,这是很常见的情况。在本文中,我们考虑了一种联合均值和常数协方差模型(JMCCM),该模型通过控制潜在的基因型扰动来阐明基因的条件依赖结构。为此,利用修正的Cholesky分解对精度矩阵的元素进行参数化。JMCCM通过最大化似然函数来估计模型中涉及的参数。我们还开发了一种变量选择算法,该算法通过利用GCV和BIC的组合作为基准,结合Rao和Wald统计量来选择解释变量和Cholesky因子。重要的是,我们注意到通过所提出的变量选择方案可以有效地实现精度矩阵(或等效的基因网络)的稀疏估计,这有助于探索与生物学证据一致的重要枢纽基因。在模拟研究中,我们证实我们的模型选择能够有效地识别真实的潜在网络。通过将其应用于酵母的miRNA和SNP数据(也称为eQTL数据),我们证明构建的基因网络能够重现关于包括细胞周期途径在内的各种途径的经过验证的生物学和临床知识。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3534/6036858/4f9658660f9c/CMMM2018-4626307.alg.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3534/6036858/bb75e16c606a/CMMM2018-4626307.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3534/6036858/ba36d381b2c5/CMMM2018-4626307.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3534/6036858/ea121f0f0e2b/CMMM2018-4626307.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3534/6036858/1b2595f00394/CMMM2018-4626307.alg.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3534/6036858/4f9658660f9c/CMMM2018-4626307.alg.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3534/6036858/bb75e16c606a/CMMM2018-4626307.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3534/6036858/ba36d381b2c5/CMMM2018-4626307.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3534/6036858/ea121f0f0e2b/CMMM2018-4626307.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3534/6036858/1b2595f00394/CMMM2018-4626307.alg.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3534/6036858/4f9658660f9c/CMMM2018-4626307.alg.003.jpg

相似文献

1
Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data.均值和协方差模型的变量选择与联合估计及其在eQTL数据中的应用
Comput Math Methods Med. 2018 Jun 25;2018:4626307. doi: 10.1155/2018/4626307. eCollection 2018.
2
Learning gene networks under SNP perturbations using eQTL datasets.利用eQTL数据集在SNP扰动下学习基因网络。
PLoS Comput Biol. 2014 Feb 27;10(2):e1003420. doi: 10.1371/journal.pcbi.1003420. eCollection 2014 Feb.
3
Mapping eQTL networks with mixed graphical Markov models.使用混合图形马尔可夫模型绘制表达数量性状基因座网络
Genetics. 2014 Dec;198(4):1377-93. doi: 10.1534/genetics.114.169573. Epub 2014 Sep 29.
4
Joint high-dimensional Bayesian variable and covariance selection with an application to eQTL analysis.联合高维贝叶斯变量与协方差选择及其在表达数量性状位点分析中的应用
Biometrics. 2013 Jun;69(2):447-57. doi: 10.1111/biom.12021. Epub 2013 Apr 22.
5
Fast and robust group-wise eQTL mapping using sparse graphical models.使用稀疏图形模型进行快速且稳健的分组基因表达定量性状位点定位
BMC Bioinformatics. 2015 Jan 16;16:2. doi: 10.1186/s12859-014-0421-z.
6
A Cholesky-based sparse covariance estimation with an application to genes data.基于 Cholesky 的稀疏协方差估计及其在基因数据中的应用。
J Biopharm Stat. 2021 Sep 3;31(5):603-616. doi: 10.1080/10543406.2021.1931270. Epub 2021 May 29.
7
A systems biology approach for identifying novel pathway regulators in eQTL mapping.一种用于在eQTL定位中识别新型通路调节因子的系统生物学方法。
J Biopharm Stat. 2010 Mar;20(2):373-400. doi: 10.1080/10543400903572803.
8
Model selection for factorial Gaussian graphical models with an application to dynamic regulatory networks.
Stat Appl Genet Mol Biol. 2016 Jun 1;15(3):193-212. doi: 10.1515/sagmb-2014-0075.
9
Inferring molecular interactions pathways from eQTL data.从表达数量性状基因座(eQTL)数据推断分子相互作用途径。
Methods Mol Biol. 2009;541:211-23. doi: 10.1007/978-1-59745-243-4_10.
10
Causal inference of regulator-target pairs by gene mapping of expression phenotypes.通过表达表型的基因定位对调控因子-靶标对进行因果推断。
BMC Genomics. 2006 May 24;7:125. doi: 10.1186/1471-2164-7-125.

引用本文的文献

1
Modelling speciation: Problems and implications.物种形成建模:问题与影响。
In Silico Biol. 2023;15(1-2):23-42. doi: 10.3233/ISB-220253.

本文引用的文献

1
Covariate-Adjusted Precision Matrix Estimation with an Application in Genetical Genomics.协变量调整的精度矩阵估计及其在遗传基因组学中的应用
Biometrika. 2013 Mar;100(1):139-156. doi: 10.1093/biomet/ass058. Epub 2012 Nov 30.
2
Learning gene networks under SNP perturbations using eQTL datasets.利用eQTL数据集在SNP扰动下学习基因网络。
PLoS Comput Biol. 2014 Feb 27;10(2):e1003420. doi: 10.1371/journal.pcbi.1003420. eCollection 2014 Feb.
3
Sparse Estimation of Conditional Graphical Models With Application to Gene Networks.条件图形模型的稀疏估计及其在基因网络中的应用
J Am Stat Assoc. 2012 Jan 1;107(497):152-167. doi: 10.1080/01621459.2011.644498.
4
Joint conditional Gaussian graphical models with multiple sources of genomic data.具有多种基因组数据源的联合条件高斯图形模型。
Front Genet. 2013 Dec 17;4:294. doi: 10.3389/fgene.2013.00294. eCollection 2013.
5
A SPARSE CONDITIONAL GAUSSIAN GRAPHICAL MODEL FOR ANALYSIS OF GENETICAL GENOMICS DATA.一种用于遗传基因组数据分析的稀疏条件高斯图形模型。
Ann Appl Stat. 2011 Dec;5(4):2630-2650. doi: 10.1214/11-AOAS494.
6
Sic1 as a timer of Clb cyclin waves in the yeast cell cycle--design principle of not just an inhibitor.Sic1 作为酵母细胞周期中 Clb 周期蛋白波的定时器——不仅仅是抑制剂的设计原理。
FEBS J. 2012 Sep;279(18):3386-410. doi: 10.1111/j.1742-4658.2012.08542.x. Epub 2012 Mar 21.
7
A global protein kinase and phosphatase interaction network in yeast.酵母中全局的蛋白激酶和磷酸酶相互作用网络。
Science. 2010 May 21;328(5981):1043-6. doi: 10.1126/science.1176495.
8
KEGG for representation and analysis of molecular networks involving diseases and drugs.KEGG 用于表示和分析涉及疾病和药物的分子网络。
Nucleic Acids Res. 2010 Jan;38(Database issue):D355-60. doi: 10.1093/nar/gkp896. Epub 2009 Oct 30.
9
Integrating proteomic, transcriptional, and interactome data reveals hidden components of signaling and regulatory networks.整合蛋白质组学、转录组学和相互作用组数据可揭示信号传导和调控网络的隐藏组成部分。
Sci Signal. 2009 Jul 28;2(81):ra40. doi: 10.1126/scisignal.2000350.
10
Bridging high-throughput genetic and transcriptional data reveals cellular responses to alpha-synuclein toxicity.整合高通量遗传数据和转录数据揭示细胞对α-突触核蛋白毒性的反应。
Nat Genet. 2009 Mar;41(3):316-23. doi: 10.1038/ng.337. Epub 2009 Feb 22.