
Network analysis for count data with excess zeros.

Authors

Choi Hosik, Gim Jungsoo, Won Sungho, Kim You Jin, Kwon Sunghoon, Park Changyi

Affiliations

Department of Applied Statistics, Kyonggi University, Suwon, 16227, Korea.

Institute of Health and Environment, Seoul National University, Seoul, 08826, Korea.

Publication

BMC Genet. 2017 Nov 6;18(1):93. doi: 10.1186/s12863-017-0561-z.

PMID: 29110633
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5674822/
Abstract

BACKGROUND

Undirected graphical models, or Markov random fields, are a popular class of models for representing conditional dependence relationships between nodes. In particular, Markov networks help us understand complex interactions between genes in the biological processes of a cell. Local Poisson models seem promising for modeling both positive and negative dependencies in count data. Furthermore, when zero counts are more frequent than expected, excess zeros should be accounted for in the model.
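
To make "excess zeros" concrete, the standard zero-inflated Poisson (ZIP) distribution can be written as below. This is the textbook form, not necessarily the exact parameterization used in the paper; the symbols \pi (probability of a structural zero) and \lambda (Poisson mean) are assumed notation.

```latex
P(Y = y) =
\begin{cases}
\pi + (1-\pi)\, e^{-\lambda}, & y = 0,\\[4pt]
(1-\pi)\, \dfrac{e^{-\lambda}\lambda^{y}}{y!}, & y = 1, 2, \dots
\end{cases}
```

For example, with \lambda = 2 a plain Poisson model gives P(Y = 0) = e^{-2} \approx 0.135, whereas a ZIP model with \pi = 0.3 gives 0.3 + 0.7 \times 0.135 \approx 0.395.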

METHODS

We present a penalized Poisson graphical model for zero-inflated count data and derive an expectation-maximization (EM) algorithm built on coordinate descent. The method is shown to be effective through analyses of simulated and real data.
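
The general idea of an EM algorithm built on coordinate descent can be sketched for a single node-wise regression as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes a constant zero-inflation probability `pi`, a log link, an L1 penalty, and an inner loop of iteratively reweighted least squares solved by coordinate descent. The function name `zip_em_lasso`, its parameters, and the simulated data are all hypothetical.

```python
import numpy as np

def soft_threshold(a, lam):
    """Soft-thresholding operator used by the L1 coordinate update."""
    return np.sign(a) * max(abs(a) - lam, 0.0)

def zip_em_lasso(X, y, lam=0.1, n_em=30, n_cd=3):
    """EM sketch for one L1-penalized zero-inflated Poisson regression.

    Assumes a constant structural-zero probability `pi` (an
    illustrative simplification, not taken from the paper).
    """
    n, p = X.shape
    beta0, beta, pi = np.log(y.mean() + 1e-8), np.zeros(p), 0.5
    for _ in range(n_em):
        mu = np.exp(np.clip(beta0 + X @ beta, -20, 20))
        # E-step: posterior probability that an observed zero is structural
        z = np.where(y == 0, pi / (pi + (1 - pi) * np.exp(-mu)), 0.0)
        # M-step for pi: average posterior membership
        pi = float(np.clip(z.mean(), 1e-6, 1 - 1e-6))
        # M-step for beta: weighted Poisson fit via IRLS + coordinate descent
        for _ in range(n_cd):
            eta = np.clip(beta0 + X @ beta, -20, 20)
            mu = np.exp(eta)
            w = (1 - z) * mu              # IRLS weights
            work = eta + (y - mu) / mu    # IRLS working response
            # Unpenalized intercept update
            beta0 = np.sum(w * (work - X @ beta)) / np.sum(w)
            for j in range(p):
                # Partial residual that excludes coordinate j
                r_j = work - beta0 - X @ beta + X[:, j] * beta[j]
                num = np.sum(w * X[:, j] * r_j) / n
                den = np.sum(w * X[:, j] ** 2) / n
                beta[j] = soft_threshold(num, lam) / den if den > 0 else 0.0
    return beta0, beta, pi

# Simulated example: only the first predictor affects the count
rng = np.random.default_rng(0)
n, p = 300, 5
X = rng.standard_normal((n, p))
true_beta = np.array([0.8, 0.0, 0.0, 0.0, 0.0])
counts = rng.poisson(np.exp(1.0 + X @ true_beta))
structural = rng.random(n) < 0.3          # about 30% structural zeros
y = np.where(structural, 0, counts)

beta0_hat, beta_hat, pi_hat = zip_em_lasso(X, y, lam=0.05)
print(beta0_hat, beta_hat, pi_hat)
```

In a graphical-model setting, a regression of this kind would be fitted for each node against all other nodes, and the nonzero coefficients would be read as candidate edges; how the paper combines the node-wise fits and chooses the penalty is not stated in the abstract.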

RESULTS

Results from the simulated data indicate that our method outperforms the local Poisson graphical model in the presence of excess zeros. In an application to RNA sequencing data, we also investigate the gender effect by comparing the networks estimated separately for each gender. Our method may help identify biological pathways linked to sex hormone regulation and thus clarify the mechanisms underlying gender differences.

CONCLUSIONS

We have presented a penalized version of zero-inflated spatial Poisson regression and derived an efficient EM algorithm built on coordinate descent. We discuss possible improvements to our method as well as potential research directions arising from our findings on the RNA sequencing data.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2775/5674822/ff9c76915e7f/12863_2017_561_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2775/5674822/ca7fb262ddcb/12863_2017_561_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2775/5674822/83b951e1c433/12863_2017_561_Fig2_HTML.jpg


