• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用RNA测序数据构建基因共变网络的泊松对数正态模型。

A Poisson Log-Normal Model for Constructing Gene Covariation Network Using RNA-seq Data.

作者信息

Choi Yoonha, Coram Marc, Peng Jie, Tang Hua

机构信息

1 Department of Genetics, Stanford University , Stanford, California.

2 Department of Health Research and Policy, Stanford University , Stanford, California.

出版信息

J Comput Biol. 2017 Jul;24(7):721-731. doi: 10.1089/cmb.2017.0053. Epub 2017 May 30.

DOI:10.1089/cmb.2017.0053
PMID:28557607
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5510689/
Abstract

Constructing expression networks using transcriptomic data is an effective approach for studying gene regulation. A popular approach for constructing such a network is based on the Gaussian graphical model (GGM), in which an edge between a pair of genes indicates that the expression levels of these two genes are conditionally dependent, given the expression levels of all other genes. However, GGMs are not appropriate for non-Gaussian data, such as those generated in RNA-seq experiments. We propose a novel statistical framework that maximizes a penalized likelihood, in which the observed count data follow a Poisson log-normal distribution. To overcome the computational challenges, we use Laplace's method to approximate the likelihood and its gradients, and apply the alternating directions method of multipliers to find the penalized maximum likelihood estimates. The proposed method is evaluated and compared with GGMs using both simulated and real RNA-seq data. The proposed method shows improved performance in detecting edges that represent covarying pairs of genes, particularly for edges connecting low-abundant genes and edges around regulatory hubs.

摘要

利用转录组数据构建表达网络是研究基因调控的一种有效方法。构建此类网络的一种常用方法基于高斯图形模型(GGM),其中一对基因之间的边表示在所有其他基因的表达水平给定的情况下,这两个基因的表达水平是条件依赖的。然而,GGM不适用于非高斯数据,例如RNA测序实验中产生的数据。我们提出了一种新的统计框架,该框架最大化惩罚似然,其中观测到的计数数据遵循泊松对数正态分布。为了克服计算挑战,我们使用拉普拉斯方法来近似似然及其梯度,并应用乘子交替方向法来找到惩罚最大似然估计。使用模拟和真实RNA测序数据对所提出的方法进行评估并与GGM进行比较。所提出的方法在检测代表基因协变对的边方面表现出改进的性能,特别是对于连接低丰度基因的边和调控枢纽周围的边。

相似文献

1
A Poisson Log-Normal Model for Constructing Gene Covariation Network Using RNA-seq Data.一种使用RNA测序数据构建基因共变网络的泊松对数正态模型。
J Comput Biol. 2017 Jul;24(7):721-731. doi: 10.1089/cmb.2017.0053. Epub 2017 May 30.
2
Network analysis for count data with excess zeros.针对具有过多零值的计数数据的网络分析。
BMC Genet. 2017 Nov 6;18(1):93. doi: 10.1186/s12863-017-0561-z.
3
PLNseq: a multivariate Poisson lognormal distribution for high-throughput matched RNA-sequencing read count data.PLNseq:一种用于高通量匹配RNA测序读数计数数据的多元泊松对数正态分布。
Stat Med. 2015 Apr 30;34(9):1577-89. doi: 10.1002/sim.6449. Epub 2015 Jan 30.
4
Estimation of sparse directed acyclic graphs for multivariate counts data.多元计数数据的稀疏有向无环图估计
Biometrics. 2016 Sep;72(3):791-803. doi: 10.1111/biom.12467. Epub 2016 Feb 5.
5
A New -Regularized Log-Linear Poisson Graphical Model with Applications to RNA Sequencing Data.一种新的正则化对数线性泊松图模型及其在 RNA 测序数据中的应用。
6
Statistical methods on detecting differentially expressed genes for RNA-seq data.用于检测RNA测序数据中差异表达基因的统计方法。
BMC Syst Biol. 2011;5 Suppl 3(Suppl 3):S1. doi: 10.1186/1752-0509-5-S3-S1. Epub 2011 Dec 23.
7
Joint gene network construction by single-cell RNA sequencing data.基于单细胞 RNA 测序数据的基因共网络构建
Biometrics. 2023 Jun;79(2):915-925. doi: 10.1111/biom.13645. Epub 2022 Apr 15.
8
A hierarchical poisson log-normal model for network inference from RNA sequencing data.基于 RNA 测序数据的网络推断的层次泊松对数正态模型。
PLoS One. 2013 Oct 17;8(10):e77503. doi: 10.1371/journal.pone.0077503. eCollection 2013.
9
JRmGRN: joint reconstruction of multiple gene regulatory networks with common hub genes using data from multiple tissues or conditions.多组织或多条件数据下共同枢纽基因的多个基因调控网络的联合重建
Bioinformatics. 2018 Oct 15;34(20):3470-3478. doi: 10.1093/bioinformatics/bty354.
10
A Local Poisson Graphical Model for inferring networks from sequencing data.基于测序数据推断网络的局部泊松图模型。
IEEE Trans Nanobioscience. 2013 Sep;12(3):189-98. doi: 10.1109/TNB.2013.2263838. Epub 2013 Aug 15.

引用本文的文献

1
Internal and external normalization of nascent RNA sequencing run-on experiments.新生 RNA 测序延伸实验的内外部标准化。
BMC Bioinformatics. 2024 Jan 12;25(1):19. doi: 10.1186/s12859-023-05607-3.
2
Prioritizing Autism Risk Genes using Personalized Graphical Models Estimated from Single Cell RNA-seq Data.利用从单细胞RNA测序数据估计的个性化图形模型对自闭症风险基因进行优先级排序。
J Am Stat Assoc. 2022;117(537):38-51. doi: 10.1080/01621459.2021.1933495. Epub 2021 Jul 21.

本文引用的文献

1
The huge Package for High-dimensional Undirected Graph Estimation in R.R语言中用于高维无向图估计的庞大软件包。
J Mach Learn Res. 2012 Apr;13:1059-1062.
2
Inference of radio-responsive gene regulatory networks using the graphical lasso algorithm.基于图lasso 算法的放射敏感基因调控网络推断。
BMC Bioinformatics. 2014;15 Suppl 7(Suppl 7):S5. doi: 10.1186/1471-2105-15-S7-S5. Epub 2014 May 28.
3
BOOTSTRAP INFERENCE FOR NETWORK CONSTRUCTION WITH AN APPLICATION TO A BREAST CANCER MICROARRAY STUDY.用于网络构建的自助推断及其在乳腺癌微阵列研究中的应用
Ann Appl Stat. 2013 Mar 1;7(1):391-417. doi: 10.1214/12-AOAS589.
4
A hierarchical poisson log-normal model for network inference from RNA sequencing data.基于 RNA 测序数据的网络推断的层次泊松对数正态模型。
PLoS One. 2013 Oct 17;8(10):e77503. doi: 10.1371/journal.pone.0077503. eCollection 2013.
5
Transcriptome and genome sequencing uncovers functional variation in humans.转录组和基因组测序揭示了人类功能变异。
Nature. 2013 Sep 26;501(7468):506-11. doi: 10.1038/nature12531. Epub 2013 Sep 15.
6
A Local Poisson Graphical Model for inferring networks from sequencing data.基于测序数据推断网络的局部泊松图模型。
IEEE Trans Nanobioscience. 2013 Sep;12(3):189-98. doi: 10.1109/TNB.2013.2263838. Epub 2013 Aug 15.
7
Variation and genetic control of protein abundance in humans.人类蛋白质丰度的变化和遗传控制。
Nature. 2013 Jul 4;499(7456):79-82. doi: 10.1038/nature12223. Epub 2013 May 15.
8
Learning oncogenic pathways from binary genomic instability data.从二元基因组不稳定性数据中学习致癌途径。
Biometrics. 2011 Mar;67(1):164-73. doi: 10.1111/j.1541-0420.2010.01417.x.
9
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.edgeR:一个用于数字基因表达数据差异表达分析的 Bioconductor 包。
Bioinformatics. 2010 Jan 1;26(1):139-40. doi: 10.1093/bioinformatics/btp616. Epub 2009 Nov 11.
10
Partial Correlation Estimation by Joint Sparse Regression Models.基于联合稀疏回归模型的偏相关估计
J Am Stat Assoc. 2009 Jun 1;104(486):735-746. doi: 10.1198/jasa.2009.0126.