• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SEMtree:基于结构方程模型的树状结构学习方法。

SEMtree: tree-based structure learning methods with structural equation models.

机构信息

Department of Brain and Behavioral Sciences, University of Pavia, Pavia 27100, Italy.

出版信息

Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad377.

DOI:10.1093/bioinformatics/btad377
PMID:37294820
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10287946/
Abstract

MOTIVATION

With the exponential growth of expression and protein-protein interaction (PPI) data, the identification of functional modules in PPI networks that show striking changes in molecular activity or phenotypic signatures becomes of particular interest to reveal process-specific information that is correlated with cellular or disease states. This requires both the identification of network nodes with reliability scores and the availability of an efficient technique to locate the network regions with the highest scores. In the literature, a number of heuristic methods have been suggested. We propose SEMtree(), a set of tree-based structure discovery algorithms, combining graph and statistically interpretable parameters together with a user-friendly R package based on structural equation models framework.

RESULTS

Condition-specific changes from differential expression and gene-gene co-expression are recovered with statistical testing of node, directed edge, and directed path difference between groups. In the end, from a list of seed (i.e. disease) genes or gene P-values, the perturbed modules with undirected edges are generated with five state-of-the-art active subnetwork detection methods. The latter are supplied to causal additive trees based on Chu-Liu-Edmonds' algorithm (Chow and Liu, Approximating discrete probability distributions with dependence trees. IEEE Trans Inform Theory 1968;14:462-7) in SEMtree() to be converted in directed trees. This conversion allows to compare the methods in terms of directed active subnetworks. We applied SEMtree() to both Coronavirus disease (COVID-19) RNA-seq dataset (GEO accession: GSE172114) and simulated datasets with various differential expression patterns. Compared to existing methods, SEMtree() is able to capture biologically relevant subnetworks with simple visualization of directed paths, good perturbation extraction, and classifier performance.

AVAILABILITY AND IMPLEMENTATION

SEMtree() function is implemented in the R package SEMgraph, easily available at https://CRAN.R-project.org/package=SEMgraph.

摘要

动机

随着表达谱和蛋白质-蛋白质相互作用(PPI)数据的指数级增长,识别 PPI 网络中显示分子活性或表型特征显著变化的功能模块,对于揭示与细胞或疾病状态相关的特定过程信息变得尤为重要。这既需要识别具有可靠性评分的网络节点,又需要提供一种有效的技术来定位具有最高评分的网络区域。在文献中,已经提出了许多启发式方法。我们提出了 SEMtree(),这是一组基于树的结构发现算法,将图和可统计解释的参数与基于结构方程模型框架的用户友好的 R 包结合在一起。

结果

通过对节点、有向边和组间有向路径的差异进行统计检验,恢复了差异表达和基因-基因共表达的条件特异性变化。最后,从种子(即疾病)基因或基因 P 值列表中,使用五种最先进的主动子网检测方法生成具有无向边的扰动模块。后者基于 Chu-Liu-Edmonds 算法(Chow 和 Liu,使用依赖树逼近离散概率分布。IEEE Trans Inform Theory 1968;14:462-7)提供给 SEMtree()中的因果加法树,以转换为有向树。这种转换允许根据有向主动子网比较方法。我们将 SEMtree()应用于冠状病毒病(COVID-19)RNA-seq 数据集(GEO 注册号:GSE172114)和具有各种差异表达模式的模拟数据集。与现有方法相比,SEMTree()能够通过简单的有向路径可视化、良好的扰动提取和分类器性能,捕获具有生物学意义的子网。

可用性和实现

SEMTree()函数在 R 包 SEMgraph 中实现,可在 https://CRAN.R-project.org/package=SEMgraph 上轻松获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/012e/10287946/1136a2c84cc5/btad377f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/012e/10287946/cfbf8c2c4544/btad377f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/012e/10287946/1136a2c84cc5/btad377f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/012e/10287946/cfbf8c2c4544/btad377f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/012e/10287946/1136a2c84cc5/btad377f2.jpg

相似文献

1
SEMtree: tree-based structure learning methods with structural equation models.SEMtree:基于结构方程模型的树状结构学习方法。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad377.
2
SEMgsa: topology-based pathway enrichment analysis with structural equation models.SEMgsa:基于拓扑结构的路径富集分析与结构方程模型。
BMC Bioinformatics. 2022 Aug 17;23(1):344. doi: 10.1186/s12859-022-04884-8.
3
SEMgraph: an R package for causal network inference of high-throughput data with structural equation models.SEMgraph:一个用于使用结构方程模型对高通量数据进行因果网络推断的 R 包。
Bioinformatics. 2022 Oct 14;38(20):4829-4830. doi: 10.1093/bioinformatics/btac567.
4
A multi-objective genetic algorithm to find active modules in multiplex biological networks.一种用于在多重生物网络中发现活性模块的多目标遗传算法。
PLoS Comput Biol. 2021 Aug 30;17(8):e1009263. doi: 10.1371/journal.pcbi.1009263. eCollection 2021 Aug.
5
GSNFS: Gene subnetwork biomarker identification of lung cancer expression data.GSNFS:肺癌表达数据的基因子网生物标志物识别
BMC Med Genomics. 2016 Dec 5;9(Suppl 3):70. doi: 10.1186/s12920-016-0231-4.
6
Detecting subnetwork-level dynamic correlations.检测子网级动态相关性。
Bioinformatics. 2017 Jan 15;33(2):256-265. doi: 10.1093/bioinformatics/btw616. Epub 2016 Sep 25.
7
Identifying functional modules in protein-protein interaction networks: an integrated exact approach.识别蛋白质-蛋白质相互作用网络中的功能模块:一种综合精确方法。
Bioinformatics. 2008 Jul 1;24(13):i223-31. doi: 10.1093/bioinformatics/btn161.
8
wTO: an R package for computing weighted topological overlap and a consensus network with integrated visualization tool.wTO:一个用于计算加权拓扑重叠和共识网络的 R 包,具有集成的可视化工具。
BMC Bioinformatics. 2018 Oct 24;19(1):392. doi: 10.1186/s12859-018-2351-7.
9
SILGGM: An extensive R package for efficient statistical inference in large-scale gene networks.SILGGM:一个用于大规模基因网络中高效统计推断的扩展 R 包。
PLoS Comput Biol. 2018 Aug 13;14(8):e1006369. doi: 10.1371/journal.pcbi.1006369. eCollection 2018 Aug.
10
Network-based analysis of omics data: the LEAN method.基于网络的组学数据分析:LEAN方法。
Bioinformatics. 2017 Mar 1;33(5):701-709. doi: 10.1093/bioinformatics/btw676.

引用本文的文献

1
Transcriptomic Module Discovery of Diarrhea-Predominant Irritable Bowel Syndrome: A Causal Network Inference Approach.腹泻型肠易激综合征的转录组模块发现:一种因果网络推断方法。
Int J Mol Sci. 2024 Aug 28;25(17):9322. doi: 10.3390/ijms25179322.

本文引用的文献

1
SEMgraph: an R package for causal network inference of high-throughput data with structural equation models.SEMgraph:一个用于使用结构方程模型对高通量数据进行因果网络推断的 R 包。
Bioinformatics. 2022 Oct 14;38(20):4829-4830. doi: 10.1093/bioinformatics/btac567.
2
SEMgsa: topology-based pathway enrichment analysis with structural equation models.SEMgsa:基于拓扑结构的路径富集分析与结构方程模型。
BMC Bioinformatics. 2022 Aug 17;23(1):344. doi: 10.1186/s12859-022-04884-8.
3
Could the CCR5-Delta32 mutation be protective in SARS-CoV-2 infection?
CCR5-Delta32 突变是否可能对 SARS-CoV-2 感染具有保护作用?
Physiol Res. 2021 Dec 16;70(S2):S249-S252. doi: 10.33549/physiolres.934725.
4
Identification of transcriptional regulatory network associated with response of host epithelial cells to SARS-CoV-2.鉴定与宿主上皮细胞对 SARS-CoV-2 反应相关的转录调控网络。
Sci Rep. 2021 Dec 14;11(1):23928. doi: 10.1038/s41598-021-03309-5.
5
SARS-CoV-2-host proteome interactions for antiviral drug discovery.SARS-CoV-2 宿主蛋白组与抗病毒药物发现的相互作用。
Mol Syst Biol. 2021 Nov;17(11):e10396. doi: 10.15252/msb.202110396.
6
Identification of driver genes for critical forms of COVID-19 in a deeply phenotyped young patient cohort.在一个深度表型年轻患者队列中鉴定 COVID-19 危急型的驱动基因。
Sci Transl Med. 2022 Jan 19;14(628):eabj7521. doi: 10.1126/scitranslmed.abj7521.
7
Predicting COVID-19-Comorbidity Pathway Crosstalk-Based Targets and Drugs: Towards Personalized COVID-19 Management.基于新冠肺炎合并症通路串扰预测靶点和药物:迈向个性化新冠肺炎管理
Biomedicines. 2021 May 17;9(5):556. doi: 10.3390/biomedicines9050556.
8
The Role of Autophagy and NLRP3 Inflammasome in Liver Fibrosis.自噬和 NLRP3 炎性小体在肝纤维化中的作用。
Biomed Res Int. 2020 Jul 11;2020:7269150. doi: 10.1155/2020/7269150. eCollection 2020.
9
pathfindR: An R Package for Comprehensive Identification of Enriched Pathways in Omics Data Through Active Subnetworks.pathfindR:一个通过活性子网全面识别组学数据中富集通路的R包。
Front Genet. 2019 Sep 25;10:858. doi: 10.3389/fgene.2019.00858. eCollection 2019.
10
A Comprehensive Survey of Tools and Software for Active Subnetwork Identification.用于活跃子网识别的工具和软件综合调查。
Front Genet. 2019 Mar 5;10:155. doi: 10.3389/fgene.2019.00155. eCollection 2019.