• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PriPath:通过分组、评分和建模,并结合嵌入式特征选择方法,从差异基因表达中识别失调途径。

PriPath: identifying dysregulated pathways from differential gene expression via grouping, scoring, and modeling with an embedded feature selection approach.

机构信息

Department of Information Systems, Zefat Academic College, 13206, Zefat, Israel.

Galilee Digital Health Research Center (GDH), Zefat Academic College, Zefat, Israel.

出版信息

BMC Bioinformatics. 2023 Feb 23;24(1):60. doi: 10.1186/s12859-023-05187-2.

DOI:10.1186/s12859-023-05187-2
PMID:36823571
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9947447/
Abstract

BACKGROUND

Cell homeostasis relies on the concerted actions of genes, and dysregulated genes can lead to diseases. In living organisms, genes or their products do not act alone but within networks. Subsets of these networks can be viewed as modules that provide specific functionality to an organism. The Kyoto encyclopedia of genes and genomes (KEGG) systematically analyzes gene functions, proteins, and molecules and combines them into pathways. Measurements of gene expression (e.g., RNA-seq data) can be mapped to KEGG pathways to determine which modules are affected or dysregulated in the disease. However, genes acting in multiple pathways and other inherent issues complicate such analyses. Many current approaches may only employ gene expression data and need to pay more attention to some of the existing knowledge stored in KEGG pathways for detecting dysregulated pathways. New methods that consider more precompiled information are required for a more holistic association between gene expression and diseases.

RESULTS

PriPath is a novel approach that transfers the generic process of grouping and scoring, followed by modeling to analyze gene expression with KEGG pathways. In PriPath, KEGG pathways are utilized as the grouping function as part of a machine learning algorithm for selecting the most significant KEGG pathways. A machine learning model is trained to differentiate between diseases and controls using those groups. We have tested PriPath on 13 gene expression datasets of various cancers and other diseases. Our proposed approach successfully assigned biologically and clinically relevant KEGG terms to the samples based on the differentially expressed genes. We have comparatively evaluated the performance of PriPath against other tools, which are similar in their merit. For each dataset, we manually confirmed the top results of PriPath in the literature and found that most predictions can be supported by previous experimental research.

CONCLUSIONS

PriPath can thus aid in determining dysregulated pathways, which applies to medical diagnostics. In the future, we aim to advance this approach so that it can perform patient stratification based on gene expression and identify druggable targets. Thereby, we cover two aspects of precision medicine.

摘要

背景

细胞内稳态依赖于基因的协同作用,失调的基因可能导致疾病。在活生物体中,基因或其产物并非单独起作用,而是在网络中起作用。这些网络的子集可以看作是为生物体提供特定功能的模块。京都基因与基因组百科全书(KEGG)系统地分析基因功能、蛋白质和分子,并将它们组合成途径。基因表达的测量(例如 RNA-seq 数据)可以映射到 KEGG 途径,以确定疾病中哪些模块受到影响或失调。然而,作用于多个途径的基因和其他内在问题使此类分析变得复杂。许多当前的方法可能仅使用基因表达数据,需要更多地关注 KEGG 途径中存储的一些现有知识,以检测失调的途径。需要新的方法来考虑更多预先编译的信息,以便更全面地将基因表达与疾病联系起来。

结果

PriPath 是一种新颖的方法,它将分组和评分的通用过程以及建模转移到使用 KEGG 途径分析基因表达的过程中。在 PriPath 中,KEGG 途径被用作分组功能,作为机器学习算法的一部分,用于选择最重要的 KEGG 途径。使用这些组训练机器学习模型来区分疾病和对照。我们已经在各种癌症和其他疾病的 13 个基因表达数据集上测试了 PriPath。我们的方法根据差异表达基因成功地将具有生物学和临床意义的 KEGG 术语分配给样本。我们比较了 PriPath 与其他类似工具的性能,这些工具在功能上相似。对于每个数据集,我们都在文献中手动确认了 PriPath 的顶级结果,并发现大多数预测都可以得到先前实验研究的支持。

结论

因此,PriPath 可以帮助确定失调的途径,这适用于医学诊断。在未来,我们的目标是推进这一方法,以便能够根据基因表达对患者进行分层,并识别可用药的靶点。从而,我们涵盖了精准医学的两个方面。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/9948518/8874b574e7c9/12859_2023_5187_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/9948518/74af8c9e0924/12859_2023_5187_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/9948518/5895d69b19a6/12859_2023_5187_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/9948518/7c69270d53a1/12859_2023_5187_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/9948518/29e012657cbb/12859_2023_5187_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/9948518/8874b574e7c9/12859_2023_5187_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/9948518/74af8c9e0924/12859_2023_5187_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/9948518/5895d69b19a6/12859_2023_5187_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/9948518/7c69270d53a1/12859_2023_5187_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/9948518/29e012657cbb/12859_2023_5187_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aed/9948518/8874b574e7c9/12859_2023_5187_Fig5_HTML.jpg

相似文献

1
PriPath: identifying dysregulated pathways from differential gene expression via grouping, scoring, and modeling with an embedded feature selection approach.PriPath:通过分组、评分和建模,并结合嵌入式特征选择方法,从差异基因表达中识别失调途径。
BMC Bioinformatics. 2023 Feb 23;24(1):60. doi: 10.1186/s12859-023-05187-2.
2
CogNet: classification of gene expression data based on ranked active-subnetwork-oriented KEGG pathway enrichment analysis.CogNet:基于面向排名活性子网的KEGG通路富集分析的基因表达数据分类
PeerJ Comput Sci. 2021 Feb 22;7:e336. doi: 10.7717/peerj-cs.336. eCollection 2021.
3
GeNetOntology: identifying affected gene ontology terms via grouping, scoring, and modeling of gene expression data utilizing biological knowledge-based machine learning.基因本体论:通过利用基于生物知识的机器学习对基因表达数据进行分组、评分和建模来识别受影响的基因本体术语。
Front Genet. 2023 Aug 21;14:1139082. doi: 10.3389/fgene.2023.1139082. eCollection 2023.
4
Identifying dysregulated pathways in cancers from pathway interaction networks.从通路相互作用网络中鉴定癌症中的失调通路。
BMC Bioinformatics. 2012 Jun 7;13:126. doi: 10.1186/1471-2105-13-126.
5
The use of Gene Ontology terms and KEGG pathways for analysis and prediction of oncogenes.利用基因本体论术语和KEGG通路进行癌基因的分析与预测。
Biochim Biophys Acta. 2016 Nov;1860(11 Pt B):2725-34. doi: 10.1016/j.bbagen.2016.01.012. Epub 2016 Jan 20.
6
Systematical Identification of Breast Cancer-Related Circular RNA Modules for Deciphering circRNA Functions Based on the Non-Negative Matrix Factorization Algorithm.基于非负矩阵分解算法的乳腺癌相关环状 RNA 模块的系统识别,用于解析环状 RNA 功能。
Int J Mol Sci. 2019 Feb 20;20(4):919. doi: 10.3390/ijms20040919.
7
Detecting discordance enrichment among a series of two-sample genome-wide expression data sets.检测一系列双样本全基因组表达数据集之间的不一致性富集情况。
BMC Genomics. 2017 Jan 25;18(Suppl 1):1050. doi: 10.1186/s12864-016-3265-2.
8
9
The identification of key genes and pathways in hepatocellular carcinoma by bioinformatics analysis of high-throughput data.通过高通量数据的生物信息学分析鉴定肝细胞癌中的关键基因和信号通路。
Med Oncol. 2017 Jun;34(6):101. doi: 10.1007/s12032-017-0963-9. Epub 2017 Apr 21.
10
Path2enet: generation of human pathway-derived networks in an expression specific context.Path2enet:在特定表达背景下生成人类通路衍生网络。
BMC Genomics. 2016 Oct 25;17(Suppl 8):731. doi: 10.1186/s12864-016-3066-7.

引用本文的文献

1
RCE-IFE: recursive cluster elimination with intra-cluster feature elimination.RCE-IFE:带簇内特征消除的递归簇消除
PeerJ Comput Sci. 2025 Feb 7;11:e2528. doi: 10.7717/peerj-cs.2528. eCollection 2025.
2
Metastatic melanoma: An integrated analysis to identify critical regulators associated with prognosis, pathogenesis and targeted therapies.转移性黑色素瘤:一项综合分析,以确定与预后、发病机制和靶向治疗相关的关键调节因子。
PLoS One. 2025 Jan 16;20(1):e0312754. doi: 10.1371/journal.pone.0312754. eCollection 2025.
3
Topic selection for text classification using ensemble topic modeling with grouping, scoring, and modeling approach.

本文引用的文献

1
GediNET for discovering gene associations across diseases using knowledge based machine learning approach.基于知识的机器学习方法发现疾病间基因关联的 GediNET。
Sci Rep. 2022 Nov 19;12(1):19955. doi: 10.1038/s41598-022-24421-0.
2
miRModuleNet: Detecting miRNA-mRNA Regulatory Modules.miRModuleNet:检测微小RNA-信使核糖核酸调控模块
Front Genet. 2022 Apr 12;13:767455. doi: 10.3389/fgene.2022.767455. eCollection 2022.
3
Neuroimaging and biomarker evidence of neurodegeneration in asthma.神经影像学和生物标志物在哮喘神经退行性变中的证据。
使用具有分组、评分和建模方法的集成主题建模进行文本分类的主题选择
Sci Rep. 2024 Oct 9;14(1):23516. doi: 10.1038/s41598-024-74022-2.
4
Establishing a model composed of immune-related gene-modules to predict tumor immunotherapy response.建立一个由免疫相关基因模块组成的模型,以预测肿瘤免疫治疗反应。
Sci Rep. 2024 Jul 18;14(1):16630. doi: 10.1038/s41598-024-67742-y.
5
microBiomeGSM: the identification of taxonomic biomarkers from metagenomic data using grouping, scoring and modeling (G-S-M) approach.微生物群落GSM:使用分组、评分和建模(G-S-M)方法从宏基因组数据中识别分类学生物标志物。
Front Microbiol. 2023 Nov 22;14:1264941. doi: 10.3389/fmicb.2023.1264941. eCollection 2023.
6
TextNetTopics Pro, a topic model-based text classification for short text by integration of semantic and document-topic distribution information.TextNetTopics Pro,一种基于主题模型的短文本分类方法,通过整合语义和文档主题分布信息实现。
Front Genet. 2023 Oct 5;14:1243874. doi: 10.3389/fgene.2023.1243874. eCollection 2023.
7
GeNetOntology: identifying affected gene ontology terms via grouping, scoring, and modeling of gene expression data utilizing biological knowledge-based machine learning.基因本体论:通过利用基于生物知识的机器学习对基因表达数据进行分组、评分和建模来识别受影响的基因本体术语。
Front Genet. 2023 Aug 21;14:1139082. doi: 10.3389/fgene.2023.1139082. eCollection 2023.
8
Review of feature selection approaches based on grouping of features.基于特征分组的特征选择方法综述。
PeerJ. 2023 Jul 17;11:e15666. doi: 10.7717/peerj.15666. eCollection 2023.
9
Invention of 3Mint for feature grouping and scoring in multi-omics.用于多组学中特征分组和评分的3Mint的发明。
Front Genet. 2023 Mar 15;14:1093326. doi: 10.3389/fgene.2023.1093326. eCollection 2023.
10
miRdisNET: Discovering microRNA biomarkers that are associated with diseases utilizing biological knowledge-based machine learning.miRdisNET:利用基于生物学知识的机器学习发现与疾病相关的微小RNA生物标志物。
Front Genet. 2023 Jan 12;13:1076554. doi: 10.3389/fgene.2022.1076554. eCollection 2022.
J Allergy Clin Immunol. 2022 Feb;149(2):589-598.e6. doi: 10.1016/j.jaci.2021.09.010. Epub 2021 Sep 15.
4
FLAME: A Web Tool for Functional and Literature Enrichment Analysis of Multiple Gene Lists.FLAME:一个用于多个基因列表功能和文献富集分析的网络工具。
Biology (Basel). 2021 Jul 14;10(7):665. doi: 10.3390/biology10070665.
5
Upregulated NLGN1 predicts poor survival in colorectal cancer.上调的 NLGN1 可预测结直肠癌患者的不良预后。
BMC Cancer. 2021 Aug 2;21(1):884. doi: 10.1186/s12885-021-08621-x.
6
miRcorrNet: machine learning-based integration of miRNA and mRNA expression profiles, combined with feature grouping and ranking.miRcorrNet:基于机器学习的miRNA和mRNA表达谱整合,结合特征分组与排序
PeerJ. 2021 May 19;9:e11458. doi: 10.7717/peerj.11458. eCollection 2021.
7
Luteolin Ameliorates Experimental Pulmonary Arterial Hypertension via Suppressing Hippo-YAP/PI3K/AKT Signaling Pathway.木犀草素通过抑制Hippo-YAP/PI3K/AKT信号通路改善实验性肺动脉高压
Front Pharmacol. 2021 Apr 15;12:663551. doi: 10.3389/fphar.2021.663551. eCollection 2021.
8
PANX2 and brain lower grade glioma genesis: A bioinformatic analysis.PANX2 与脑低级别胶质瘤的发生:生物信息学分析。
Sci Prog. 2021 Apr-Jun;104(2):368504211011836. doi: 10.1177/00368504211011836.
9
CogNet: classification of gene expression data based on ranked active-subnetwork-oriented KEGG pathway enrichment analysis.CogNet:基于面向排名活性子网的KEGG通路富集分析的基因表达数据分类
PeerJ Comput Sci. 2021 Feb 22;7:e336. doi: 10.7717/peerj-cs.336. eCollection 2021.
10
Recursive Cluster Elimination based Rank Function (SVM-RCE-R) implemented in KNIME.基于递归聚类消除的秩函数(SVM-RCE-R)在 KNIME 中的实现。
F1000Res. 2020 Oct 19;9:1255. doi: 10.12688/f1000research.26880.2. eCollection 2020.