A gene pathway enrichment method based on improved TF-IDF algorithm.

Authors

Xu Shutan, Leng Yinhui, Feng Guofu, Zhang Chenjing, Chen Ming

Affiliations

College of Information Technology, Shanghai Ocean University, Shanghai, 201306, China.

Key Laboratory of Fisheries Information, Ministry of Agriculture, Shanghai, 201306, China.

Publication information

Biochem Biophys Rep. 2023 Mar 6;34:101421. doi: 10.1016/j.bbrep.2023.101421. eCollection 2023 Jul.

DOI:10.1016/j.bbrep.2023.101421
PMID:36923007
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10009669/
Abstract

Gene pathway enrichment analysis is a widely used method for testing whether a gene set is statistically enriched in a given biological pathway network. Current gene pathway enrichment methods commonly consider the local importance of genes in pathways without considering the interactions between genes. In this paper, we propose a gene pathway enrichment method (GIGSEA) based on an improved TF-IDF algorithm. The method uses gene interaction data to calculate the influence of each gene from both its local importance within a pathway and its global specificity. Computational experiments show that, compared with traditional gene set enrichment analysis, the proposed method finds more specific enriched pathways related to the phenotype, and does so more efficiently.
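To make the TF-IDF analogy concrete, here is a minimal sketch that treats each pathway as a "document" and each gene as a "term". It is not the GIGSEA algorithm, whose exact weighting the abstract does not specify. The function name `gene_tfidf`, the input layout, and the choice of "1 + number of interaction partners inside the pathway" as the local weight are all assumptions made for illustration.

```python
import math
from collections import Counter


def gene_tfidf(pathways, interactions=()):
    """Score genes per pathway with a plain TF-IDF analogue.

    pathways:     dict mapping pathway name -> set of gene symbols
    interactions: iterable of (gene_a, gene_b) pairs (hypothetical input format)

    Assumption: local importance = 1 + number of interacting partners that
    lie in the same pathway, normalised over the pathway. This stands in
    for whatever interaction-based weight the paper actually uses.
    """
    interactions = list(interactions)
    n_pathways = len(pathways)
    # "Document frequency": in how many pathways does each gene occur?
    df = Counter(g for genes in pathways.values() for g in genes)

    scores = {}
    for name, genes in pathways.items():
        local = {g: 1.0 for g in genes}
        for a, b in interactions:
            if a != b and a in genes and b in genes:
                local[a] += 1.0
                local[b] += 1.0
        total = sum(local.values())
        scores[name] = {
            # tf (local importance) * idf (global specificity)
            g: (local[g] / total) * math.log(n_pathways / df[g])
            for g in genes
        }
    return scores


# Toy data: gene A occurs in every pathway, so it carries no specificity.
example_pathways = {"P1": {"A", "B", "C"}, "P2": {"A", "D"}, "P3": {"A", "E"}}
example_scores = gene_tfidf(example_pathways, interactions=[("B", "C")])
```

In this toy example, gene A receives a score of zero in every pathway because it appears in all of them, while B and C are up-weighted in P1 because they interact with each other there. A real analysis would then feed such weights into an enrichment statistic, a step this sketch leaves out.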

Figures

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd6f/10009669/f5cfff9e00e3/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd6f/10009669/573fed0af424/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd6f/10009669/9698b336c45b/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd6f/10009669/1b752bdef6e7/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd6f/10009669/770cc897b25d/gr5.jpg

Similar articles

1. A gene pathway enrichment method based on improved TF-IDF algorithm.
Biochem Biophys Rep. 2023 Mar 6;34:101421. doi: 10.1016/j.bbrep.2023.101421. eCollection 2023 Jul.
2. A CNN-Based Framework for Predicting Public Emotion and Multi-Level Behaviors Based on Network Public Opinion.
Front Psychol. 2022 Jun 23;13:909439. doi: 10.3389/fpsyg.2022.909439. eCollection 2022.
3. A detection method for android application security based on TF-IDF and machine learning.
PLoS One. 2020 Sep 11;15(9):e0238694. doi: 10.1371/journal.pone.0238694. eCollection 2020.
4. Detecting discordance enrichment among a series of two-sample genome-wide expression data sets.
BMC Genomics. 2017 Jan 25;18(Suppl 1):1050. doi: 10.1186/s12864-016-3265-2.
5. Pathway Enrichment Analysis with Networks.
Genes (Basel). 2017 Sep 28;8(10):246. doi: 10.3390/genes8100246.
6. Comparative study on gene set and pathway topology-based enrichment methods.
BMC Bioinformatics. 2015 Oct 22;16:334. doi: 10.1186/s12859-015-0751-5.
7. Pathway Distiller - multisource biological pathway consolidation.
BMC Genomics. 2012;13 Suppl 6(Suppl 6):S18. doi: 10.1186/1471-2164-13-S6-S18. Epub 2012 Oct 26.
8. A novel dysregulated pathway-identification analysis based on global influence of within-pathway effects and crosstalk between pathways.
J R Soc Interface. 2015 Jan 6;12(102):20140937. doi: 10.1098/rsif.2014.0937.
9. Identification of Serum Exosome-Derived circRNA-miRNA-TF-mRNA Regulatory Network in Postmenopausal Osteoporosis Using Bioinformatics Analysis and Validation in Peripheral Blood-Derived Mononuclear Cells.
Front Endocrinol (Lausanne). 2022 Jun 9;13:899503. doi: 10.3389/fendo.2022.899503. eCollection 2022.
10. Analyzing the pathways enriched in genes associated with nicotine dependence in the context of human protein-protein interaction network.
J Biomol Struct Dyn. 2019 Mar;37(5):1177-1188. doi: 10.1080/07391102.2018.1453377. Epub 2018 Mar 24.

Cited by

1. DrugBERT: a BERT-based approach integrating LDA topic embedding and efficacy-aware mechanism for predicting anti-tumor drug efficacy.
J Transl Med. 2025 Aug 5;23(1):864. doi: 10.1186/s12967-025-06795-7.
2. Novel Approach to Personalized Physician Recommendations Using Semantic Features and Response Metrics: Model Evaluation Study.
JMIR Hum Factors. 2024 Aug 15;11:e57670. doi: 10.2196/57670.
