• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

gdGSE:一种通过离散化基因表达值来评估通路富集的算法。

gdGSE: An algorithm to evaluate pathway enrichment by discretizing gene expression values.

作者信息

Luo Jiangti, Lu Qiqi, He Mengjiao, Zhang Xiaobo, Yang Xiang, Wang Xiaosheng

机构信息

Biomedical Informatics Research Lab, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing 211198, China.

Intelligent Pharmacy Interdisciplinary Research Center, China Pharmaceutical University, Nanjing 211198, China.

出版信息

Comput Struct Biotechnol J. 2025 May 1;27:1772-1783. doi: 10.1016/j.csbj.2025.04.038. eCollection 2025.

DOI:10.1016/j.csbj.2025.04.038
PMID:40458635
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12127574/
Abstract

We proposed gdGSE, a novel computational framework for gene set enrichment analysis. Unlike conventional methods that rely on continuous gene expression values, gdGSE employs discretized gene expression profiles to assess pathway activity. This approach effectively mitigates discrepancies caused by data distributions. This algorithm consists of two steps: (1) applying statistical thresholds binarizing gene expression matrix, and (2) converting the binarized gene expression matrix into a gene set enrichment matrix. Our results demonstrated that gdGSE could robustly extract biological insights from a diverse array of simulated and real bulk or single-cell gene expression datasets. Notably, gene set enrichment scores by gdGSE exhibited enhanced utility in downstream applications: (1) precise quantification of cancer stemness with significant prognostic relevance; (2) enhanced clustering performance in stratifying tumor subtypes with distinct prognoses; and (3) more accurate identification of cell types. Remarkably, the pathway activity scores by gdGSE showed > 90 % concordance with experimentally validated drug mechanisms in patients-derived xenografts and estrogen receptor-positive breast cancer cell lines. Our algorithm proposes that discretizing gene expression values provides an alternative method for evaluating pathway enrichment, applicable to both bulk and single-cell data analysis.

摘要

我们提出了gdGSE,一种用于基因集富集分析的新型计算框架。与依赖连续基因表达值的传统方法不同,gdGSE采用离散化的基因表达谱来评估通路活性。这种方法有效地减轻了由数据分布引起的差异。该算法包括两个步骤:(1)应用统计阈值对基因表达矩阵进行二值化,以及(2)将二值化的基因表达矩阵转换为基因集富集矩阵。我们的结果表明,gdGSE能够从各种模拟和真实的批量或单细胞基因表达数据集中稳健地提取生物学见解。值得注意的是,gdGSE的基因集富集分数在下游应用中表现出更高的效用:(1)精确量化具有显著预后相关性的癌症干性;(2)在区分具有不同预后的肿瘤亚型时增强聚类性能;以及(3)更准确地识别细胞类型。值得注意的是,gdGSE的通路活性分数与患者来源的异种移植和雌激素受体阳性乳腺癌细胞系中经过实验验证的药物机制显示出> 90% 的一致性。我们的算法表明,离散化基因表达值为评估通路富集提供了一种替代方法,适用于批量和单细胞数据分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/bbd726c8182b/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/db0ea67cc872/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/16b50d620a68/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/bab2bb78ebc6/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/b46215e01bd3/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/0cf16303c1f5/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/dc2634cc3c85/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/bbd726c8182b/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/db0ea67cc872/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/16b50d620a68/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/bab2bb78ebc6/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/b46215e01bd3/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/0cf16303c1f5/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/dc2634cc3c85/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6ea/12127574/bbd726c8182b/gr6.jpg

相似文献

1
gdGSE: An algorithm to evaluate pathway enrichment by discretizing gene expression values.gdGSE:一种通过离散化基因表达值来评估通路富集的算法。
Comput Struct Biotechnol J. 2025 May 1;27:1772-1783. doi: 10.1016/j.csbj.2025.04.038. eCollection 2025.
2
Identification of a novel immune-related gene signature by single-cell and bulk sequencing for the prediction of the immune landscape and prognosis of breast cancer.通过单细胞和批量测序鉴定一种新型免疫相关基因特征以预测乳腺癌的免疫格局和预后
Cancer Cell Int. 2024 Dec 3;24(1):393. doi: 10.1186/s12935-024-03589-7.
3
Classification of lung adenocarcinoma based on stemness scores in bulk and single cell transcriptomes.基于批量和单细胞转录组干性评分的肺腺癌分类
Comput Struct Biotechnol J. 2022 Apr 6;20:1691-1701. doi: 10.1016/j.csbj.2022.04.004. eCollection 2022.
4
Metabolic Heterogeneity of Tumor Cells and its Impact on Colon Cancer Metastasis: Insights from Single-Cell and Bulk Transcriptome Analyses.肿瘤细胞的代谢异质性及其对结肠癌转移的影响:来自单细胞和批量转录组分析的见解
J Cancer. 2024 Jun 3;15(13):4175-4196. doi: 10.7150/jca.94630. eCollection 2024.
5
MGSEA - a multivariate Gene set enrichment analysis.MGSEA - 一种多变量基因集富集分析方法。
BMC Bioinformatics. 2019 Mar 18;20(1):145. doi: 10.1186/s12859-019-2716-6.
6
Pathway Enrichment-Based Unsupervised Learning Identifies Novel Subtypes of Cancer-Associated Fibroblasts in Pancreatic Ductal Adenocarcinoma.基于通路富集的无监督学习识别胰腺导管腺癌中癌症相关成纤维细胞的新亚型
Interdiscip Sci. 2025 Apr 24. doi: 10.1007/s12539-025-00705-7.
7
Subtyping of sarcomas based on pathway enrichment scores in bulk and single cell transcriptomes.基于 bulk 和单细胞转录组中通路富集分数对肉瘤进行亚型分类。
J Transl Med. 2022 Jan 29;20(1):48. doi: 10.1186/s12967-022-03248-3.
8
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
9
Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types.单细胞转录组数据中肿瘤内和肿瘤间基因簇异质性的特征分析。
Biol Open. 2022 Jun 15;11(6). doi: 10.1242/bio.059256. Epub 2022 Jun 23.
10
Identification of a glycolysis- and lactate-related gene signature for predicting prognosis, immune microenvironment, and drug candidates in colon adenocarcinoma.鉴定用于预测结肠腺癌预后、免疫微环境和候选药物的糖酵解和乳酸相关基因特征。
Front Cell Dev Biol. 2022 Aug 23;10:971992. doi: 10.3389/fcell.2022.971992. eCollection 2022.

本文引用的文献

1
Cell cycle machinery in oncology: A comprehensive review of therapeutic targets.肿瘤细胞周期机制:治疗靶点的综合综述。
FASEB J. 2024 Jun 15;38(11):e23734. doi: 10.1096/fj.202400769R.
2
Response to tumor-infiltrating lymphocyte adoptive therapy is associated with preexisting CD8 T-myeloid cell networks in melanoma.对肿瘤浸润淋巴细胞过继疗法的反应与黑色素瘤中预先存在的 CD8 T-髓样细胞网络有关。
Sci Immunol. 2024 Feb 2;9(92):eadg7995. doi: 10.1126/sciimmunol.adg7995.
3
Single-cell sequencing reveals Hippo signaling as a driver of fibrosis in hidradenitis suppurativa.
单细胞测序揭示 Hippo 信号通路在化脓性汗腺炎纤维化中的驱动作用。
J Clin Invest. 2024 Feb 1;134(3):e169225. doi: 10.1172/JCI169225.
4
Identifying cancer subtypes based on embryonic and hematopoietic stem cell signatures in pan-cancer.基于泛癌中胚胎和造血干细胞特征识别癌症亚型。
Cell Oncol (Dordr). 2024 Apr;47(2):587-605. doi: 10.1007/s13402-023-00886-7. Epub 2023 Oct 12.
5
Dictionary learning for integrative, multimodal and scalable single-cell analysis.基于字典学习的综合、多模态和可扩展的单细胞分析。
Nat Biotechnol. 2024 Feb;42(2):293-304. doi: 10.1038/s41587-023-01767-y. Epub 2023 May 25.
6
Systematic assessment of prognostic molecular features across cancers.跨癌症的预后分子特征的系统评估。
Cell Genom. 2023 Feb 2;3(3):100262. doi: 10.1016/j.xgen.2023.100262. eCollection 2023 Mar 8.
7
De novo analysis of bulk RNA-seq data at spatially resolved single-cell resolution.在空间分辨的单细胞分辨率下对批量 RNA-seq 数据进行从头分析。
Nat Commun. 2022 Oct 30;13(1):6498. doi: 10.1038/s41467-022-34271-z.
8
A single-cell atlas of the multicellular ecosystem of primary and metastatic hepatocellular carcinoma.原发性和转移性肝细胞癌的多细胞生态系统单细胞图谱。
Nat Commun. 2022 Aug 6;13(1):4594. doi: 10.1038/s41467-022-32283-3.
9
Proprotein convertase subtilisin/kexin type 9 is a psoriasis-susceptibility locus that is negatively related to IL36G.脯氨酸内切酶枯草溶菌素/克胰蛋白酶 9 型是一个银屑病易感基因座,与 IL36G 呈负相关。
JCI Insight. 2022 Aug 22;7(16):e141193. doi: 10.1172/jci.insight.141193.
10
Signature-scoring methods developed for bulk samples are not adequate for cancer single-cell RNA sequencing data.用于批量样本的特征评分方法对于癌症单细胞 RNA 测序数据来说并不充分。
Elife. 2022 Feb 25;11:e71994. doi: 10.7554/eLife.71994.