• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的癌症蛋白分泌途径研究。

Machine learning-based investigation of the cancer protein secretory pathway.

机构信息

Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden.

Department of Computer Science and Engineering, Chalmers University of Technology, Gothenburg, Sweden.

出版信息

PLoS Comput Biol. 2021 Apr 5;17(4):e1008898. doi: 10.1371/journal.pcbi.1008898. eCollection 2021 Apr.

DOI:10.1371/journal.pcbi.1008898
PMID:33819271
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8049480/
Abstract

Deregulation of the protein secretory pathway (PSP) is linked to many hallmarks of cancer, such as promoting tissue invasion and modulating cell-cell signaling. The collection of secreted proteins processed by the PSP, known as the secretome, is often studied due to its potential as a reservoir of tumor biomarkers. However, there has been less focus on the protein components of the secretory machinery itself. We therefore investigated the expression changes in secretory pathway components across many different cancer types. Specifically, we implemented a dual approach involving differential expression analysis and machine learning to identify PSP genes whose expression was associated with key tumor characteristics: mutation of p53, cancer status, and tumor stage. Eight different machine learning algorithms were included in the analysis to enable comparison between methods and to focus on signals that were robust to algorithm type. The machine learning approach was validated by identifying PSP genes known to be regulated by p53, and even outperformed the differential expression analysis approach. Among the different analysis methods and cancer types, the kinesin family members KIF20A and KIF23 were consistently among the top genes associated with malignant transformation or tumor stage. However, unlike most cancer types which exhibited elevated KIF20A expression that remained relatively constant across tumor stages, renal carcinomas displayed a more gradual increase that continued with increasing disease severity. Collectively, our study demonstrates the complementary nature of a combined differential expression and machine learning approach for analyzing gene expression data, and highlights key PSP components relevant to features of tumor pathophysiology that may constitute potential therapeutic targets.

摘要

蛋白质分泌途径(PSP)的失调与癌症的许多特征有关,例如促进组织侵袭和调节细胞间信号。PSP 处理的分泌蛋白的集合,称为分泌组,由于其作为肿瘤生物标志物库的潜力而经常被研究。然而,对分泌机制本身的蛋白质成分的关注较少。因此,我们研究了许多不同癌症类型中分泌途径成分的表达变化。具体来说,我们采用了差异表达分析和机器学习的双重方法来识别与关键肿瘤特征相关的 PSP 基因的表达变化:p53 突变、癌症状态和肿瘤分期。该分析包括了 8 种不同的机器学习算法,以实现方法之间的比较,并关注对算法类型稳健的信号。通过识别已知受 p53 调节的 PSP 基因,对机器学习方法进行了验证,甚至优于差异表达分析方法。在不同的分析方法和癌症类型中,驱动蛋白家族成员 KIF20A 和 KIF23 一直是与恶性转化或肿瘤分期相关的最重要基因之一。然而,与大多数表现出相对稳定的 KIF20A 表达升高的癌症类型不同,肾细胞癌显示出更渐进的增加,随着疾病严重程度的增加而持续增加。总之,我们的研究表明,联合差异表达和机器学习方法分析基因表达数据具有互补性,并突出了与肿瘤病理生理学特征相关的关键 PSP 成分,这些成分可能构成潜在的治疗靶点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da04/8049480/2b0299d05e19/pcbi.1008898.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da04/8049480/1277b8403e95/pcbi.1008898.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da04/8049480/95e1b1cc3665/pcbi.1008898.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da04/8049480/e47b33e78a50/pcbi.1008898.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da04/8049480/8dbf390b4101/pcbi.1008898.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da04/8049480/2b0299d05e19/pcbi.1008898.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da04/8049480/1277b8403e95/pcbi.1008898.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da04/8049480/95e1b1cc3665/pcbi.1008898.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da04/8049480/e47b33e78a50/pcbi.1008898.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da04/8049480/8dbf390b4101/pcbi.1008898.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da04/8049480/2b0299d05e19/pcbi.1008898.g005.jpg

相似文献

1
Machine learning-based investigation of the cancer protein secretory pathway.基于机器学习的癌症蛋白分泌途径研究。
PLoS Comput Biol. 2021 Apr 5;17(4):e1008898. doi: 10.1371/journal.pcbi.1008898. eCollection 2021 Apr.
2
DriverML: a machine learning algorithm for identifying driver genes in cancer sequencing studies.DriverML:一种用于鉴定癌症测序研究中驱动基因的机器学习算法。
Nucleic Acids Res. 2019 May 7;47(8):e45. doi: 10.1093/nar/gkz096.
3
Machine Learning-based Classification of Diffuse Large B-cell Lymphoma Patients by Their Protein Expression Profiles.基于机器学习的弥漫性大B细胞淋巴瘤患者蛋白质表达谱分类
Mol Cell Proteomics. 2015 Nov;14(11):2947-60. doi: 10.1074/mcp.M115.050245. Epub 2015 Aug 26.
4
A probabilistic approach for automated discovery of perturbed genes using expression data from microarray or RNA-Seq.一种使用来自微阵列或RNA测序的表达数据自动发现受干扰基因的概率方法。
Comput Biol Med. 2015 Dec 1;67:29-40. doi: 10.1016/j.compbiomed.2015.07.029. Epub 2015 Aug 14.
5
HCSD: the human cancer secretome database.HCSD:人类癌症分泌蛋白组数据库。
Database (Oxford). 2015 Jun 14;2015:bav051. doi: 10.1093/database/bav051. Print 2015.
6
Overexpression of Kinesin Family Member 20A Correlates with Disease Progression and Poor Prognosis in Human Nasopharyngeal Cancer: A Retrospective Analysis of 105 Patients.驱动蛋白家族成员20A的过表达与人类鼻咽癌的疾病进展和不良预后相关:105例患者的回顾性分析
PLoS One. 2017 Jan 12;12(1):e0169280. doi: 10.1371/journal.pone.0169280. eCollection 2017.
7
Evaluation of Machine Learning Algorithm Utilization for Lung Cancer Classification Based on Gene Expression Levels.基于基因表达水平的肺癌分类机器学习算法应用评估
Asian Pac J Cancer Prev. 2016;17(2):835-8. doi: 10.7314/apjcp.2016.17.2.835.
8
Identification of a Tumor Microenvironment-relevant Gene set-based Prognostic Signature and Related Therapy Targets in Gastric Cancer.基于肿瘤微环境相关基因集的胃癌预后特征及相关治疗靶点的鉴定。
Theranostics. 2020 Jul 9;10(19):8633-8647. doi: 10.7150/thno.47938. eCollection 2020.
9
NRF1 motif sequence-enriched genes involved in ER/PR -ve HER2 +ve breast cancer signaling pathways.富含 NRF1 基序序列的基因参与 ER/PR-阴性、HER2 阳性乳腺癌信号通路。
Breast Cancer Res Treat. 2018 Nov;172(2):469-485. doi: 10.1007/s10549-018-4905-9. Epub 2018 Aug 20.
10
Machine learning on brain MRI data for differential diagnosis of Parkinson's disease and Progressive Supranuclear Palsy.基于脑磁共振成像数据的机器学习用于帕金森病和进行性核上性麻痹的鉴别诊断。
J Neurosci Methods. 2014 Jan 30;222:230-7. doi: 10.1016/j.jneumeth.2013.11.016. Epub 2013 Nov 26.

引用本文的文献

1
RnaXtract, a tool for extracting gene expression, variants, and cell-type composition from bulk RNA sequencing.RnaXtract,一种用于从大量RNA测序中提取基因表达、变异体和细胞类型组成的工具。
Sci Rep. 2025 Aug 24;15(1):31100. doi: 10.1038/s41598-025-16875-9.
2
Graph Neural Networks-Based Prediction of Drug Gene Interactions of RTK-VEGF4 Receptor Family in Periodontal Regeneration.基于图神经网络的牙周再生中RTK-VEGF4受体家族药物基因相互作用预测
J Clin Exp Dent. 2024 Dec 1;16(12):e1454-e1458. doi: 10.4317/jced.61880. eCollection 2024 Dec.
3
Role of artificial intelligence in cancer detection using protein p53: A Review.

本文引用的文献

1
Rab GTPases: Emerging Oncogenes and Tumor Suppressive Regulators for the Editing of Survival Pathways in Cancer.Rab GTP酶:癌症中用于编辑生存途径的新兴致癌基因和肿瘤抑制调节因子
Cancers (Basel). 2020 Jan 21;12(2):259. doi: 10.3390/cancers12020259.
2
Genome-scale reconstructions of the mammalian secretory pathway predict metabolic costs and limitations of protein secretion.哺乳动物分泌途径的基因组规模重建预测了蛋白质分泌的代谢成本和限制。
Nat Commun. 2020 Jan 2;11(1):68. doi: 10.1038/s41467-019-13867-y.
3
Comprehensive Identification and Characterization of Human Secretome Based on Integrative Proteomic and Transcriptomic Data.
人工智能在利用蛋白质p53进行癌症检测中的作用:综述
Mol Biol Rep. 2024 Dec 11;52(1):46. doi: 10.1007/s11033-024-10051-4.
4
A p53 transcriptional signature in primary and metastatic cancers derived using machine learning.利用机器学习得出的原发性和转移性癌症中的p53转录特征。
Front Genet. 2022 Aug 29;13:987238. doi: 10.3389/fgene.2022.987238. eCollection 2022.
5
Prostate cancer in omics era.组学时代的前列腺癌
Cancer Cell Int. 2022 Sep 5;22(1):274. doi: 10.1186/s12935-022-02691-y.
6
Integrative Pan-Cancer Analysis of KIF15 Reveals Its Diagnosis and Prognosis Value in Nasopharyngeal Carcinoma.KIF15的综合泛癌分析揭示了其在鼻咽癌中的诊断和预后价值。
Front Oncol. 2022 Mar 11;12:772816. doi: 10.3389/fonc.2022.772816. eCollection 2022.
基于蛋白质组学和转录组学整合数据的人类分泌蛋白组的全面鉴定与表征
Front Cell Dev Biol. 2019 Nov 21;7:299. doi: 10.3389/fcell.2019.00299. eCollection 2019.
4
The human secretome.人类分泌组。
Sci Signal. 2019 Nov 26;12(609):eaaz0274. doi: 10.1126/scisignal.aaz0274.
5
Identification of the expressome by machine learning on omics data.基于组学数据的机器学习鉴定表达谱。
Proc Natl Acad Sci U S A. 2019 Sep 3;116(36):18119-18125. doi: 10.1073/pnas.1813645116. Epub 2019 Aug 16.
6
Progression of the role of CRYAB in signaling pathways and cancers.CRYAB在信号通路和癌症中的作用进展。
Onco Targets Ther. 2019 May 30;12:4129-4139. doi: 10.2147/OTT.S201799. eCollection 2019.
7
MLSeq: Machine learning interface for RNA-sequencing data.MLSeq:用于 RNA-seq 数据的机器学习接口。
Comput Methods Programs Biomed. 2019 Jul;175:223-231. doi: 10.1016/j.cmpb.2019.04.007. Epub 2019 Apr 29.
8
A Systematic Investigation of the Malignant Functions and Diagnostic Potential of the Cancer Secretome.癌症分泌组的恶性功能与诊断潜能的系统研究
Cell Rep. 2019 Mar 5;26(10):2622-2635.e5. doi: 10.1016/j.celrep.2019.02.025.
9
Machine learning analysis of gene expression data reveals novel diagnostic and prognostic biomarkers and identifies therapeutic targets for soft tissue sarcomas.基于基因表达数据的机器学习分析揭示了软组织肉瘤的新型诊断和预后生物标志物,并确定了治疗靶点。
PLoS Comput Biol. 2019 Feb 20;15(2):e1006826. doi: 10.1371/journal.pcbi.1006826. eCollection 2019 Feb.
10
STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.STRING v11:具有增强覆盖范围的蛋白质-蛋白质相互作用网络,支持在全基因组实验数据集的功能发现。
Nucleic Acids Res. 2019 Jan 8;47(D1):D607-D613. doi: 10.1093/nar/gky1131.