• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepCC:一种基于深度学习的新型癌症分子亚型分类框架。

DeepCC: a novel deep learning-based framework for cancer molecular subtype classification.

作者信息

Gao Feng, Wang Wei, Tan Miaomiao, Zhu Lina, Zhang Yuchen, Fessler Evelyn, Vermeulen Louis, Wang Xin

机构信息

Department of Biomedical Sciences, City University of Hong Kong, Hong Kong SAR, China.

Department of Colorectal Surgery, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.

出版信息

Oncogenesis. 2019 Aug 16;8(9):44. doi: 10.1038/s41389-019-0157-8.

DOI:10.1038/s41389-019-0157-8
PMID:31420533
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6697729/
Abstract

Molecular subtyping of cancer is a critical step towards more individualized therapy and provides important biological insights into cancer heterogeneity. Although gene expression signature-based classification has been widely demonstrated to be an effective approach in the last decade, the widespread implementation has long been limited by platform differences, batch effects, and the difficulty to classify individual patient samples. Here, we describe a novel supervised cancer classification framework, deep cancer subtype classification (DeepCC), based on deep learning of functional spectra quantifying activities of biological pathways. In two case studies about colorectal and breast cancer classification, DeepCC classifiers and DeepCC single sample predictors both achieved overall higher sensitivity, specificity, and accuracy compared with other widely used classification methods such as random forests (RF), support vector machine (SVM), gradient boosting machine (GBM), and multinomial logistic regression algorithms. Simulation analysis based on random subsampling of genes demonstrated the robustness of DeepCC to missing data. Moreover, deep features learned by DeepCC captured biological characteristics associated with distinct molecular subtypes, enabling more compact within-subtype distribution and between-subtype separation of patient samples, and therefore greatly reduce the number of unclassifiable samples previously. In summary, DeepCC provides a novel cancer classification framework that is platform independent, robust to missing data, and can be used for single sample prediction facilitating clinical implementation of cancer molecular subtyping.

摘要

癌症的分子亚型分类是迈向更个体化治疗的关键一步,并为癌症异质性提供了重要的生物学见解。尽管基于基因表达特征的分类在过去十年中已被广泛证明是一种有效的方法,但长期以来,其广泛应用一直受到平台差异、批次效应以及难以对个体患者样本进行分类的限制。在此,我们描述了一种基于深度学习功能谱来量化生物途径活性的新型监督式癌症分类框架——深度癌症亚型分类(DeepCC)。在两项关于结直肠癌和乳腺癌分类的案例研究中,与其他广泛使用的分类方法(如随机森林(RF)、支持向量机(SVM)、梯度提升机(GBM)和多项逻辑回归算法)相比,DeepCC分类器和DeepCC单样本预测器均实现了总体更高的敏感性、特异性和准确性。基于基因随机二次抽样的模拟分析证明了DeepCC对缺失数据的稳健性。此外,DeepCC学习到的深度特征捕捉到了与不同分子亚型相关的生物学特征,使患者样本在亚型内分布更紧凑、亚型间分离更明显,从而大大减少了之前无法分类的样本数量。总之,DeepCC提供了一种新型癌症分类框架,该框架与平台无关,对缺失数据具有稳健性,可用于单样本预测,有助于癌症分子亚型分类的临床应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b67/6697729/9aa63a8548c2/41389_2019_157_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b67/6697729/7ce719867d50/41389_2019_157_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b67/6697729/b6a816f94377/41389_2019_157_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b67/6697729/87a263a9ed52/41389_2019_157_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b67/6697729/9aa63a8548c2/41389_2019_157_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b67/6697729/7ce719867d50/41389_2019_157_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b67/6697729/b6a816f94377/41389_2019_157_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b67/6697729/87a263a9ed52/41389_2019_157_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b67/6697729/9aa63a8548c2/41389_2019_157_Fig4_HTML.jpg

相似文献

1
DeepCC: a novel deep learning-based framework for cancer molecular subtype classification.DeepCC:一种基于深度学习的新型癌症分子亚型分类框架。
Oncogenesis. 2019 Aug 16;8(9):44. doi: 10.1038/s41389-019-0157-8.
2
Colorectal cancer subtype identification from differential gene expression levels using minimalist deep learning.利用极简深度学习从差异基因表达水平识别结直肠癌亚型
BioData Min. 2022 Apr 23;15(1):12. doi: 10.1186/s13040-022-00295-w.
3
Urban Tree Species Classification Using a WorldView-2/3 and LiDAR Data Fusion Approach and Deep Learning.利用 WorldView-2/3 和 LiDAR 数据融合方法及深度学习进行城市树种分类
Sensors (Basel). 2019 Mar 14;19(6):1284. doi: 10.3390/s19061284.
4
Integration of Random Forest Classifiers and Deep Convolutional Neural Networks for Classification and Biomolecular Modeling of Cancer Driver Mutations.随机森林分类器与深度卷积神经网络的集成用于癌症驱动突变的分类和生物分子建模
Front Mol Biosci. 2019 Jun 11;6:44. doi: 10.3389/fmolb.2019.00044. eCollection 2019.
5
Molecular Subtyping of Cancer Based on Distinguishing Co-Expression Modules and Machine Learning.基于区分共表达模块和机器学习的癌症分子分型
Front Genet. 2022 May 2;13:866005. doi: 10.3389/fgene.2022.866005. eCollection 2022.
6
A hierarchical integration deep flexible neural forest framework for cancer subtype classification by integrating multi-omics data.一种通过整合多组学数据进行癌症亚型分类的层次化集成深度灵活神经森林框架。
BMC Bioinformatics. 2019 Oct 28;20(1):527. doi: 10.1186/s12859-019-3116-7.
7
Cancer survival classification using integrated data sets and intermediate information.基于整合数据集和中间信息的癌症生存分类。
Artif Intell Med. 2014 Sep;62(1):23-31. doi: 10.1016/j.artmed.2014.06.003. Epub 2014 Jun 21.
8
Prediction of lung cancer patient survival via supervised machine learning classification techniques.通过监督机器学习分类技术预测肺癌患者的生存情况。
Int J Med Inform. 2017 Dec;108:1-8. doi: 10.1016/j.ijmedinf.2017.09.013. Epub 2017 Sep 25.
9
MLSeq: Machine learning interface for RNA-sequencing data.MLSeq:用于 RNA-seq 数据的机器学习接口。
Comput Methods Programs Biomed. 2019 Jul;175:223-231. doi: 10.1016/j.cmpb.2019.04.007. Epub 2019 Apr 29.
10
Impact of Machine Learning With Multiparametric Magnetic Resonance Imaging of the Breast for Early Prediction of Response to Neoadjuvant Chemotherapy and Survival Outcomes in Breast Cancer Patients.机器学习联合乳腺多参数磁共振成像对乳腺癌新辅助化疗早期疗效及生存预后评估的影响。
Invest Radiol. 2019 Feb;54(2):110-117. doi: 10.1097/RLI.0000000000000518.

引用本文的文献

1
Knowledge-Informed Machine Learning for Cancer Diagnosis and Prognosis: A Review.用于癌症诊断和预后的知识驱动型机器学习综述
IEEE Trans Autom Sci Eng. 2025;22:10008-10028. doi: 10.1109/tase.2024.3515839. Epub 2024 Dec 18.
2
Integrated transcriptomic and functional modeling reveals AKT and mTOR synergy in colorectal cancer.整合转录组学和功能建模揭示了结直肠癌中AKT和mTOR的协同作用。
Sci Rep. 2025 Jul 31;15(1):26643. doi: 10.1038/s41598-025-08649-0.
3
Novel Lysosomal-Associated Transmembrane Protein 4B-Positive Stem-Like Cell Subpopulation Characterizes High-Risk Colorectal Cancer Subtypes.

本文引用的文献

1
Genome-wide Discovery and Identification of a Novel miRNA Signature for Recurrence Prediction in Stage II and III Colorectal Cancer.全基因组发现和鉴定用于预测 II 期和 III 期结直肠癌复发的新型 miRNA 标志物。
Clin Cancer Res. 2018 Aug 15;24(16):3867-3877. doi: 10.1158/1078-0432.CCR-17-3236. Epub 2018 Mar 7.
2
Large scale tissue histopathology image classification, segmentation, and visualization via deep convolutional activation features.通过深度卷积激活特征进行大规模组织病理图像分类、分割和可视化
BMC Bioinformatics. 2017 May 26;18(1):281. doi: 10.1186/s12859-017-1685-x.
3
DeepChrome: deep-learning for predicting gene expression from histone modifications.
新型溶酶体相关跨膜蛋白4B阳性的干细胞样细胞亚群可表征高危结直肠癌亚型。
MedComm (2020). 2025 Jul 13;6(7):e70284. doi: 10.1002/mco2.70284. eCollection 2025 Jul.
4
MLOmics: Cancer Multi-Omics Database for Machine Learning.MLOmics:用于机器学习的癌症多组学数据库。
Sci Data. 2025 May 30;12(1):913. doi: 10.1038/s41597-025-05235-x.
5
Strategies to include prior knowledge in omics analysis with deep neural networks.在组学分析中利用深度神经网络纳入先验知识的策略。
Patterns (N Y). 2025 Mar 14;6(3):101203. doi: 10.1016/j.patter.2025.101203.
6
Role of AI in empowering and redefining the oncology care landscape: perspective from a developing nation.人工智能在赋能和重新定义肿瘤护理格局中的作用:来自一个发展中国家的视角。
Front Digit Health. 2025 Mar 4;7:1550407. doi: 10.3389/fdgth.2025.1550407. eCollection 2025.
7
Immune profiling of the macroenvironment in colorectal cancer unveils systemic dysfunction and plasticity of immune cells.结直肠癌宏观环境的免疫谱分析揭示了免疫细胞的全身性功能障碍和可塑性。
Clin Transl Med. 2025 Feb;15(2):e70175. doi: 10.1002/ctm2.70175.
8
A generative deep neural network for pan-digestive tract cancer survival analysis.用于全消化道癌症生存分析的生成式深度神经网络。
BioData Min. 2025 Jan 27;18(1):9. doi: 10.1186/s13040-025-00426-z.
9
Classification of non-TCGA cancer samples to TCGA molecular subtypes using compact feature sets.使用紧凑特征集将非TCGA癌症样本分类为TCGA分子亚型。
Cancer Cell. 2025 Feb 10;43(2):195-212.e11. doi: 10.1016/j.ccell.2024.12.002. Epub 2025 Jan 2.
10
AEGAN-Pathifier: a data augmentation method to improve cancer classification for imbalanced gene expression data.AEGAN-Pathifier:一种用于改善不平衡基因表达数据的癌症分类的数据增强方法。
BMC Bioinformatics. 2024 Dec 27;25(1):392. doi: 10.1186/s12859-024-06013-z.
深度铬:用于从组蛋白修饰预测基因表达的深度学习
Bioinformatics. 2016 Sep 1;32(17):i639-i648. doi: 10.1093/bioinformatics/btw427.
4
Pathway-Informed Classification System (PICS) for Cancer Analysis Using Gene Expression Data.使用基因表达数据进行癌症分析的通路知情分类系统(PICS)
Cancer Inform. 2016 Jul 27;15:151-61. doi: 10.4137/CIN.S40088. eCollection 2016.
5
Adjuvant chemotherapy and relative survival of patients with stage II colon cancer - A EURECCA international comparison between the Netherlands, Denmark, Sweden, England, Ireland, Belgium, and Lithuania.辅助化疗与 II 期结肠癌患者的相对生存率 - 荷兰、丹麦、瑞典、英国、爱尔兰、比利时和立陶宛之间的 EURECCA 国际比较。
Eur J Cancer. 2016 Aug;63:110-7. doi: 10.1016/j.ejca.2016.04.017. Epub 2016 Jun 11.
6
A multidimensional network approach reveals microRNAs as determinants of the mesenchymal colorectal cancer subtype.一种多维网络方法揭示了微小RNA是间充质结直肠癌亚型的决定因素。
Oncogene. 2016 Nov 17;35(46):6026-6037. doi: 10.1038/onc.2016.134. Epub 2016 May 9.
7
Gene expression inference with deep learning.基于深度学习的基因表达推断
Bioinformatics. 2016 Jun 15;32(12):1832-9. doi: 10.1093/bioinformatics/btw074. Epub 2016 Feb 11.
8
The feature selection bias problem in relation to high-dimensional gene data.与高维基因数据相关的特征选择偏差问题。
Artif Intell Med. 2016 Jan;66:63-71. doi: 10.1016/j.artmed.2015.11.001. Epub 2015 Nov 14.
9
The consensus molecular subtypes of colorectal cancer.结直肠癌的共识分子亚型
Nat Med. 2015 Nov;21(11):1350-6. doi: 10.1038/nm.3967. Epub 2015 Oct 12.
10
Predicting effects of noncoding variants with deep learning-based sequence model.使用基于深度学习的序列模型预测非编码变异的影响。
Nat Methods. 2015 Oct;12(10):931-4. doi: 10.1038/nmeth.3547. Epub 2015 Aug 24.