• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用极简深度学习从差异基因表达水平识别结直肠癌亚型

Colorectal cancer subtype identification from differential gene expression levels using minimalist deep learning.

作者信息

Li Shaochuan, Yang Yuning, Wang Xin, Li Jun, Yu Jun, Li Xiangtao, Wong Ka-Chun

机构信息

Department of Information Science and Technology, Northeast Normal University, Changchun, Jilin, China.

Department of Surgery, Chinese University of Hong Kong, Hong Kong SAR, China.

出版信息

BioData Min. 2022 Apr 23;15(1):12. doi: 10.1186/s13040-022-00295-w.

DOI:10.1186/s13040-022-00295-w
PMID:35461302
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9034628/
Abstract

BACKGROUND

Cancer molecular subtyping plays a critical role in individualized patient treatment. In previous studies, high-throughput gene expression signature-based methods have been proposed to identify cancer subtypes. Unfortunately, the existing ones suffer from the curse of dimensionality, data sparsity, and computational deficiency.

METHODS

To address those problems, we propose a computational framework for colorectal cancer subtyping without any exploitation in model complexity and generality. A supervised learning framework based on deep learning (DeepCSD) is proposed to identify cancer subtypes. Specifically, based on the differentially expressed genes under cancer consensus molecular subtyping, we design a minimalist feed-forward neural network to capture the distinct molecular features in different cancer subtypes. To mitigate the overfitting phenomenon of deep learning as much as possible, L and L regularization and dropout layers are added.

RESULTS

For demonstrating the effectiveness of DeepCSD, we compared it with other methods including Random Forest (RF), Deep forest (gcForest), support vector machine (SVM), XGBoost, and DeepCC on eight independent colorectal cancer datasets. The results reflect that DeepCSD can achieve superior performance over other algorithms. In addition, gene ontology enrichment and pathology analysis are conducted to reveal novel insights into the cancer subtype identification and characterization mechanisms.

CONCLUSIONS

DeepCSD considers all subtype-specific genes as input, which is pathologically necessary for its completeness. At the same time, DeepCSD shows remarkable robustness in handling cross-platform gene expression data, achieving similar performance on both training and test data without significant model overfitting or exploitation of model complexity.

摘要

背景

癌症分子亚型分类在患者个体化治疗中起着关键作用。在以往研究中,已提出基于高通量基因表达特征的方法来识别癌症亚型。不幸的是,现有的方法存在维度灾难、数据稀疏性和计算缺陷等问题。

方法

为解决这些问题,我们提出了一种用于结直肠癌亚型分类的计算框架,而无需在模型复杂性和通用性方面进行任何优化。提出了一种基于深度学习的监督学习框架(DeepCSD)来识别癌症亚型。具体而言,基于癌症共识分子亚型下的差异表达基因,我们设计了一个极简的前馈神经网络来捕捉不同癌症亚型中的独特分子特征。为尽可能减轻深度学习的过拟合现象,添加了L1和L2正则化以及随机失活层。

结果

为证明DeepCSD的有效性,我们在八个独立的结直肠癌数据集上,将其与其他方法进行了比较,包括随机森林(RF)、深度森林(gcForest)、支持向量机(SVM)、XGBoost和DeepCC。结果表明,DeepCSD比其他算法具有更优的性能。此外,还进行了基因本体富集和病理分析,以揭示癌症亚型识别和特征化机制的新见解。

结论

DeepCSD将所有亚型特异性基因作为输入,这在病理学上对于其完整性是必要的。同时,DeepCSD在处理跨平台基因表达数据时表现出显著的稳健性,在训练数据和测试数据上均取得了相似的性能,而没有明显的模型过拟合或对模型复杂性的过度依赖。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/0b1b3e770f8d/13040_2022_295_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/9a4dfc0c62e0/13040_2022_295_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/ebda59a80302/13040_2022_295_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/3ea78afbfb2b/13040_2022_295_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/fdbaf24ba265/13040_2022_295_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/b70f3d9efb2b/13040_2022_295_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/e3aed6f62cf0/13040_2022_295_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/a452640bdfd2/13040_2022_295_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/0b1b3e770f8d/13040_2022_295_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/9a4dfc0c62e0/13040_2022_295_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/ebda59a80302/13040_2022_295_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/3ea78afbfb2b/13040_2022_295_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/fdbaf24ba265/13040_2022_295_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/b70f3d9efb2b/13040_2022_295_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/e3aed6f62cf0/13040_2022_295_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/a452640bdfd2/13040_2022_295_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f07d/9034628/0b1b3e770f8d/13040_2022_295_Fig8_HTML.jpg

相似文献

1
Colorectal cancer subtype identification from differential gene expression levels using minimalist deep learning.利用极简深度学习从差异基因表达水平识别结直肠癌亚型
BioData Min. 2022 Apr 23;15(1):12. doi: 10.1186/s13040-022-00295-w.
2
DeepCC: a novel deep learning-based framework for cancer molecular subtype classification.DeepCC:一种基于深度学习的新型癌症分子亚型分类框架。
Oncogenesis. 2019 Aug 16;8(9):44. doi: 10.1038/s41389-019-0157-8.
3
Automated exploitation of deep learning for cancer patient stratification across multiple types.基于深度学习的癌症患者多类型分层自动化挖掘
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad654.
4
BCDForest: a boosting cascade deep forest model towards the classification of cancer subtypes based on gene expression data.BCDForest:基于基因表达数据的癌症亚型分类的提升级联深森林模型。
BMC Bioinformatics. 2018 Apr 11;19(Suppl 5):118. doi: 10.1186/s12859-018-2095-4.
5
A Contrastive-Learning-Based Deep Neural Network for Cancer Subtyping by Integrating Multi-Omics Data.基于对比学习的深度学习神经网络,通过整合多组学数据进行癌症亚型分类。
Interdiscip Sci. 2024 Dec;16(4):966-975. doi: 10.1007/s12539-024-00641-y. Epub 2024 Sep 4.
6
Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.优化神经网络在医学数据集上的应用:以新生儿呼吸暂停预测为例的研究
Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.
7
Molecular Subtyping of Cancer Based on Distinguishing Co-Expression Modules and Machine Learning.基于区分共表达模块和机器学习的癌症分子分型
Front Genet. 2022 May 2;13:866005. doi: 10.3389/fgene.2022.866005. eCollection 2022.
8
Machine learning assisted analysis of breast cancer gene expression profiles reveals novel potential prognostic biomarkers for triple-negative breast cancer.机器学习辅助分析乳腺癌基因表达谱揭示了三阴性乳腺癌新的潜在预后生物标志物。
Comput Struct Biotechnol J. 2022 Mar 24;20:1618-1631. doi: 10.1016/j.csbj.2022.03.019. eCollection 2022.
9
A laminar augmented cascading flexible neural forest model for classification of cancer subtypes based on gene expression data.基于基因表达数据的癌症亚型分类的层增级联柔性神经森林模型。
BMC Bioinformatics. 2021 Oct 2;22(1):475. doi: 10.1186/s12859-021-04391-2.
10
Machine Learning-Based Identification of Colon Cancer Candidate Diagnostics Genes.基于机器学习的结肠癌候选诊断基因识别
Biology (Basel). 2022 Feb 25;11(3):365. doi: 10.3390/biology11030365.

引用本文的文献

1
Amogel: a multi-omics classification framework using associative graph neural networks with prior knowledge for biomarker identification.Amogel:一种使用具有先验知识的关联图神经网络进行生物标志物识别的多组学分类框架。
BMC Bioinformatics. 2025 Mar 28;26(1):94. doi: 10.1186/s12859-025-06111-6.
2
The role of deep learning in diagnosing colorectal cancer.深度学习在结直肠癌诊断中的作用。
Prz Gastroenterol. 2023;18(3):266-273. doi: 10.5114/pg.2023.129494. Epub 2023 Jul 17.
3
Deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified.

本文引用的文献

1
DeepCC: a novel deep learning-based framework for cancer molecular subtype classification.DeepCC:一种基于深度学习的新型癌症分子亚型分类框架。
Oncogenesis. 2019 Aug 16;8(9):44. doi: 10.1038/s41389-019-0157-8.
2
Metascape provides a biologist-oriented resource for the analysis of systems-level datasets.Metascape 为系统水平数据集的分析提供了面向生物学家的资源。
Nat Commun. 2019 Apr 3;10(1):1523. doi: 10.1038/s41467-019-09234-6.
3
Machine learning predicts individual cancer patient responses to therapeutic drugs with high accuracy.
深度学习为解释乳腺癌亚型如何分类生成定制的逻辑回归模型。
PLoS One. 2023 May 22;18(5):e0286072. doi: 10.1371/journal.pone.0286072. eCollection 2023.
机器学习可以高精度地预测个体癌症患者对治疗药物的反应。
Sci Rep. 2018 Nov 6;8(1):16444. doi: 10.1038/s41598-018-34753-5.
4
Colorectal Cancer Consensus Molecular Subtypes Translated to Preclinical Models Uncover Potentially Targetable Cancer Cell Dependencies.结直肠癌共识分子亚型转化为临床前模型揭示了潜在可靶向的癌细胞依赖性。
Clin Cancer Res. 2018 Feb 15;24(4):794-806. doi: 10.1158/1078-0432.CCR-17-1234. Epub 2017 Dec 14.
5
Limits to Personalized Cancer Medicine.个性化癌症医学的局限性。
N Engl J Med. 2016 Sep 29;375(13):1289-94. doi: 10.1056/NEJMsb1607705.
6
Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features.通过全自动显微镜病理图像特征预测非小细胞肺癌预后。
Nat Commun. 2016 Aug 16;7:12474. doi: 10.1038/ncomms12474.
7
Adjuvant chemotherapy and relative survival of patients with stage II colon cancer - A EURECCA international comparison between the Netherlands, Denmark, Sweden, England, Ireland, Belgium, and Lithuania.辅助化疗与 II 期结肠癌患者的相对生存率 - 荷兰、丹麦、瑞典、英国、爱尔兰、比利时和立陶宛之间的 EURECCA 国际比较。
Eur J Cancer. 2016 Aug;63:110-7. doi: 10.1016/j.ejca.2016.04.017. Epub 2016 Jun 11.
8
The consensus molecular subtypes of colorectal cancer.结直肠癌的共识分子亚型
Nat Med. 2015 Nov;21(11):1350-6. doi: 10.1038/nm.3967. Epub 2015 Oct 12.
9
Multi-level gene/MiRNA feature selection using deep belief nets and active learning.使用深度信念网络和主动学习的多层次基因/微小RNA特征选择
Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:3957-60. doi: 10.1109/EMBC.2014.6944490.
10
Support vector machine-based nomogram predicts postoperative distant metastasis for patients with oesophageal squamous cell carcinoma.基于支持向量机的列线图预测食管鳞癌患者术后远处转移
Br J Cancer. 2013 Sep 3;109(5):1109-16. doi: 10.1038/bjc.2013.379. Epub 2013 Aug 13.