• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于转录组数据的肿瘤状态和组织起源分类深度学习模型。

A deep learning model to classify neoplastic state and tissue origin from transcriptomic data.

机构信息

Krembil Research Institute, University Health Network, 399 Bathurst Street, Suite 4W-449, Toronto, ON, M5T 2S8, Canada.

Division of Neurosurgery, Department of Surgery, University of Toronto, 149 College Street, 5th Floor, Toronto, ON, M5T 1P5, Canada.

出版信息

Sci Rep. 2022 Jun 11;12(1):9669. doi: 10.1038/s41598-022-13665-5.

DOI:10.1038/s41598-022-13665-5
PMID:35690622
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9188604/
Abstract

Application of deep learning methods to transcriptomic data has the potential to enhance the accuracy and efficiency of tissue classification and cell state identification. Herein, we developed a multitask deep learning model for tissue classification combining publicly available whole transcriptomic (RNA-seq) datasets of non-neoplastic, neoplastic and peri-neoplastic tissue to classify disease state, tissue origin and neoplastic subclass. RNA-seq data from a total of 10,116 patient samples processed through a common pipeline were used for model training and validation. The model achieved 99% accuracy for disease state classification (ROC-AUC of 0.98) and 97% accuracy for tissue origin (ROC-AUC of 0.99). Moreover, the model achieved an accuracy of 92% (ROC-AUC 0.95) for neoplastic subclassification. This is the first multitask deep learning algorithm developed for tissue classification employing a uniform pipeline analysis of transcriptomic data with multiple tissue classifiers. This model serves as a framework for incorporating large transcriptomic datasets across conditions to facilitate clinical diagnosis and cell-based treatment strategies.

摘要

深度学习方法在转录组数据中的应用有可能提高组织分类和细胞状态识别的准确性和效率。在此,我们开发了一种多任务深度学习模型,用于结合公开的非肿瘤、肿瘤和肿瘤周围组织的全转录组(RNA-seq)数据集进行组织分类,以分类疾病状态、组织来源和肿瘤亚类。通过通用管道处理的总计 10116 个患者样本的 RNA-seq 数据用于模型训练和验证。该模型对疾病状态分类的准确率达到 99%(ROC-AUC 为 0.98),对组织来源的准确率达到 97%(ROC-AUC 为 0.99)。此外,该模型对肿瘤亚类的准确率达到 92%(ROC-AUC 为 0.95)。这是第一个使用多组织分类器对转录组数据进行统一管道分析的多任务深度学习算法。该模型为整合跨条件的大型转录组数据集以促进临床诊断和基于细胞的治疗策略提供了一个框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8bd/9188604/afa535306351/41598_2022_13665_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8bd/9188604/07c3de14e53c/41598_2022_13665_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8bd/9188604/9071988a7999/41598_2022_13665_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8bd/9188604/c0e5f56a159b/41598_2022_13665_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8bd/9188604/79de3ab5bb2e/41598_2022_13665_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8bd/9188604/afa535306351/41598_2022_13665_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8bd/9188604/07c3de14e53c/41598_2022_13665_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8bd/9188604/9071988a7999/41598_2022_13665_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8bd/9188604/c0e5f56a159b/41598_2022_13665_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8bd/9188604/79de3ab5bb2e/41598_2022_13665_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8bd/9188604/afa535306351/41598_2022_13665_Fig5_HTML.jpg

相似文献

1
A deep learning model to classify neoplastic state and tissue origin from transcriptomic data.基于转录组数据的肿瘤状态和组织起源分类深度学习模型。
Sci Rep. 2022 Jun 11;12(1):9669. doi: 10.1038/s41598-022-13665-5.
2
Automated deep learning design for medical image classification by health-care professionals with no coding experience: a feasibility study.无编码经验的医疗保健专业人员进行医学图像分类的自动化深度学习设计:一项可行性研究。
Lancet Digit Health. 2019 Sep;1(5):e232-e242. doi: 10.1016/S2589-7500(19)30108-6. Epub 2019 Sep 5.
3
Deep learning of 2D-Restructured gene expression representations for improved low-sample therapeutic response prediction.二维重构基因表达图谱的深度学习,以提高低样本治疗反应预测。
Comput Biol Med. 2023 Sep;164:107245. doi: 10.1016/j.compbiomed.2023.107245. Epub 2023 Jul 18.
4
Augmentation of Transcriptomic Data for Improved Classification of Patients with Respiratory Diseases of Viral Origin.转录组数据的增强可改善病毒性呼吸道疾病患者的分类。
Int J Mol Sci. 2022 Feb 24;23(5):2481. doi: 10.3390/ijms23052481.
5
Automated detection and classification of shoulder arthroplasty models using deep learning.利用深度学习技术自动检测和分类肩关节炎假体模型
Skeletal Radiol. 2020 Oct;49(10):1623-1632. doi: 10.1007/s00256-020-03463-3. Epub 2020 May 15.
6
Machine learning random forest for predicting oncosomatic variant NGS analysis.机器学习随机森林预测肿瘤体细胞变异 NGS 分析。
Sci Rep. 2021 Nov 8;11(1):21820. doi: 10.1038/s41598-021-01253-y.
7
Deep Learning Applications for Predicting Pharmacological Properties of Drugs and Drug Repurposing Using Transcriptomic Data.利用转录组数据预测药物药理特性及药物再利用的深度学习应用
Mol Pharm. 2016 Jul 5;13(7):2524-30. doi: 10.1021/acs.molpharmaceut.6b00248. Epub 2016 Jun 8.
8
Computer-assisted medical image classification for early diagnosis of oral cancer employing deep learning algorithm.计算机辅助医学图像分类在口腔癌早期诊断中的应用深度学习算法。
J Cancer Res Clin Oncol. 2019 Apr;145(4):829-837. doi: 10.1007/s00432-018-02834-7. Epub 2019 Jan 3.
9
A comparative study on deep learning models for text classification of unstructured medical notes with various levels of class imbalance.深度学习模型在不同类别不平衡程度的非结构化医疗记录文本分类中的对比研究。
BMC Med Res Methodol. 2022 Jul 2;22(1):181. doi: 10.1186/s12874-022-01665-y.
10
Deep learning-based cancer survival prognosis from RNA-seq data: approaches and evaluations.基于深度学习的 RNA-seq 数据癌症生存预后:方法与评估。
BMC Med Genomics. 2020 Apr 3;13(Suppl 5):41. doi: 10.1186/s12920-020-0686-1.

引用本文的文献

1
Learning the cellular origins across cancers using single-cell chromatin landscapes.利用单细胞染色质图谱了解癌症的细胞起源。
Nat Commun. 2025 Sep 17;16(1):8301. doi: 10.1038/s41467-025-63957-3.
2
Rethinking cancer of unknown primary: from diagnostic challenge to targeted treatment.重新审视原发灶不明的癌症:从诊断挑战到靶向治疗
Nat Rev Clin Oncol. 2025 Aug 4. doi: 10.1038/s41571-025-01060-8.
3
IC2Bert: masked gene expression pretraining and supervised fine tuning for robust immune checkpoint blockade (ICB) response prediction.

本文引用的文献

1
Gradient of Developmental and Injury Response transcriptional states defines functional vulnerabilities underpinning glioblastoma heterogeneity.发育和损伤反应转录状态梯度定义了支持胶质母细胞瘤异质性的功能脆弱性。
Nat Cancer. 2021 Feb;2(2):157-173. doi: 10.1038/s43018-020-00154-9. Epub 2021 Jan 4.
2
Verifying explainability of a deep learning tissue classifier trained on RNA-seq data.验证基于 RNA-seq 数据训练的深度学习组织分类器的可解释性。
Sci Rep. 2021 Jan 29;11(1):2641. doi: 10.1038/s41598-021-81773-9.
3
Deep learning-based cross-classifications reveal conserved spatial behaviors within tumor histological images.
IC2Bert:用于强大的免疫检查点阻断(ICB)反应预测的掩码基因表达预训练和监督微调
Sci Rep. 2025 Aug 1;15(1):28044. doi: 10.1038/s41598-025-14166-x.
4
Deep Learning Approach for Biomedical Image Classification.用于生物医学图像分类的深度学习方法
J Imaging Inform Med. 2025 Jul 8. doi: 10.1007/s10278-025-01590-8.
5
A variational autoencoder trained with priors from canonical pathways increases the interpretability of transcriptome data.基于经典通路先验信息训练的变分自编码器提高了转录组数据的可解释性。
PLoS Comput Biol. 2024 Jul 3;20(7):e1011198. doi: 10.1371/journal.pcbi.1011198. eCollection 2024 Jul.
6
New techniques to identify the tissue of origin for cancer of unknown primary in the era of precision medicine: progress and challenges.精准医学时代识别不明原发癌组织来源的新技术:进展与挑战。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae028.
7
Machine learning for pan-cancer classification based on RNA sequencing data.基于RNA测序数据的全癌种分类机器学习方法
Front Mol Biosci. 2023 Nov 10;10:1285795. doi: 10.3389/fmolb.2023.1285795. eCollection 2023.
8
The practical utility of AI-assisted molecular profiling in the diagnosis and management of cancer of unknown primary: an updated review.人工智能辅助分子谱分析在不明原发癌的诊断和治疗中的实际应用:最新综述。
Virchows Arch. 2024 Feb;484(2):369-375. doi: 10.1007/s00428-023-03708-1. Epub 2023 Nov 24.
基于深度学习的跨分类揭示了肿瘤组织学图像中保守的空间行为。
Nat Commun. 2020 Dec 11;11(1):6367. doi: 10.1038/s41467-020-20030-5.
4
The role of molecular profiling in the diagnosis and management of metastatic undifferentiated cancer of unknown primary: Molecular profiling of metastatic cancer of unknown primary.分子谱分析在不明原发转移性去分化癌的诊断和治疗中的作用:不明原发转移性癌症的分子谱分析。
Semin Diagn Pathol. 2021 Nov;38(6):193-198. doi: 10.1053/j.semdp.2020.12.001. Epub 2020 Dec 4.
5
Unsupervised and supervised learning with neural network for human transcriptome analysis and cancer diagnosis.基于神经网络的无监督和监督学习在人类转录组分析和癌症诊断中的应用。
Sci Rep. 2020 Nov 5;10(1):19106. doi: 10.1038/s41598-020-75715-0.
6
Predicting Cancer Tissue-of-Origin by a Machine Learning Method Using DNA Somatic Mutation Data.使用DNA体细胞突变数据通过机器学习方法预测癌症组织起源
Front Genet. 2020 Jul 14;11:674. doi: 10.3389/fgene.2020.00674. eCollection 2020.
7
A deep learning system accurately classifies primary and metastatic cancers using passenger mutation patterns.深度学习系统通过乘客突变模式准确分类原发性和转移性癌症。
Nat Commun. 2020 Feb 5;11(1):728. doi: 10.1038/s41467-019-13825-8.
8
Dissecting transcriptomic signatures of neuronal differentiation and maturation using iPSCs.利用 iPS 细胞解析神经分化和成熟的转录组特征。
Nat Commun. 2020 Jan 23;11(1):462. doi: 10.1038/s41467-019-14266-z.
9
DeePathology: Deep Multi-Task Learning for Inferring Molecular Pathology from Cancer Transcriptome.DeePathology:从癌症转录组推断分子病理学的深度多任务学习。
Sci Rep. 2019 Nov 11;9(1):16526. doi: 10.1038/s41598-019-52937-5.
10
Deep learning-based classification of mesothelioma improves prediction of patient outcome.基于深度学习的间皮瘤分类提高了患者预后的预测能力。
Nat Med. 2019 Oct;25(10):1519-1525. doi: 10.1038/s41591-019-0583-3. Epub 2019 Oct 7.