利用表观基因组数据进行人类疾病检测、亚型分类和治疗反应预测的深度学习

Deep Learning for Human Disease Detection, Subtype Classification, and Treatment Response Prediction Using Epigenomic Data.

作者信息

Nguyen Thi Mai, Kim Nackhyoung, Kim Da Hae, Le Hoang Long, Piran Md Jalil, Um Soo-Jong, Kim Jin Hee

机构信息

Department of Integrative Bioscience & Biotechnology, Sejong University, 209 Neungdong-ro, Gwangjin-gu, Seoul 05006, Korea.

Department of Computer Science & Engineering, Sejong University, 209 Neungdong-ro, Gwangjin-gu, Seoul 05006, Korea.

出版信息

Biomedicines. 2021 Nov 20;9(11):1733. doi: 10.3390/biomedicines9111733.

DOI:10.3390/biomedicines9111733

PMID:34829962

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8615388/

Abstract

Deep learning (DL) is a distinct class of machine learning that has achieved first-class performance in many fields of study. For epigenomics, the application of DL to assist physicians and scientists in human disease-relevant prediction tasks has been relatively unexplored until very recently. In this article, we critically review published studies that employed DL models to predict disease detection, subtype classification, and treatment responses, using epigenomic data. A comprehensive search on PubMed, Scopus, Web of Science, Google Scholar, and arXiv.org was performed following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. Among 1140 initially identified publications, we included 22 articles in our review. DNA methylation and RNA-sequencing data are most frequently used to train the predictive models. The reviewed models achieved a high accuracy ranged from 88.3% to 100.0% for disease detection tasks, from 69.5% to 97.8% for subtype classification tasks, and from 80.0% to 93.0% for treatment response prediction tasks. We generated a workflow to develop a predictive model that encompasses all steps from first defining human disease-related tasks to finally evaluating model performance. DL holds promise for transforming epigenomic big data into valuable knowledge that will enhance the development of translational epigenomics.

摘要

深度学习（DL）是机器学习中的一个独特类别，在许多研究领域都取得了一流的表现。对于表观基因组学而言，直到最近，深度学习在协助医生和科学家进行人类疾病相关预测任务方面的应用仍相对未被探索。在本文中，我们批判性地回顾了已发表的研究，这些研究使用表观基因组数据，采用深度学习模型来预测疾病检测、亚型分类和治疗反应。按照系统评价和荟萃分析的首选报告项目指南，我们在PubMed、Scopus、科学网、谷歌学术和arXiv.org上进行了全面检索。在最初识别出的1140篇出版物中，我们纳入了22篇文章进行综述。DNA甲基化和RNA测序数据最常用于训练预测模型。综述中的模型在疾病检测任务中的准确率高达88.3%至100.0%，在亚型分类任务中的准确率为69.5%至97.8%，在治疗反应预测任务中的准确率为80.0%至93.0%。我们生成了一个开发预测模型的工作流程，涵盖从最初定义人类疾病相关任务到最终评估模型性能的所有步骤。深度学习有望将表观基因组大数据转化为有价值的知识，这将促进转化表观基因组学的发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a1/8615388/e5b5bc3ad5aa/biomedicines-09-01733-g001.jpg

相似文献

Deep Learning for Human Disease Detection, Subtype Classification, and Treatment Response Prediction Using Epigenomic Data.利用表观基因组数据进行人类疾病检测、亚型分类和治疗反应预测的深度学习

Biomedicines. 2021 Nov 20;9(11):1733. doi: 10.3390/biomedicines9111733.

Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测：机器学习在 1 型糖尿病中的应用。

Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.

Deep learning algorithms for detection of diabetic retinopathy in retinal fundus photographs: A systematic review and meta-analysis.深度学习算法在眼底视网膜照片糖尿病性视网膜病变检测中的应用：系统评价和荟萃分析。

Comput Methods Programs Biomed. 2020 Jul;191:105320. doi: 10.1016/j.cmpb.2020.105320. Epub 2020 Jan 16.

Deep learning in periodontology and oral implantology: A scoping review.深度学习在牙周病学和口腔种植学中的应用：范围综述。

J Periodontal Res. 2022 Oct;57(5):942-951. doi: 10.1111/jre.13037. Epub 2022 Jul 20.

Evaluating characteristics of PROSPERO records as predictors of eventual publication of non-Cochrane systematic reviews: a meta-epidemiological study protocol.评价 PROSPERO 记录特征对非 Cochrane 系统评价最终发表的预测作用：一项meta 流行病学研究方案。

Syst Rev. 2018 Mar 9;7(1):43. doi: 10.1186/s13643-018-0709-6.

What Are the Applications and Limitations of Artificial Intelligence for Fracture Detection and Classification in Orthopaedic Trauma Imaging? A Systematic Review.人工智能在骨科创伤影像中骨折检测和分类的应用及局限性：系统评价。

Clin Orthop Relat Res. 2019 Nov;477(11):2482-2491. doi: 10.1097/CORR.0000000000000848.

Clinical Decision-Support Systems for Detection of Systemic Inflammatory Response Syndrome, Sepsis, and Septic Shock in Critically Ill Patients: A Systematic Review.用于检测重症患者全身炎症反应综合征、脓毒症和脓毒性休克的临床决策支持系统：一项系统评价

Methods Inf Med. 2019 Dec;58(S 02):e43-e57. doi: 10.1055/s-0039-1695717. Epub 2019 Sep 9.

A deep learning approach to automate whole-genome prediction of diverse epigenomic modifications in plants.深度学习方法自动化植物中多种表观遗传修饰的全基因组预测。

New Phytol. 2021 Oct;232(2):880-897. doi: 10.1111/nph.17630. Epub 2021 Aug 12.

Deep learning for drug response prediction in cancer.深度学习在癌症药物反应预测中的应用。

Brief Bioinform. 2021 Jan 18;22(1):360-379. doi: 10.1093/bib/bbz171.

Comprehensive review of deep learning in orthopaedics: Applications, challenges, trustworthiness, and fusion.深度学习在骨科领域的综合综述：应用、挑战、可信度和融合。

Artif Intell Med. 2024 Sep;155:102935. doi: 10.1016/j.artmed.2024.102935. Epub 2024 Jul 25.

引用本文的文献

Current Bioinformatics Tools in Precision Oncology.精准肿瘤学中的当前生物信息学工具

MedComm (2020). 2025 Jul 9;6(7):e70243. doi: 10.1002/mco2.70243. eCollection 2025 Jul.

Subtypes detection of papillary thyroid cancer from methylation assay via Deep Neural Network.通过深度神经网络从甲基化检测中进行甲状腺乳头状癌亚型检测。

Comput Struct Biotechnol J. 2025 Apr 29;27:1809-1817. doi: 10.1016/j.csbj.2025.04.034. eCollection 2025.

Advancing precision oncology with AI-powered genomic analysis.通过人工智能驱动的基因组分析推动精准肿瘤学发展。

Front Pharmacol. 2025 Apr 30;16:1591696. doi: 10.3389/fphar.2025.1591696. eCollection 2025.

P.O.L.A.R. Star: A New Framework Developed and Applied by One Mid-Sized Pharmaceutical Company to Drive Digital Transformation in R&D.P.O.L.A.R. 之星：一家中型制药公司开发并应用的新框架，推动研发领域的数字化转型。

Pharmaceut Med. 2024 Sep;38(5):343-353. doi: 10.1007/s40290-024-00533-y. Epub 2024 Aug 9.

A Novel Image Processing Method for Obtaining an Accurate Three-Dimensional Profile of Red Blood Cells in Digital Holographic Microscopy.一种用于在数字全息显微镜中获取红细胞精确三维轮廓的新型图像处理方法。

Biomimetics (Basel). 2023 Nov 22;8(8):563. doi: 10.3390/biomimetics8080563.

Machine Learning Methods for Cancer Classification Using Gene Expression Data: A Review.使用基因表达数据进行癌症分类的机器学习方法：综述

Bioengineering (Basel). 2023 Jan 28;10(2):173. doi: 10.3390/bioengineering10020173.

Prediction of Gastric Cancer-Related Genes Based on the Graph Transformer Network.基于图变换器网络的胃癌相关基因预测

Front Oncol. 2022 Jun 30;12:902616. doi: 10.3389/fonc.2022.902616. eCollection 2022.

Machine learning diagnosis by immunoglobulin N-glycan signatures for precision diagnosis of urological diseases.机器学习通过免疫球蛋白 N-糖基化特征进行诊断，以实现泌尿系统疾病的精准诊断。

Cancer Sci. 2022 Jul;113(7):2434-2445. doi: 10.1111/cas.15395. Epub 2022 May 25.

本文引用的文献

Cancer: A deep learning-based pan-cancer metastasis prediction model developed using multi-omics data.癌症：一种使用多组学数据开发的基于深度学习的泛癌转移预测模型。

Comput Struct Biotechnol J. 2021 Aug 9;19:4404-4411. doi: 10.1016/j.csbj.2021.08.006. eCollection 2021.

Machine Learning in Epigenomics: Insights into Cancer Biology and Medicine.机器学习在表观基因组学中的应用：癌症生物学和医学的新视角。

Biochim Biophys Acta Rev Cancer. 2021 Dec;1876(2):188588. doi: 10.1016/j.bbcan.2021.188588. Epub 2021 Jul 7.

Zfp57 inactivation illustrates the role of ICR methylation in imprinted gene expression during neural differentiation of mouse ESCs.Zfp57 失活说明了 ICR 甲基化在小鼠胚胎干细胞神经分化过程中印迹基因表达中的作用。

Sci Rep. 2021 Jul 5;11(1):13802. doi: 10.1038/s41598-021-93297-3.

Role of Regulatory Non-Coding RNAs in Aggressive Thyroid Cancer: Prospective Applications of Neural Network Analysis.调控性非编码 RNA 在侵袭性甲状腺癌中的作用：神经网络分析的潜在应用。

Molecules. 2021 May 19;26(10):3022. doi: 10.3390/molecules26103022.

Artificial Intelligence in Epigenetic Studies: Shedding Light on Rare Diseases.表观遗传学研究中的人工智能：揭示罕见疾病

Front Mol Biosci. 2021 May 5;8:648012. doi: 10.3389/fmolb.2021.648012. eCollection 2021.

Interpretation of deep learning in genomics and epigenomics.深度学习在基因组学和表观基因组学中的应用。

Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa177.

Histone dynamics mediate DNA unwrapping and sliding in nucleosomes.组蛋白动力学介导核小体中 DNA 的解缠绕和滑动。

Nat Commun. 2021 Apr 22;12(1):2387. doi: 10.1038/s41467-021-22636-9.

Integrated multi-omics analysis of ovarian cancer using variational autoencoders.基于变分自动编码器的卵巢癌多组学综合分析。

Sci Rep. 2021 Mar 18;11(1):6265. doi: 10.1038/s41598-021-85285-4.

Deep learning in systems medicine.系统医学中的深度学习。

Brief Bioinform. 2021 Mar 22;22(2):1543-1559. doi: 10.1093/bib/bbaa237.

Prediction of survival and recurrence in patients with pancreatic cancer by integrating multi-omics data.通过整合多组学数据预测胰腺癌患者的生存和复发情况。

Sci Rep. 2020 Nov 3;10(1):18951. doi: 10.1038/s41598-020-76025-1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用表观基因组数据进行人类疾病检测、亚型分类和治疗反应预测的深度学习

Deep Learning for Human Disease Detection, Subtype Classification, and Treatment Response Prediction Using Epigenomic Data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献