• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从 DNA 甲基化和拷贝数变异估计基因表达:一种用于多组学整合的深度学习回归模型。

Estimating gene expression from DNA methylation and copy number variation: A deep learning regression model for multi-omics integration.

机构信息

A. K. Choudhury School of Information Technology, University of Calcutta, JD-2, Sector III, Salt Lake City, Kolkata 700106, India.

Novo Nordisk Research Center Seattle, Inc., 530 Fairview Ave N # 5000, Seattle, WA 98109, United States.

出版信息

Genomics. 2020 Jul;112(4):2833-2841. doi: 10.1016/j.ygeno.2020.03.021. Epub 2020 Mar 29.

DOI:10.1016/j.ygeno.2020.03.021
PMID:32234433
Abstract

Gene expression analysis plays a significant role for providing molecular insights in cancer. Various genetic and epigenetic factors (being dealt under multi-omics) affect gene expression giving rise to cancer phenotypes. A recent growth in understanding of multi-omics seems to provide a resource for integration in interdisciplinary biology since they altogether can draw the comprehensive picture of an organism's developmental and disease biology in cancers. Such large scale multi-omics data can be obtained from public consortium like The Cancer Genome Atlas (TCGA) and several other platforms. Integrating these multi-omics data from varied platforms is still challenging due to high noise and sensitivity of the platforms used. Currently, a robust integrative predictive model to estimate gene expression from these genetic and epigenetic data is lacking. In this study, we have developed a deep learning-based predictive model using Deep Denoising Auto-encoder (DDAE) and Multi-layer Perceptron (MLP) that can quantitatively capture how genetic and epigenetic alterations correlate with directionality of gene expression for liver hepatocellular carcinoma (LIHC). The DDAE used in the study has been trained to extract significant features from the input omics data to estimate the gene expression. These features have then been used for back-propagation learning by the multilayer perceptron for the task of regression and classification. We have benchmarked the proposed model against state-of-the-art regression models. Finally, the deep learning-based integration model has been evaluated for its disease classification capability, where an accuracy of 95.1% has been obtained.

摘要

基因表达分析在提供癌症分子见解方面起着重要作用。各种遗传和表观遗传因素(在多组学中处理)影响基因表达,导致癌症表型。对多组学的理解最近有了显著的增长,似乎为跨学科生物学的整合提供了资源,因为它们可以共同描绘癌症中生物体发育和疾病生物学的综合图景。这种大规模的多组学数据可以从公共联盟(如癌症基因组图谱(TCGA))和其他几个平台获得。由于所使用的平台具有高噪声和敏感性,因此整合来自不同平台的这些多组学数据仍然具有挑战性。目前,缺乏从这些遗传和表观遗传数据中估计基因表达的稳健综合预测模型。在这项研究中,我们使用深度去噪自动编码器(DDAE)和多层感知机(MLP)开发了一种基于深度学习的预测模型,该模型可以定量捕捉遗传和表观遗传改变如何与肝肝细胞癌(LIHC)的基因表达方向相关。研究中使用的 DDAE 经过训练,可以从输入的组学数据中提取重要特征,以估计基因表达。然后,这些特征通过多层感知机用于回归和分类任务的反向传播学习。我们对提出的模型进行了基准测试,以评估其回归模型的性能。最后,对基于深度学习的集成模型进行了疾病分类能力的评估,获得了 95.1%的准确率。

相似文献

1
Estimating gene expression from DNA methylation and copy number variation: A deep learning regression model for multi-omics integration.从 DNA 甲基化和拷贝数变异估计基因表达:一种用于多组学整合的深度学习回归模型。
Genomics. 2020 Jul;112(4):2833-2841. doi: 10.1016/j.ygeno.2020.03.021. Epub 2020 Mar 29.
2
DeFusion: a denoised network regularization framework for multi-omics integration.去融合:一种用于多组学整合的去噪网络正则化框架。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab057.
3
Integrating multi-omics data by learning modality invariant representations for improved prediction of overall survival of cancer.通过学习模态不变表示来整合多组学数据,以提高癌症总体生存预测的准确性。
Methods. 2021 May;189:74-85. doi: 10.1016/j.ymeth.2020.07.008. Epub 2020 Aug 5.
4
Deep learning based feature-level integration of multi-omics data for breast cancer patients survival analysis.基于深度学习的多组学生物标志物数据特征层融合在乳腺癌患者生存分析中的应用。
BMC Med Inform Decis Mak. 2020 Sep 15;20(1):225. doi: 10.1186/s12911-020-01225-8.
5
Classification of early and late stage liver hepatocellular carcinoma patients from their genomics and epigenomics profiles.从基因组学和表观基因组学特征对早期和晚期肝癌患者进行分类。
PLoS One. 2019 Sep 6;14(9):e0221476. doi: 10.1371/journal.pone.0221476. eCollection 2019.
6
A multi-omics data simulator for complex disease studies and its application to evaluate multi-omics data analysis methods for disease classification.用于复杂疾病研究的多组学数据模拟器及其在评估疾病分类的多组学数据分析方法中的应用。
Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz045.
7
Multi-omics Analysis of Primary Cell Culture Models Reveals Genetic and Epigenetic Basis of Intratumoral Phenotypic Diversity.多组学分析原发性细胞培养模型揭示了肿瘤内表型多样性的遗传和表观遗传基础。
Genomics Proteomics Bioinformatics. 2019 Dec;17(6):576-589. doi: 10.1016/j.gpb.2018.07.008. Epub 2020 Mar 20.
8
Deep Learning-Based Multi-Omics Integration Robustly Predicts Survival in Liver Cancer.基于深度学习的多组学整合可稳健预测肝癌患者的生存情况。
Clin Cancer Res. 2018 Mar 15;24(6):1248-1259. doi: 10.1158/1078-0432.CCR-17-0853. Epub 2017 Oct 5.
9
Min-redundancy and max-relevance multi-view feature selection for predicting ovarian cancer survival using multi-omics data.基于多组学数据预测卵巢癌生存的最小冗余最大相关性多视图特征选择。
BMC Med Genomics. 2018 Sep 14;11(Suppl 3):71. doi: 10.1186/s12920-018-0388-0.
10
Single-cell triple omics sequencing reveals genetic, epigenetic, and transcriptomic heterogeneity in hepatocellular carcinomas.单细胞三重组学测序揭示了肝细胞癌中的基因、表观遗传和转录组异质性。
Cell Res. 2016 Mar;26(3):304-19. doi: 10.1038/cr.2016.23. Epub 2016 Feb 23.

引用本文的文献

1
Multimodal analysis of cell-free DNA enhances differentiation of early-stage breast cancer from benign lesions and healthy individuals.游离DNA的多模态分析增强了早期乳腺癌与良性病变及健康个体之间的鉴别能力。
BMC Biol. 2025 Aug 20;23(1):259. doi: 10.1186/s12915-025-02371-z.
2
Towards universal modeling of transcript isoform expression levels.迈向转录本异构体表达水平的通用建模
bioRxiv. 2025 Jul 25:2025.07.21.665977. doi: 10.1101/2025.07.21.665977.
3
Identification of a 10-species microbial signature of inflammatory bowel disease by machine learning and external validation.
通过机器学习和外部验证识别炎症性肠病的10种微生物特征
Cell Regen. 2025 Jul 14;14(1):32. doi: 10.1186/s13619-025-00246-w.
4
DeepMethyGene: a deep-learning model to predict gene expression using DNA methylations.深度甲基化基因:一种利用DNA甲基化预测基因表达的深度学习模型。
BMC Bioinformatics. 2025 Apr 8;26(1):99. doi: 10.1186/s12859-025-06115-2.
5
Multi-Omics Integration for Liver Cancer Using Regression Analysis.使用回归分析对肝癌进行多组学整合
Curr Issues Mol Biol. 2024 Apr 19;46(4):3551-3562. doi: 10.3390/cimb46040222.
6
Application of informatics in cancer research and clinical practice: Opportunities and challenges.信息学在癌症研究与临床实践中的应用:机遇与挑战。
Cancer Innov. 2022 Jun 15;1(1):80-91. doi: 10.1002/cai2.9. eCollection 2022 Jun.
7
Application of deep learning in cancer epigenetics through DNA methylation analysis.深度学习在癌症表观遗传学中通过 DNA 甲基化分析的应用。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad411.
8
Integrated analysis of human DNA methylation, gene expression, and genomic variation in iMETHYL database using kernel tensor decomposition-based unsupervised feature extraction.基于核张量分解的无监督特征提取方法在 iMETHYL 数据库中对人类 DNA 甲基化、基因表达和基因组变异的综合分析。
PLoS One. 2023 Aug 9;18(8):e0289029. doi: 10.1371/journal.pone.0289029. eCollection 2023.
9
Unsupervised neural network for single cell Multi-omics INTegration (UMINT): an application to health and disease.用于单细胞多组学整合的无监督神经网络(UMINT):在健康与疾病中的应用
Front Mol Biosci. 2023 May 24;10:1184748. doi: 10.3389/fmolb.2023.1184748. eCollection 2023.
10
Relationship between the deep features of the full-scan pathological map of mucinous gastric carcinoma and related genes based on deep learning.基于深度学习的黏液性胃癌全扫描病理图像深度特征与相关基因的关系
Heliyon. 2023 Mar 9;9(3):e14374. doi: 10.1016/j.heliyon.2023.e14374. eCollection 2023 Mar.