• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 DNA 甲基化和基因表达数据的线性回归和深度学习方法在癌症中检测可靠的遗传改变。

A Linear Regression and Deep Learning Approach for Detecting Reliable Genetic Alterations in Cancer Using DNA Methylation and Gene Expression Data.

机构信息

Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA.

Department of Computer Science & Engineering, Aliah University, Newtown WB-700160, India.

出版信息

Genes (Basel). 2020 Aug 12;11(8):931. doi: 10.3390/genes11080931.

DOI:10.3390/genes11080931
PMID:32806782
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7465138/
Abstract

DNA methylation change has been useful for cancer biomarker discovery, classification, and potential treatment development. So far, existing methods use either differentially methylated CpG sites or combined CpG sites, namely differentially methylated regions, that can be mapped to genes. However, such methylation signal mapping has limitations. To address these limitations, in this study, we introduced a combinatorial framework using linear regression, differential expression, deep learning method for accurate biological interpretation of DNA methylation through integrating DNA methylation data and corresponding TCGA gene expression data. We demonstrated it for uterine cervical cancer. First, we pre-filtered outliers from the data set and then determined the predicted gene expression value from the pre-filtered methylation data through linear regression. We identified differentially expressed genes (DEGs) by Empirical Bayes test using Limma. Then we applied a deep learning method, "" to classify the cervical cancer label of those DEGs to determine all classification metrics including accuracy and area under curve (AUC) through 10-fold cross validation. We applied our approach to uterine cervical cancer DNA methylation dataset (NCBI accession ID: GSE30760, 27,578 features covering 63 tumor and 152 matched normal samples). After linear regression and differential expression analysis, we obtained 6287 DEGs with false discovery rate (FDR) <0.001. After performing deep learning analysis, we obtained average classification accuracy 90.69% (±1.97%) of the uterine cervical cancerous labels. This performance is better than that of other peer methods. We performed in-degree and out-degree hub gene network analysis using Cytoscape. We reported five top in-degree genes (PAIP2, GRWD1, VPS4B, CRADD and LLPH) and five top out-degree genes (MRPL35, FAM177A1, STAT4, ASPSCR1 and FABP7). After that, we performed KEGG pathway and Gene Ontology enrichment analysis of DEGs using tool . In summary, our proposed framework that integrated linear regression, differential expression, deep learning provides a robust approach to better interpret DNA methylation analysis and gene expression data in disease study.

摘要

DNA 甲基化变化已被用于癌症生物标志物的发现、分类和潜在治疗方法的开发。到目前为止,现有的方法要么使用差异甲基化 CpG 位点,要么使用可以映射到基因的差异甲基化区域(即差异甲基化区域)。然而,这种甲基化信号映射存在局限性。为了解决这些局限性,在本研究中,我们引入了一个组合框架,该框架使用线性回归、差异表达和深度学习方法,通过整合 DNA 甲基化数据和相应的 TCGA 基因表达数据,对 DNA 甲基化进行准确的生物学解释。我们用它来研究子宫颈癌。首先,我们从数据集中剔除了异常值,然后通过线性回归从预过滤的甲基化数据中确定预测的基因表达值。我们使用 Limma 的经验贝叶斯检验确定差异表达基因(DEGs)。然后,我们应用一种深度学习方法,“”来对这些 DEGs 的宫颈癌标签进行分类,通过 10 倍交叉验证确定所有分类指标,包括准确性和曲线下面积(AUC)。我们将我们的方法应用于子宫颈癌 DNA 甲基化数据集(NCBI 访问号:GSE30760,包含 63 个肿瘤和 152 个匹配正常样本的 27578 个特征)。在线性回归和差异表达分析后,我们得到了 6287 个 FDR<0.001 的 DEGs。在进行深度学习分析后,我们获得了子宫颈癌标签的平均分类准确率为 90.69%(±1.97%)。该性能优于其他同类方法。我们使用 Cytoscape 进行了入度和出度枢纽基因网络分析。我们报告了五个 top 入度基因(PAIP2、GRWD1、VPS4B、CRADD 和 LLPH)和五个 top 出度基因(MRPL35、FAM177A1、STAT4、ASPSCR1 和 FABP7)。之后,我们使用工具对 DEGs 进行了 KEGG 通路和基因本体论富集分析。总之,我们提出的集成线性回归、差异表达、深度学习的框架为更好地解释疾病研究中的 DNA 甲基化分析和基因表达数据提供了一种强大的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae65/7465138/7f74cf7b403e/genes-11-00931-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae65/7465138/e51a21f31793/genes-11-00931-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae65/7465138/1be19400c5c3/genes-11-00931-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae65/7465138/f2688e7ccd37/genes-11-00931-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae65/7465138/7f74cf7b403e/genes-11-00931-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae65/7465138/e51a21f31793/genes-11-00931-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae65/7465138/1be19400c5c3/genes-11-00931-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae65/7465138/f2688e7ccd37/genes-11-00931-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae65/7465138/7f74cf7b403e/genes-11-00931-g004.jpg

相似文献

1
A Linear Regression and Deep Learning Approach for Detecting Reliable Genetic Alterations in Cancer Using DNA Methylation and Gene Expression Data.基于 DNA 甲基化和基因表达数据的线性回归和深度学习方法在癌症中检测可靠的遗传改变。
Genes (Basel). 2020 Aug 12;11(8):931. doi: 10.3390/genes11080931.
2
Identification of aberrantly methylated differentially expressed genes in breast cancer by integrated bioinformatics analysis.整合生物信息学分析鉴定乳腺癌中异常甲基化差异表达基因。
J Cell Biochem. 2019 Sep;120(9):16229-16243. doi: 10.1002/jcb.28904. Epub 2019 May 12.
3
Integrated Analysis of Methylomic and Transcriptomic Data to Identify Potential Diagnostic Biomarkers for Major Depressive Disorder.整合甲基化组和转录组数据的分析,以鉴定重度抑郁症的潜在诊断生物标志物。
Genes (Basel). 2021 Jan 27;12(2):178. doi: 10.3390/genes12020178.
4
Identification of Methylation Markers and Differentially Expressed Genes with Prognostic Value in Breast Cancer.乳腺癌中具有预后价值的甲基化标记物和差异表达基因的鉴定
J Comput Biol. 2019 Dec;26(12):1394-1408. doi: 10.1089/cmb.2019.0179. Epub 2019 Jul 10.
5
Identification of key DNA methylation-driven genes in prostate adenocarcinoma: an integrative analysis of TCGA methylation data.前列腺腺癌中关键 DNA 甲基化驱动基因的鉴定:TCGA 甲基化数据的综合分析。
J Transl Med. 2019 Sep 18;17(1):311. doi: 10.1186/s12967-019-2065-2.
6
Bioinformatics-Based Identification of Methylated-Differentially Expressed Genes and Related Pathways in Gastric Cancer.基于生物信息学的胃癌甲基化差异表达基因及相关通路的鉴定
Dig Dis Sci. 2017 Nov;62(11):3029-3039. doi: 10.1007/s10620-017-4740-6. Epub 2017 Sep 15.
7
Genome-wide analysis of aberrant gene expression and methylation profiles reveals susceptibility genes and underlying mechanism of cervical cancer.全基因组范围内对异常基因表达和甲基化谱的分析揭示了宫颈癌的易感基因和潜在机制。
Eur J Obstet Gynecol Reprod Biol. 2016 Dec;207:147-152. doi: 10.1016/j.ejogrb.2016.10.017. Epub 2016 Oct 26.
8
Identification of potential blood biomarkers for Parkinson's disease by gene expression and DNA methylation data integration analysis.基于基因表达和 DNA 甲基化数据整合分析鉴定帕金森病的潜在血液生物标志物。
Clin Epigenetics. 2019 Feb 11;11(1):24. doi: 10.1186/s13148-019-0621-5.
9
Impact of mutations in DNA methylation modification genes on genome-wide methylation landscapes and downstream gene activations in pan-cancer.泛癌中 DNA 甲基化修饰基因突变对全基因组甲基化景观和下游基因激活的影响。
BMC Med Genomics. 2020 Feb 24;13(Suppl 3):27. doi: 10.1186/s12920-020-0659-4.
10
Epigenetic profiling and mRNA expression reveal candidate genes as biomarkers for colorectal cancer.表观遗传学分析和 mRNA 表达揭示候选基因作为结直肠癌的生物标志物。
J Cell Biochem. 2019 Jun;120(6):10767-10776. doi: 10.1002/jcb.28368. Epub 2019 Jan 22.

引用本文的文献

1
UNet with Attention Networks: A Novel Deep Learning Approach for DNA Methylation Prediction in HeLa Cells.带有注意力网络的UNet:一种用于预测HeLa细胞中DNA甲基化的新型深度学习方法。
Genes (Basel). 2025 May 28;16(6):655. doi: 10.3390/genes16060655.
2
Subtypes detection of papillary thyroid cancer from methylation assay via Deep Neural Network.通过深度神经网络从甲基化检测中进行甲状腺乳头状癌亚型检测。
Comput Struct Biotechnol J. 2025 Apr 29;27:1809-1817. doi: 10.1016/j.csbj.2025.04.034. eCollection 2025.
3
CXCL8 modulates M0 macrophage proliferation and polarization to influence tumor progression in cervical cancer.

本文引用的文献

1
Molecular signatures identified by integrating gene expression and methylation in non-seminoma and seminoma of testicular germ cell tumours.整合基因表达和甲基化鉴定非精原细胞瘤和精原细胞瘤的分子特征。
Epigenetics. 2021 Jan-Feb;16(2):162-176. doi: 10.1080/15592294.2020.1790108. Epub 2020 Jul 13.
2
FABP5 promotes lymph node metastasis in cervical cancer by reprogramming fatty acid metabolism.FABP5 通过重编程脂肪酸代谢促进宫颈癌淋巴结转移。
Theranostics. 2020 May 17;10(15):6561-6580. doi: 10.7150/thno.44868. eCollection 2020.
3
MicroRNA and transcription factor co-regulatory networks and subtype classification of seminoma and non-seminoma in testicular germ cell tumors.
CXCL8调节M0巨噬细胞的增殖和极化,以影响宫颈癌的肿瘤进展。
Sci Rep. 2025 Jan 4;15(1):790. doi: 10.1038/s41598-024-81726-y.
4
Cancer genetics and deep learning applications for diagnosis, prognosis, and categorization.癌症遗传学与深度学习在诊断、预后及分类中的应用。
J Biol Methods. 2024 Aug 9;11(3):e99010017. doi: 10.14440/jbm.2024.0016. eCollection 2024.
5
Advancing epigenetic profiling in cervical cancer: machine learning techniques for classifying DNA methylation patterns.宫颈癌中表观遗传谱分析的进展:用于分类DNA甲基化模式的机器学习技术
3 Biotech. 2024 Nov;14(11):264. doi: 10.1007/s13205-024-04107-2. Epub 2024 Oct 9.
6
Methods in DNA methylation array dataset analysis: A review.DNA甲基化阵列数据集分析方法:综述
Comput Struct Biotechnol J. 2024 May 17;23:2304-2325. doi: 10.1016/j.csbj.2024.05.015. eCollection 2024 Dec.
7
Application of deep learning in cancer epigenetics through DNA methylation analysis.深度学习在癌症表观遗传学中通过 DNA 甲基化分析的应用。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad411.
8
Identifying anti-TNF response biomarkers in ulcerative colitis using a diffusion-based signalling model.使用基于扩散的信号模型识别溃疡性结肠炎中的抗TNF反应生物标志物。
Bioinform Adv. 2021 Aug 18;1(1):vbab017. doi: 10.1093/bioadv/vbab017. eCollection 2021.
9
Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration.基于多组学数据整合的深度学习算法及伴随生物标志物识别用于 NSCLC 预测。
Biomolecules. 2022 Dec 8;12(12):1839. doi: 10.3390/biom12121839.
10
A Deep Survival EWAS approach estimating risk profile based on pre-diagnostic DNA methylation: An application to breast cancer time to diagnosis.基于预诊断 DNA 甲基化的深度生存 EWAS 方法估计风险概况:在乳腺癌诊断时间中的应用。
PLoS Comput Biol. 2022 Sep 26;18(9):e1009959. doi: 10.1371/journal.pcbi.1009959. eCollection 2022 Sep.
微小 RNA 和转录因子的共同调控网络以及睾丸生殖细胞肿瘤中精原细胞瘤和非精原细胞瘤的亚型分类。
Sci Rep. 2020 Jan 21;10(1):852. doi: 10.1038/s41598-020-57834-w.
4
Quercetin modulates signaling pathways and induces apoptosis in cervical cancer cells.槲皮素调节信号通路并诱导宫颈癌细胞凋亡。
Biosci Rep. 2019 Aug 13;39(8). doi: 10.1042/BSR20190720. Print 2019 Aug 30.
5
WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs.WebGestalt 2019:基因集分析工具包,具有全新的用户界面和 API。
Nucleic Acids Res. 2019 Jul 2;47(W1):W199-W205. doi: 10.1093/nar/gkz401.
6
BioMethyl: an R package for biological interpretation of DNA methylation data.BioMethyl:一个用于 DNA 甲基化数据生物学解释的 R 包。
Bioinformatics. 2019 Oct 1;35(19):3635-3641. doi: 10.1093/bioinformatics/btz137.
7
Graph- and rule-based learning algorithms: a comprehensive review of their applications for cancer type classification and prognosis using genomic data.基于图和规则的学习算法:使用基因组数据对癌症类型分类和预后的应用的全面综述。
Brief Bioinform. 2020 Mar 23;21(2):368-394. doi: 10.1093/bib/bby120.
8
Exploration of methylation-driven genes for monitoring and prognosis of patients with lung adenocarcinoma.探索甲基化驱动基因用于肺腺癌患者的监测和预后评估
Cancer Cell Int. 2018 Nov 26;18:194. doi: 10.1186/s12935-018-0691-z. eCollection 2018.
9
An evaluation of supervised methods for identifying differentially methylated regions in Illumina methylation arrays.评估 Illumina 甲基化阵列中差异甲基化区域的监督方法。
Brief Bioinform. 2019 Nov 27;20(6):2224-2235. doi: 10.1093/bib/bby085.
10
MicroRNA expression profiles and networks in placentas complicated with selective intrauterine growth restriction.胎盘合并选择性宫内生长受限的 microRNA 表达谱和网络
Mol Med Rep. 2017 Nov;16(5):6650-6673. doi: 10.3892/mmr.2017.7462. Epub 2017 Sep 11.