带有注意力网络的UNet:一种用于预测HeLa细胞中DNA甲基化的新型深度学习方法。

UNet with Attention Networks: A Novel Deep Learning Approach for DNA Methylation Prediction in HeLa Cells.

作者信息

Handa Vikas, Batra Shalini, Arora Vinay

机构信息

Department of Biotechnology, Thapar Institute of Engineering & Technology, Patiala 147004, India.

Department of Computer Science & Engineering, Thapar Institute of Engineering & Technology, Patiala 147004, India.

出版信息

Genes (Basel). 2025 May 28;16(6):655. doi: 10.3390/genes16060655.

Abstract

: The purpose of the proposed study is to investigate the efficacy of UNet in predicting Deoxyribonucleic Acid methylation patterns in a cervical cancer cell line. The application of deep learning to analyse the factors affecting methylation in the context of cervical cancer has not yet been fully explored. : A comprehensive performance evaluation has been conducted based on multiple window sizes of DNA sequences. For this purpose, three different parameter-analysis techniques, namely, autoencoders, Generative Adversarial Networks, and Multi-Head Attention Networks, were used. This work presents a novel framework for methylation prediction in promoter regions of various genes. : Experimental results have proved that attention networks in association with UNet achieved a significant accuracy level of 91.01% along with a sensitivity of 89.65%, specificity of around 92.35%, and an area under curve of 0.910 on ENCODE database. The proposed model outperformed three state-of-the-art models: Convolutional Neural Network, Transfer Learning, and Feed Forward Neural Network with K-Nearest Neighbour. Moreover, validation of the model in five gene promoters achieved an accuracy of 81.60% with an area under curve score of 0.814, a -value of 3.62×10-19, and Cohen's Kappa value of 0.631. This novel approach has led to a better understanding of epigenetic variables and their implications in cervical cancer, offering potential insights into therapeutic strategies.

摘要

本研究的目的是探讨U-Net在预测宫颈癌细胞系中脱氧核糖核酸甲基化模式方面的功效。深度学习在分析宫颈癌背景下影响甲基化的因素方面的应用尚未得到充分探索。基于DNA序列的多个窗口大小进行了全面的性能评估。为此,使用了三种不同的参数分析技术,即自动编码器、生成对抗网络和多头注意力网络。这项工作提出了一个用于预测各种基因启动子区域甲基化的新框架。实验结果证明,与U-Net相关联的注意力网络在ENCODE数据库上达到了91.01%的显著准确率,灵敏度为89.65%,特异性约为92.35%,曲线下面积为0.910。所提出的模型优于三种先进模型:卷积神经网络、迁移学习和带K近邻的前馈神经网络。此外,该模型在五个基因启动子中的验证达到了81.60%的准确率,曲线下面积得分为0.814,p值为3.62×10-19,科恩卡帕值为0.631。这种新方法有助于更好地理解表观遗传变量及其在宫颈癌中的意义,为治疗策略提供了潜在的见解。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索