• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iDNA-ITLM:一种用于识别 DNA 甲基化的可解释和可迁移学习模型。

iDNA-ITLM: An interpretable and transferable learning model for identifying DNA methylation.

机构信息

School of Information and Communication Engineering, Hainan University, Haikou, Hainan, China.

Key Laboratory of Data Science and Smart Education, Ministry of Education, Hainan Normal University, Haikou, Hainan, China.

出版信息

PLoS One. 2024 Oct 31;19(10):e0301791. doi: 10.1371/journal.pone.0301791. eCollection 2024.

DOI:10.1371/journal.pone.0301791
PMID:39480834
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11527195/
Abstract

In this study, from the perspective of image processing, we propose the iDNA-ITLM model, using a novel data enhance strategy by continuously self-replicating a short DNA sequence into a longer DNA sequence and then embedding it into a high-dimensional matrix to enlarge the receptive field, for identifying DNA methylation sites. Our model consistently outperforms the current state-of-the-art sequence-based DNA methylation site recognition methods when evaluated on 17 benchmark datasets that cover multiple species and include three DNA methylation modifications (4mC, 5hmC, and 6mA). The experimental results demonstrate the robustness and superior performance of our model across these datasets. In addition, our model can transfer learning to RNA methylation sequences and produce good results without modifying the hyperparameters in the model. The proposed iDNA-ITLM model can be considered a universal predictor across DNA and RNA methylation species.

摘要

在这项研究中,我们从图像处理的角度出发,提出了 iDNA-ITLM 模型,该模型采用了一种新颖的数据增强策略,通过将短 DNA 序列不断自我复制成长 DNA 序列,然后将其嵌入到高维矩阵中以扩大感受野,用于识别 DNA 甲基化位点。在评估涵盖多个物种并包含三种 DNA 甲基化修饰(4mC、5hmC 和 6mA)的 17 个基准数据集时,我们的模型在当前基于序列的 DNA 甲基化位点识别方法中始终表现出色。实验结果证明了我们的模型在这些数据集上的稳健性和卓越性能。此外,我们的模型可以将迁移学习应用于 RNA 甲基化序列,并在不修改模型中超参数的情况下产生良好的结果。所提出的 iDNA-ITLM 模型可以被视为跨 DNA 和 RNA 甲基化物种的通用预测器。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/88b615ace129/pone.0301791.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/44378de8e584/pone.0301791.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/fd6bf9148a56/pone.0301791.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/b2edd1978eeb/pone.0301791.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/bd9d767bf072/pone.0301791.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/fabf1e917bd1/pone.0301791.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/54cf84683376/pone.0301791.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/a9ba363a752b/pone.0301791.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/39f67f9359c8/pone.0301791.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/88b615ace129/pone.0301791.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/44378de8e584/pone.0301791.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/fd6bf9148a56/pone.0301791.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/b2edd1978eeb/pone.0301791.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/bd9d767bf072/pone.0301791.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/fabf1e917bd1/pone.0301791.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/54cf84683376/pone.0301791.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/a9ba363a752b/pone.0301791.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/39f67f9359c8/pone.0301791.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4f/11527195/88b615ace129/pone.0301791.g009.jpg

相似文献

1
iDNA-ITLM: An interpretable and transferable learning model for identifying DNA methylation.iDNA-ITLM:一种用于识别 DNA 甲基化的可解释和可迁移学习模型。
PLoS One. 2024 Oct 31;19(10):e0301791. doi: 10.1371/journal.pone.0301791. eCollection 2024.
2
iDNA-OpenPrompt: OpenPrompt learning model for identifying DNA methylation.iDNA-OpenPrompt:用于识别DNA甲基化的OpenPrompt学习模型。
Front Genet. 2024 Apr 16;15:1377285. doi: 10.3389/fgene.2024.1377285. eCollection 2024.
3
iDNA-ABT: advanced deep learning model for detecting DNA methylation with adaptive features and transductive information maximization.iDNA-ABT:具有自适应特征和转导信息最大化的先进深度学习模型,用于检测 DNA 甲基化。
Bioinformatics. 2021 Dec 11;37(24):4603-4610. doi: 10.1093/bioinformatics/btab677.
4
EpiTEAmDNA: Sequence feature representation via transfer learning and ensemble learning for identifying multiple DNA epigenetic modification types across species.EpiTEAmDNA:通过迁移学习和集成学习进行序列特征表示,以跨物种识别多种 DNA 表观遗传修饰类型。
Comput Biol Med. 2023 Jun;160:107030. doi: 10.1016/j.compbiomed.2023.107030. Epub 2023 May 11.
5
Accurate prediction of DNA N-methylcytosine sites via boost-learning various types of sequence features.通过提升学习多种类型的序列特征来准确预测 DNA N-甲基胞嘧啶位点。
BMC Genomics. 2020 Sep 11;21(1):627. doi: 10.1186/s12864-020-07033-8.
6
iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool.iDNA-MT:基于神经网络工具利用多任务学习识别多个物种中的DNA修饰位点
Front Genet. 2021 Mar 31;12:663572. doi: 10.3389/fgene.2021.663572. eCollection 2021.
7
DNA-MP: a generalized DNA modifications predictor for multiple species based on powerful sequence encoding method.DNA-MP:一种基于强大序列编码方法的多物种通用DNA修饰预测器。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac546.
8
iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations.iDNA-ABF:用于可解释的 DNA 甲基化预测的多尺度深度生物语言学习模型。
Genome Biol. 2022 Oct 17;23(1):219. doi: 10.1186/s13059-022-02780-1.
9
Time series-based hybrid ensemble learning model with multivariate multidimensional feature coding for DNA methylation prediction.基于时间序列的混合集成学习模型,具有多维多维特征编码,用于 DNA 甲基化预测。
BMC Genomics. 2023 Dec 11;24(1):758. doi: 10.1186/s12864-023-09866-5.
10
iDNA-MS: An Integrated Computational Tool for Detecting DNA Modification Sites in Multiple Genomes.iDNA-MS:一种用于检测多个基因组中DNA修饰位点的综合计算工具。
iScience. 2020 Apr 24;23(4):100991. doi: 10.1016/j.isci.2020.100991. Epub 2020 Mar 19.

本文引用的文献

1
DRSN4mCPred: accurately predicting sites of DNA N4-methylcytosine using deep residual shrinkage network for diagnosis and treatment of gastrointestinal cancer in the precision medicine era.DRSN4mCPred:在精准医学时代,使用深度残差收缩网络准确预测DNA N4-甲基胞嘧啶位点以用于胃肠道癌的诊断和治疗。
Front Med (Lausanne). 2023 May 4;10:1187430. doi: 10.3389/fmed.2023.1187430. eCollection 2023.
2
EpiTEAmDNA: Sequence feature representation via transfer learning and ensemble learning for identifying multiple DNA epigenetic modification types across species.EpiTEAmDNA:通过迁移学习和集成学习进行序列特征表示,以跨物种识别多种 DNA 表观遗传修饰类型。
Comput Biol Med. 2023 Jun;160:107030. doi: 10.1016/j.compbiomed.2023.107030. Epub 2023 May 11.
3
iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations.iDNA-ABF:用于可解释的 DNA 甲基化预测的多尺度深度生物语言学习模型。
Genome Biol. 2022 Oct 17;23(1):219. doi: 10.1186/s13059-022-02780-1.
4
i6mA-Caps: a CapsuleNet-based framework for identifying DNA N6-methyladenine sites.i6mA-Caps:一种基于胶囊网络的 DNA N6-甲基腺嘌呤位点识别框架。
Bioinformatics. 2022 Aug 10;38(16):3885-3891. doi: 10.1093/bioinformatics/btac434.
5
Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction.Hyb4mC:一种基于 DNA2vec 的混合模型,用于预测 DNA N4-甲基胞嘧啶位点。
BMC Bioinformatics. 2022 Jun 29;23(1):258. doi: 10.1186/s12859-022-04789-6.
6
BERT6mA: prediction of DNA N6-methyladenine site using deep learning-based approaches.BERT6mA:基于深度学习的方法预测 DNA N6-甲基腺嘌呤位点。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac053.
7
Mouse4mC-BGRU: Deep learning for predicting DNA N4-methylcytosine sites in mouse genome.Mouse4mC-BGRU:用于预测小鼠基因组中 DNA N4-甲基胞嘧啶位点的深度学习方法。
Methods. 2022 Aug;204:258-262. doi: 10.1016/j.ymeth.2022.01.009. Epub 2022 Jan 31.
8
BiLSTM-5mC: A Bidirectional Long Short-Term Memory-Based Approach for Predicting 5-Methylcytosine Sites in Genome-Wide DNA Promoters.基于双向长短时记忆网络(BiLSTM)的 5-甲基胞嘧啶(5mC)位点预测方法:全基因组 DNA 启动子研究
Molecules. 2021 Dec 7;26(24):7414. doi: 10.3390/molecules26247414.
9
iDNA-ABT: advanced deep learning model for detecting DNA methylation with adaptive features and transductive information maximization.iDNA-ABT:具有自适应特征和转导信息最大化的先进深度学习模型,用于检测 DNA 甲基化。
Bioinformatics. 2021 Dec 11;37(24):4603-4610. doi: 10.1093/bioinformatics/btab677.
10
Leveraging the attention mechanism to improve the identification of DNA N6-methyladenine sites.利用注意力机制提高 DNA N6-甲基腺嘌呤位点的识别。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab351.