• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

i6mA-Fuse:通过融合多种特征表示来改进和增强蔷薇科基因组中 DNA 6mA 位点的预测

i6mA-Fuse: improved and robust prediction of DNA 6 mA sites in the Rosaceae genome by fusing multiple feature representation.

机构信息

Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka, 820-8502, Japan.

Japan Society for the Promotion of Science, 5-3-1 Kojimachi, Chiyoda-ku, Tokyo, 102-0083, Japan.

出版信息

Plant Mol Biol. 2020 May;103(1-2):225-234. doi: 10.1007/s11103-020-00988-y. Epub 2020 Mar 5.

DOI:10.1007/s11103-020-00988-y
PMID:32140819
Abstract

DNA N-methyladenine (6 mA) is one of the most vital epigenetic modifications and involved in controlling the various gene expression levels. With the avalanche of DNA sequences generated in numerous databases, the accurate identification of 6 mA plays an essential role for understanding molecular mechanisms. Because the experimental approaches are time-consuming and costly, it is desirable to develop a computation model for rapidly and accurately identifying 6 mA. To the best of our knowledge, we first proposed a computational model named i6mA-Fuse to predict 6 mA sites from the Rosaceae genomes, especially in Rosa chinensis and Fragaria vesca. We implemented the five encoding schemes, i.e., mononucleotide binary, dinucleotide binary, k-space spectral nucleotide, k-mer, and electron-ion interaction pseudo potential compositions, to build the five, single-encoding random forest (RF) models. The i6mA-Fuse uses a linear regression model to combine the predicted probability scores of the five, single encoding-based RF models. The resultant species-specific i6mA-Fuse achieved remarkably high performances with AUCs of 0.982 and 0.978 and with MCCs of 0.869 and 0.858 on the independent datasets of Rosa chinensis and Fragaria vesca, respectively. In the F. vesca-specific i6mA-Fuse, the MBE and EIIP contributed to 75% and 25% of the total prediction; in the R. chinensis-specific i6mA-Fuse, Kmer, MBE, and EIIP contribute to 15%, 65%, and 20% of the total prediction. To assist high-throughput prediction for DNA 6 mA identification, the i6mA-Fuse is publicly accessible at https://kurata14.bio.kyutech.ac.jp/i6mA-Fuse/.

摘要

DNA N-甲基腺嘌呤(6mA)是最重要的表观遗传修饰之一,参与调控各种基因表达水平。随着大量数据库中生成的 DNA 序列的涌现,准确识别 6mA 对于理解分子机制至关重要。由于实验方法既耗时又昂贵,因此需要开发一种计算模型来快速准确地识别 6mA。据我们所知,我们首次提出了一种名为 i6mA-Fuse 的计算模型,用于从蔷薇科基因组中预测 6mA 位点,特别是在玫瑰和草莓中。我们实现了五种编码方案,即单核碱基二进位、双核碱基二进位、k-空间谱核苷酸、k--mer 和电子-离子相互作用伪势组合,构建了五种单编码随机森林(RF)模型。i6mA-Fuse 使用线性回归模型来结合五种基于单编码的 RF 模型的预测概率得分。在独立的玫瑰和草莓数据集上,物种特异性的 i6mA-Fuse 分别达到了惊人的高性能,AUC 分别为 0.982 和 0.978,MCC 分别为 0.869 和 0.858。在草莓特异性的 i6mA-Fuse 中,MBE 和 EIIP 分别贡献了 75%和 25%的总预测;在玫瑰特异性的 i6mA-Fuse 中,Kmer、MBE 和 EIIP 分别贡献了 15%、65%和 20%的总预测。为了协助高通量预测 DNA 6mA 识别,i6mA-Fuse 可在 https://kurata14.bio.kyutech.ac.jp/i6mA-Fuse/ 上公开获取。

相似文献

1
i6mA-Fuse: improved and robust prediction of DNA 6 mA sites in the Rosaceae genome by fusing multiple feature representation.i6mA-Fuse:通过融合多种特征表示来改进和增强蔷薇科基因组中 DNA 6mA 位点的预测
Plant Mol Biol. 2020 May;103(1-2):225-234. doi: 10.1007/s11103-020-00988-y. Epub 2020 Mar 5.
2
Meta-i6mA: an interspecies predictor for identifying DNA N6-methyladenine sites of plant genomes by exploiting informative features in an integrative machine-learning framework.Meta-i6mA:利用集成机器学习框架中的信息特征,用于识别植物基因组中 DNA N6-甲基腺嘌呤位点的种间预测因子。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa202.
3
i6mA-stack: A stacking ensemble-based computational prediction of DNA N6-methyladenine (6mA) sites in the Rosaceae genome.i6mA-stack:基于堆叠集成法对蔷薇科基因组中DNA N6-甲基腺嘌呤(6mA)位点的计算预测。
Genomics. 2021 Jan;113(1 Pt 2):582-592. doi: 10.1016/j.ygeno.2020.09.054. Epub 2020 Oct 1.
4
i4mC-ROSE, a bioinformatics tool for the identification of DNA N4-methylcytosine sites in the Rosaceae genome.i4mC-ROSE,一种用于鉴定蔷薇科基因组中 DNA N4-甲基胞嘧啶位点的生物信息学工具。
Int J Biol Macromol. 2020 Aug 15;157:752-758. doi: 10.1016/j.ijbiomac.2019.12.009. Epub 2019 Dec 2.
5
i6mA-VC: A Multi-Classifier Voting Method for the Computational Identification of DNA N6-methyladenine Sites.i6mA-VC:一种用于计算鉴定 DNA N6-甲基腺嘌呤位点的多分类器投票方法。
Interdiscip Sci. 2021 Sep;13(3):413-425. doi: 10.1007/s12539-021-00429-4. Epub 2021 Apr 8.
6
i4mC-Mouse: Improved identification of DNA N4-methylcytosine sites in the mouse genome using multiple encoding schemes.i4mC-Mouse:使用多种编码方案改进对小鼠基因组中DNA N4-甲基胞嘧啶位点的识别。
Comput Struct Biotechnol J. 2020 Apr 8;18:906-912. doi: 10.1016/j.csbj.2020.04.001. eCollection 2020.
7
i6mA-DNCP: Computational Identification of DNA -Methyladenine Sites in the Rice Genome Using Optimized Dinucleotide-Based Features.i6mA-DNCP:利用优化的二核苷酸特征计算鉴定水稻基因组中的 DNA-甲基腺嘌呤位点。
Genes (Basel). 2019 Oct 20;10(10):828. doi: 10.3390/genes10100828.
8
i6mA-Vote: Cross-Species Identification of DNA N6-Methyladenine Sites in Plant Genomes Based on Ensemble Learning With Voting.i6mA-Vote:基于投票集成学习的植物基因组中DNA N6-甲基腺嘌呤位点的跨物种鉴定
Front Plant Sci. 2022 Feb 14;13:845835. doi: 10.3389/fpls.2022.845835. eCollection 2022.
9
ProIn-Fuse: improved and robust prediction of proinflammatory peptides by fusing of multiple feature representations.ProIn-Fuse:通过融合多种特征表示来改进和增强促炎肽的预测。
J Comput Aided Mol Des. 2020 Dec;34(12):1229-1236. doi: 10.1007/s10822-020-00343-9. Epub 2020 Sep 23.
10
IRC-Fuse: improved and robust prediction of redox-sensitive cysteine by fusing of multiple feature representations.IRC-Fuse:通过融合多种特征表示改进并稳健预测氧化还原敏感型半胱氨酸
J Comput Aided Mol Des. 2021 Mar;35(3):315-323. doi: 10.1007/s10822-020-00368-0. Epub 2021 Jan 4.

引用本文的文献

1
DeepRice6mA: A convolutional neural network approach for 6mA site prediction in the rice Genome.深度水稻6mA:一种用于水稻基因组中6mA位点预测的卷积神经网络方法。
PLoS One. 2025 Jun 18;20(6):e0325216. doi: 10.1371/journal.pone.0325216. eCollection 2025.
2
HD-6mAPred: a hybrid deep learning approach for accurate prediction of N6-methyladenine sites in plant species.HD-6mAPred:一种用于准确预测植物物种中N6-甲基腺嘌呤位点的混合深度学习方法。
PeerJ. 2025 May 15;13:e19463. doi: 10.7717/peerj.19463. eCollection 2025.
3
N6-methyladenine identification using deep learning and discriminative feature integration.

本文引用的文献

1
6mA-RicePred: A Method for Identifying DNA -Methyladenine Sites in the Rice Genome Based on Feature Fusion.6mA-RicePred:一种基于特征融合识别水稻基因组中DNA甲基腺嘌呤位点的方法。
Front Plant Sci. 2020 Jan 31;11:4. doi: 10.3389/fpls.2020.00004. eCollection 2020.
2
Machine intelligence in peptide therapeutics: A next-generation tool for rapid disease screening.机器智能在肽类药物治疗学中的应用:一种用于快速疾病筛查的下一代工具。
Med Res Rev. 2020 Jul;40(4):1276-1314. doi: 10.1002/med.21658. Epub 2020 Jan 10.
3
Machine learning and its applications in plant molecular studies.
利用深度学习和判别特征整合进行N6-甲基腺嘌呤识别
BMC Med Genomics. 2025 Mar 29;18(1):58. doi: 10.1186/s12920-025-02131-6.
4
Predicting multiple taste sensations with a multiobjective machine learning method.使用多目标机器学习方法预测多种味觉感受。
NPJ Sci Food. 2024 Jul 25;8(1):47. doi: 10.1038/s41538-024-00287-6.
5
6mA-StackingCV: an improved stacking ensemble model for predicting DNA N6-methyladenine site.6mA-StackingCV:一种用于预测DNA N6-甲基腺嘌呤位点的改进堆叠集成模型。
BioData Min. 2023 Nov 27;16(1):34. doi: 10.1186/s13040-023-00348-8.
6
Empirical comparison and recent advances of computational prediction of hormone binding proteins using machine learning methods.使用机器学习方法对激素结合蛋白进行计算预测的实证比较与最新进展
Comput Struct Biotechnol J. 2023 Mar 17;21:2253-2261. doi: 10.1016/j.csbj.2023.03.024. eCollection 2023.
7
Predicting N6-Methyladenosine Sites in Multiple Tissues of Mammals through Ensemble Deep Learning.通过集成深度学习预测哺乳动物多种组织中的 N6-甲基腺苷位点。
Int J Mol Sci. 2022 Dec 7;23(24):15490. doi: 10.3390/ijms232415490.
8
Toward a general and interpretable umami taste predictor using a multi-objective machine learning approach.采用多目标机器学习方法构建一种通用且可解释的鲜味预测模型。
Sci Rep. 2022 Dec 16;12(1):21735. doi: 10.1038/s41598-022-25935-3.
9
4acCPred: Weakly supervised prediction of -acetyldeoxycytosine DNA modification from sequences.4acCPred:从序列中对N4-乙酰脱氧胞嘧啶DNA修饰进行弱监督预测。
Mol Ther Nucleic Acids. 2022 Oct 14;30:337-345. doi: 10.1016/j.omtn.2022.10.004. eCollection 2022 Dec 13.
10
Deepm5C: A deep-learning-based hybrid framework for identifying human RNA N5-methylcytosine sites using a stacking strategy.Deepm5C:一种基于深度学习的混合框架,使用堆叠策略识别人类 RNA N5-甲基胞嘧啶位点。
Mol Ther. 2022 Aug 3;30(8):2856-2867. doi: 10.1016/j.ymthe.2022.05.001. Epub 2022 May 6.
机器学习及其在植物分子研究中的应用。
Brief Funct Genomics. 2020 Jan 22;19(1):40-48. doi: 10.1093/bfgp/elz036.
4
i4mC-ROSE, a bioinformatics tool for the identification of DNA N4-methylcytosine sites in the Rosaceae genome.i4mC-ROSE,一种用于鉴定蔷薇科基因组中 DNA N4-甲基胞嘧啶位点的生物信息学工具。
Int J Biol Macromol. 2020 Aug 15;157:752-758. doi: 10.1016/j.ijbiomac.2019.12.009. Epub 2019 Dec 2.
5
Meta-iAVP: A Sequence-Based Meta-Predictor for Improving the Prediction of Antiviral Peptides Using Effective Feature Representation.Meta-iAVP:一种基于序列的元预测器,用于使用有效的特征表示来改进抗病毒肽的预测。
Int J Mol Sci. 2019 Nov 15;20(22):5743. doi: 10.3390/ijms20225743.
6
Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.全面综述和评估基于 RNA 序列预测 RNA 转录后修饰位点的计算方法。
Brief Bioinform. 2020 Sep 25;21(5):1676-1696. doi: 10.1093/bib/bbz112.
7
Prediction of S-nitrosylation sites by integrating support vector machines and random forest.基于支持向量机和随机森林算法的 S-亚硝酰化位点预测
Mol Omics. 2019 Dec 2;15(6):451-458. doi: 10.1039/c9mo00098d.
8
SNNRice6mA: A Deep Learning Method for Predicting DNA N6-Methyladenine Sites in Rice Genome.SNNRice6mA:一种预测水稻基因组中DNA N6-甲基腺嘌呤位点的深度学习方法。
Front Genet. 2019 Oct 11;10:1071. doi: 10.3389/fgene.2019.01071. eCollection 2019.
9
4mCpred-EL: An Ensemble Learning Framework for Identification of DNA -methylcytosine Sites in the Mouse Genome.4mCpred-EL:用于鉴定小鼠基因组中 DNA-甲基胞嘧啶位点的集成学习框架。
Cells. 2019 Oct 28;8(11):1332. doi: 10.3390/cells8111332.
10
A comparison and assessment of computational method for identifying recombination hotspots in Saccharomyces cerevisiae.一种比较和评估鉴定酿酒酵母重组热点的计算方法。
Brief Bioinform. 2020 Sep 25;21(5):1568-1580. doi: 10.1093/bib/bbz123.