• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iDNA6mA-水稻:一种用于检测水稻中N6-甲基腺嘌呤位点的计算工具。

iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice.

作者信息

Lv Hao, Dao Fu-Ying, Guan Zheng-Xing, Zhang Dan, Tan Jiu-Xin, Zhang Yong, Chen Wei, Lin Hao

机构信息

Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.

Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu, China.

出版信息

Front Genet. 2019 Sep 10;10:793. doi: 10.3389/fgene.2019.00793. eCollection 2019.

DOI:10.3389/fgene.2019.00793
PMID:31552096
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6746913/
Abstract

DNA N6-methyladenine (6mA) is a dominant DNA modification form and involved in many biological functions. The accurate genome-wide identification of 6mA sites may increase understanding of its biological functions. Experimental methods for 6mA detection in eukaryotes genome are laborious and expensive. Therefore, it is necessary to develop computational methods to identify 6mA sites on a genomic scale, especially for plant genomes. Based on this consideration, the study aims to develop a machine learning-based method of predicting 6mA sites in the rice genome. We initially used mono-nucleotide binary encoding to formulate positive and negative samples. Subsequently, the machine learning algorithm named Random Forest was utilized to perform the classification for identifying 6mA sites. Our proposed method could produce an area under the receiver operating characteristic curve of 0.964 with an overall accuracy of 0.917, as indicated by the fivefold cross-validation test. Furthermore, an independent dataset was established to assess the generalization ability of our method. Finally, an area under the receiver operating characteristic curve of 0.981 was obtained, suggesting that the proposed method had good performance of predicting 6mA sites in the rice genome. For the convenience of retrieving 6mA sites, on the basis of the computational method, we built a freely accessible web server named iDNA6mA-Rice at http://lin-group.cn/server/iDNA6mA-Rice.

摘要

DNA N6-甲基腺嘌呤(6mA)是一种主要的DNA修饰形式,参与多种生物学功能。对全基因组范围内6mA位点进行准确鉴定,可能会增进我们对其生物学功能的理解。真核生物基因组中6mA检测的实验方法既费力又昂贵。因此,有必要开发计算方法来在基因组规模上鉴定6mA位点,尤其是针对植物基因组。基于这一考虑,本研究旨在开发一种基于机器学习的方法来预测水稻基因组中的6mA位点。我们最初使用单核苷酸二进制编码来构建正样本和负样本。随后,利用名为随机森林的机器学习算法进行分类,以鉴定6mA位点。如五折交叉验证测试所示,我们提出的方法在受试者工作特征曲线下的面积为0.964,总体准确率为0.917。此外,还建立了一个独立数据集来评估我们方法的泛化能力。最后,获得了受试者工作特征曲线下面积为0.981的结果,表明所提出的方法在预测水稻基因组中6mA位点方面具有良好性能。为了方便检索6mA位点,基于该计算方法,我们在http://lin-group.cn/server/iDNA6mA-Rice构建了一个可免费访问的网络服务器,名为iDNA6mA-Rice。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b0c/6746913/4d4d7c7b40c9/fgene-10-00793-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b0c/6746913/73652480ae67/fgene-10-00793-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b0c/6746913/11697b272a8d/fgene-10-00793-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b0c/6746913/bda01a2cd390/fgene-10-00793-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b0c/6746913/7a623b8b55d5/fgene-10-00793-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b0c/6746913/f03df77a9048/fgene-10-00793-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b0c/6746913/4d4d7c7b40c9/fgene-10-00793-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b0c/6746913/73652480ae67/fgene-10-00793-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b0c/6746913/11697b272a8d/fgene-10-00793-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b0c/6746913/bda01a2cd390/fgene-10-00793-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b0c/6746913/7a623b8b55d5/fgene-10-00793-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b0c/6746913/f03df77a9048/fgene-10-00793-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b0c/6746913/4d4d7c7b40c9/fgene-10-00793-g006.jpg

相似文献

1
iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice.iDNA6mA-水稻:一种用于检测水稻中N6-甲基腺嘌呤位点的计算工具。
Front Genet. 2019 Sep 10;10:793. doi: 10.3389/fgene.2019.00793. eCollection 2019.
2
iDNA6mA-Rice-DL: A local web server for identifying DNA N6-methyladenine sites in rice genome by deep learning method.iDNA6mA-Rice-DL:一个通过深度学习方法鉴定水稻基因组中 DNA N6-甲基腺嘌呤位点的本地网络服务器。
J Bioinform Comput Biol. 2021 Oct;19(5):2150019. doi: 10.1142/S0219720021500190. Epub 2021 Jul 21.
3
i6mA-Pred: identifying DNA N6-methyladenine sites in the rice genome.i6mA-Pred:鉴定水稻基因组中的 DNA N6-甲基腺嘌呤位点。
Bioinformatics. 2019 Aug 15;35(16):2796-2800. doi: 10.1093/bioinformatics/btz015.
4
iDNA-MS: An Integrated Computational Tool for Detecting DNA Modification Sites in Multiple Genomes.iDNA-MS:一种用于检测多个基因组中DNA修饰位点的综合计算工具。
iScience. 2020 Apr 24;23(4):100991. doi: 10.1016/j.isci.2020.100991. Epub 2020 Mar 19.
5
SNNRice6mA: A Deep Learning Method for Predicting DNA N6-Methyladenine Sites in Rice Genome.SNNRice6mA:一种预测水稻基因组中DNA N6-甲基腺嘌呤位点的深度学习方法。
Front Genet. 2019 Oct 11;10:1071. doi: 10.3389/fgene.2019.01071. eCollection 2019.
6
i6mA-DNCP: Computational Identification of DNA -Methyladenine Sites in the Rice Genome Using Optimized Dinucleotide-Based Features.i6mA-DNCP:利用优化的二核苷酸特征计算鉴定水稻基因组中的 DNA-甲基腺嘌呤位点。
Genes (Basel). 2019 Oct 20;10(10):828. doi: 10.3390/genes10100828.
7
Multi-scale DNA language model improves 6 mA binding sites prediction.多尺度 DNA 语言模型提高了 6mA 结合位点的预测。
Comput Biol Chem. 2024 Oct;112:108129. doi: 10.1016/j.compbiolchem.2024.108129. Epub 2024 Jul 18.
8
A convolution based computational approach towards DNA N6-methyladenine site identification and motif extraction in rice genome.基于卷积的计算方法,用于鉴定水稻基因组中的 DNA N6-甲基腺嘌呤位点并提取其基序。
Sci Rep. 2021 May 14;11(1):10357. doi: 10.1038/s41598-021-89850-9.
9
SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome.SDM6A:一个基于网络的用于预测水稻基因组中6mA位点的综合机器学习框架。
Mol Ther Nucleic Acids. 2019 Dec 6;18:131-141. doi: 10.1016/j.omtn.2019.08.011. Epub 2019 Aug 16.
10
iDNA6mA-PseKNC: Identifying DNA N-methyladenosine sites by incorporating nucleotide physicochemical properties into PseKNC.iDNA6mA-PseKNC:通过将核苷酸理化性质纳入 PseKNC 来鉴定 DNA N6-甲基腺苷位点。
Genomics. 2019 Jan;111(1):96-102. doi: 10.1016/j.ygeno.2018.01.005. Epub 2018 Jan 31.

引用本文的文献

1
Identification of DNA N6-methyladenine modifications in the rice genome with a fine-tuned large language model.利用微调的大语言模型鉴定水稻基因组中的DNA N6-甲基腺嘌呤修饰
Front Plant Sci. 2025 Jun 25;16:1626539. doi: 10.3389/fpls.2025.1626539. eCollection 2025.
2
DeepRice6mA: A convolutional neural network approach for 6mA site prediction in the rice Genome.深度水稻6mA:一种用于水稻基因组中6mA位点预测的卷积神经网络方法。
PLoS One. 2025 Jun 18;20(6):e0325216. doi: 10.1371/journal.pone.0325216. eCollection 2025.
3
HD-6mAPred: a hybrid deep learning approach for accurate prediction of N6-methyladenine sites in plant species.

本文引用的文献

1
iProEP: A Computational Predictor for Predicting Promoter.iProEP:一种用于预测启动子的计算预测工具。
Mol Ther Nucleic Acids. 2019 Sep 6;17:337-346. doi: 10.1016/j.omtn.2019.05.028. Epub 2019 Jun 13.
2
EmExplorer: a database for exploring time activation of gene expression in mammalian embryos.EmExplorer:一个用于探索哺乳动物胚胎中基因表达时间激活的数据库。
Open Biol. 2019 Jun 28;9(6):190054. doi: 10.1098/rsob.190054. Epub 2019 Jun 5.
3
Evaluation of different computational methods on 5-methylcytosine sites identification.
HD-6mAPred:一种用于准确预测植物物种中N6-甲基腺嘌呤位点的混合深度学习方法。
PeerJ. 2025 May 15;13:e19463. doi: 10.7717/peerj.19463. eCollection 2025.
4
6mA-StackingCV: an improved stacking ensemble model for predicting DNA N6-methyladenine site.6mA-StackingCV:一种用于预测DNA N6-甲基腺嘌呤位点的改进堆叠集成模型。
BioData Min. 2023 Nov 27;16(1):34. doi: 10.1186/s13040-023-00348-8.
5
MethSemble-6mA: an ensemble-based 6mA prediction server and its application on promoter region of LBD gene family in Poaceae.MethSemble-6mA:一种基于集成学习的6mA预测服务器及其在禾本科LBD基因家族启动子区域的应用
Front Plant Sci. 2023 Oct 9;14:1256186. doi: 10.3389/fpls.2023.1256186. eCollection 2023.
6
DRSN4mCPred: accurately predicting sites of DNA N4-methylcytosine using deep residual shrinkage network for diagnosis and treatment of gastrointestinal cancer in the precision medicine era.DRSN4mCPred:在精准医学时代,使用深度残差收缩网络准确预测DNA N4-甲基胞嘧啶位点以用于胃肠道癌的诊断和治疗。
Front Med (Lausanne). 2023 May 4;10:1187430. doi: 10.3389/fmed.2023.1187430. eCollection 2023.
7
Predicting N6-Methyladenosine Sites in Multiple Tissues of Mammals through Ensemble Deep Learning.通过集成深度学习预测哺乳动物多种组织中的 N6-甲基腺苷位点。
Int J Mol Sci. 2022 Dec 7;23(24):15490. doi: 10.3390/ijms232415490.
8
4acCPred: Weakly supervised prediction of -acetyldeoxycytosine DNA modification from sequences.4acCPred:从序列中对N4-乙酰脱氧胞嘧啶DNA修饰进行弱监督预测。
Mol Ther Nucleic Acids. 2022 Oct 14;30:337-345. doi: 10.1016/j.omtn.2022.10.004. eCollection 2022 Dec 13.
9
ENet-6mA: Identification of 6mA Modification Sites in Plant Genomes Using ElasticNet and Neural Networks.ENet-6mA:使用弹性网络和神经网络鉴定植物基因组中的 6mA 修饰位点。
Int J Mol Sci. 2022 Jul 27;23(15):8314. doi: 10.3390/ijms23158314.
10
Machine learning: its challenges and opportunities in plant system biology.机器学习:在植物系统生物学中的挑战与机遇。
Appl Microbiol Biotechnol. 2022 May;106(9-10):3507-3530. doi: 10.1007/s00253-022-11963-6. Epub 2022 May 16.
不同计算方法在 5-甲基胞嘧啶位点识别中的评估。
Brief Bioinform. 2020 May 21;21(3):982-995. doi: 10.1093/bib/bbz048.
4
Identification of hormone binding proteins based on machine learning methods.基于机器学习方法的激素结合蛋白鉴定
Math Biosci Eng. 2019 Mar 22;16(4):2466-2480. doi: 10.3934/mbe.2019123.
5
i6mA-Pred: identifying DNA N6-methyladenine sites in the rice genome.i6mA-Pred:鉴定水稻基因组中的 DNA N6-甲基腺嘌呤位点。
Bioinformatics. 2019 Aug 15;35(16):2796-2800. doi: 10.1093/bioinformatics/btz015.
6
Identify origin of replication in Saccharomyces cerevisiae using two-step feature selection technique.使用两步特征选择技术鉴定酿酒酵母中的复制原点。
Bioinformatics. 2019 Jun 1;35(12):2075-2083. doi: 10.1093/bioinformatics/bty943.
7
Gene2vec: gene subsequence embedding for prediction of mammalian -methyladenosine sites from mRNA.Gene2vec:基于基因子序列的嵌体模型,用于从 mRNA 预测哺乳动物 m6A 修饰位点。
RNA. 2019 Feb;25(2):205-218. doi: 10.1261/rna.069112.118. Epub 2018 Nov 13.
8
LncRNA2Target v2.0: a comprehensive database for target genes of lncRNAs in human and mouse.LncRNA2Target v2.0:一个综合性的数据库,包含人类和小鼠中 lncRNA 的靶基因。
Nucleic Acids Res. 2019 Jan 8;47(D1):D140-D144. doi: 10.1093/nar/gky1051.
9
iTerm-PseKNC: a sequence-based tool for predicting bacterial transcriptional terminators.iTerm-PseKNC:一种基于序列的细菌转录终止子预测工具。
Bioinformatics. 2019 May 1;35(9):1469-1477. doi: 10.1093/bioinformatics/bty827.
10
Exploring sequence-based features for the improved prediction of DNA N4-methylcytosine sites in multiple species.探索基于序列的特征,以提高在多个物种中预测 DNA N4-甲基胞嘧啶位点的能力。
Bioinformatics. 2019 Apr 15;35(8):1326-1333. doi: 10.1093/bioinformatics/bty824.