• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于网络的DNA甲基胞嘧啶位点预测工具的实证比较与分析

Empirical Comparison and Analysis of Web-Based DNA -Methylcytosine Site Prediction Tools.

作者信息

Manavalan Balachandran, Hasan Md Mehedi, Basith Shaherin, Gosu Vijayakumar, Shin Tae-Hwan, Lee Gwang

机构信息

Department of Physiology, Ajou University School of Medicine, Suwon 16499, Republic of Korea.

Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, Iizuka, Fukuoka 820-8502, Japan.

出版信息

Mol Ther Nucleic Acids. 2020 Sep 16;22:406-420. doi: 10.1016/j.omtn.2020.09.010. eCollection 2020 Dec 4.

DOI:10.1016/j.omtn.2020.09.010
PMID:33230445
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7533314/
Abstract

DNA -methylcytosine (4mC) is a crucial epigenetic modification involved in various biological processes. Accurate genome-wide identification of these sites is critical for improving our understanding of their biological functions and mechanisms. As experimental methods for 4mC identification are tedious, expensive, and labor-intensive, several machine learning-based approaches have been developed for genome-wide detection of such sites in multiple species. However, the predictions projected by these tools are difficult to quantify and compare. To date, no systematic performance comparison of 4mC tools has been reported. The aim of this study was to compare and critically evaluate 12 publicly available 4mC site prediction tools according to species specificity, based on a huge independent validation dataset. The tools 4mCCNN (), DNA4mC-LIP (), iDNA-MS (), DNA4mC-LIP and 4mCCNN (), and four tools for achieved excellent overall performance compared with their counterparts. However, none of the existing methods was suitable for , , and , thereby limiting their practical applicability. Model transferability to five species and non-transferability to three species are also discussed. The presented evaluation will assist researchers in selecting appropriate prediction tools that best suit their purpose and provide useful guidelines for the development of improved 4mC predictors in the future.

摘要

DNA甲基胞嘧啶(4mC)是一种参与多种生物过程的关键表观遗传修饰。对这些位点进行全基因组范围的准确识别,对于增进我们对其生物学功能和机制的理解至关重要。由于用于4mC识别的实验方法繁琐、昂贵且耗费人力,因此已开发出几种基于机器学习的方法来在多个物种中进行全基因组范围的此类位点检测。然而,这些工具所做出的预测难以进行量化和比较。迄今为止,尚未有关于4mC工具的系统性能比较的报道。本研究的目的是基于一个庞大的独立验证数据集,根据物种特异性对12种公开可用的4mC位点预测工具进行比较和批判性评估。与其他同类工具相比,工具4mCCNN()、DNA4mC-LIP()、iDNA-MS()、DNA4mC-LIP和4mCCNN()以及四种用于 的工具实现了出色的整体性能。然而,现有的方法均不适用于 、 和 ,从而限制了它们的实际适用性。还讨论了模型对五个物种的可转移性以及对三个物种的不可转移性。所呈现的评估将帮助研究人员选择最适合其目的的合适预测工具,并为未来改进4mC预测器的开发提供有用的指导方针。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3411/7533314/674f996cc4ee/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3411/7533314/021cf501b840/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3411/7533314/e899dc9fcf37/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3411/7533314/d7a21e8386bf/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3411/7533314/e3369eed22e5/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3411/7533314/674f996cc4ee/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3411/7533314/021cf501b840/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3411/7533314/e899dc9fcf37/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3411/7533314/d7a21e8386bf/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3411/7533314/e3369eed22e5/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3411/7533314/674f996cc4ee/gr4.jpg

相似文献

1
Empirical Comparison and Analysis of Web-Based DNA -Methylcytosine Site Prediction Tools.基于网络的DNA甲基胞嘧啶位点预测工具的实证比较与分析
Mol Ther Nucleic Acids. 2020 Sep 16;22:406-420. doi: 10.1016/j.omtn.2020.09.010. eCollection 2020 Dec 4.
2
DNA4mC-LIP: a linear integration method to identify N4-methylcytosine site in multiple species.DNA4mC-LIP:一种在多个物种中鉴定 N4-甲基胞嘧啶位点的线性整合方法。
Bioinformatics. 2020 Jun 1;36(11):3327-3335. doi: 10.1093/bioinformatics/btaa143.
3
4mCPred: machine learning methods for DNA N4-methylcytosine sites prediction.4mCPred:用于 DNA N4-甲基胞嘧啶位点预测的机器学习方法。
Bioinformatics. 2019 Feb 15;35(4):593-601. doi: 10.1093/bioinformatics/bty668.
4
SOMM4mC: a second-order Markov model for DNA N4-methylcytosine site prediction in six species.SOMM4mC:用于预测六种物种中 DNA N4-甲基胞嘧啶位点的二阶马尔可夫模型。
Bioinformatics. 2020 Aug 15;36(14):4103-4105. doi: 10.1093/bioinformatics/btaa507.
5
Critical evaluation of web-based DNA N6-methyladenine site prediction tools.基于网络的 DNA N6-甲基腺嘌呤位点预测工具的批判性评估。
Brief Funct Genomics. 2021 Jul 17;20(4):258-272. doi: 10.1093/bfgp/elaa028.
6
Identifying DNA N4-methylcytosine sites in the rosaceae genome with a deep learning model relying on distributed feature representation.使用基于分布式特征表示的深度学习模型识别蔷薇科基因组中的DNA N4-甲基胞嘧啶位点。
Comput Struct Biotechnol J. 2021 Mar 19;19:1612-1619. doi: 10.1016/j.csbj.2021.03.015. eCollection 2021.
7
Deep4mC: systematic assessment and computational prediction for DNA N4-methylcytosine sites by deep learning.Deep4mC:通过深度学习对 DNA N4-甲基胞嘧啶位点进行系统评估和计算预测。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa099.
8
Systematic Analysis and Accurate Identification of DNA N4-Methylcytosine Sites by Deep Learning.基于深度学习的DNA N4-甲基胞嘧啶位点的系统分析与准确识别
Front Microbiol. 2022 Mar 15;13:843425. doi: 10.3389/fmicb.2022.843425. eCollection 2022.
9
Exploring sequence-based features for the improved prediction of DNA N4-methylcytosine sites in multiple species.探索基于序列的特征,以提高在多个物种中预测 DNA N4-甲基胞嘧啶位点的能力。
Bioinformatics. 2019 Apr 15;35(8):1326-1333. doi: 10.1093/bioinformatics/bty824.
10
Accurate prediction of DNA N-methylcytosine sites via boost-learning various types of sequence features.通过提升学习多种类型的序列特征来准确预测 DNA N-甲基胞嘧啶位点。
BMC Genomics. 2020 Sep 11;21(1):627. doi: 10.1186/s12864-020-07033-8.

引用本文的文献

1
Shaping eukaryotic epigenetic systems by horizontal gene transfer.通过水平基因转移塑造真核表观遗传系统。
Bioessays. 2023 Jul;45(7):e2200232. doi: 10.1002/bies.202200232.
2
A Grid Search-Based Multilayer Dynamic Ensemble System to Identify DNA N4-Methylcytosine Using Deep Learning Approach.基于网格搜索的多层动态集成系统,利用深度学习方法识别 DNA N4-甲基胞嘧啶。
Genes (Basel). 2023 Feb 25;14(3):582. doi: 10.3390/genes14030582.
3
MultiScale-CNN-4mCPred: a multi-scale CNN and adaptive embedding-based method for mouse genome DNA N4-methylcytosine prediction.

本文引用的文献

1
DeepTorrent: a deep learning-based approach for predicting DNA N4-methylcytosine sites.DeepTorrent:一种基于深度学习的方法,用于预测 DNA N4-甲基胞嘧啶位点。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa124.
2
Deep4mC: systematic assessment and computational prediction for DNA N4-methylcytosine sites by deep learning.Deep4mC:通过深度学习对 DNA N4-甲基胞嘧啶位点进行系统评估和计算预测。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa099.
3
i4mC-Mouse: Improved identification of DNA N4-methylcytosine sites in the mouse genome using multiple encoding schemes.
多尺度 CNN-4mCPred:一种基于多尺度 CNN 和自适应嵌入的方法,用于预测小鼠基因组 DNA N4-甲基胞嘧啶。
BMC Bioinformatics. 2023 Jan 18;24(1):21. doi: 10.1186/s12859-023-05135-0.
4
Deepm5C: A deep-learning-based hybrid framework for identifying human RNA N5-methylcytosine sites using a stacking strategy.Deepm5C:一种基于深度学习的混合框架,使用堆叠策略识别人类 RNA N5-甲基胞嘧啶位点。
Mol Ther. 2022 Aug 3;30(8):2856-2867. doi: 10.1016/j.ymthe.2022.05.001. Epub 2022 May 6.
5
Systematic Analysis and Accurate Identification of DNA N4-Methylcytosine Sites by Deep Learning.基于深度学习的DNA N4-甲基胞嘧啶位点的系统分析与准确识别
Front Microbiol. 2022 Mar 15;13:843425. doi: 10.3389/fmicb.2022.843425. eCollection 2022.
6
Prediction of serine phosphorylation sites mapping on Schizosaccharomyces Pombe by fusing three encoding schemes with the random forest classifier.通过将三种编码方案与随机森林分类器融合,预测丝氨酸磷酸化位点在裂殖酵母中的映射。
Sci Rep. 2022 Feb 16;12(1):2632. doi: 10.1038/s41598-022-06529-5.
7
SortPred: The first machine learning based predictor to identify bacterial sortases and their classes using sequence-derived information.SortPred:首个基于机器学习的预测工具,可利用序列衍生信息识别细菌分选酶及其类别。
Comput Struct Biotechnol J. 2021 Dec 14;20:165-174. doi: 10.1016/j.csbj.2021.12.014. eCollection 2022.
8
Comparative analysis of machine learning-based approaches for identifying therapeutic peptides targeting SARS-CoV-2.基于机器学习的方法识别针对 SARS-CoV-2 的治疗性肽的比较分析。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab412.
9
An Improved Computational Prediction Model for Lysine Succinylation Sites Mapping on Fusing Three Sequence Encoding Schemes with the Random Forest Classifier.一种改进的计算预测模型,用于通过融合三种序列编码方案与随机森林分类器来映射赖氨酸琥珀酰化位点
Curr Genomics. 2021 Feb;22(2):122-136. doi: 10.2174/1389202922666210219114211.
10
A convolution based computational approach towards DNA N6-methyladenine site identification and motif extraction in rice genome.基于卷积的计算方法,用于鉴定水稻基因组中的 DNA N6-甲基腺嘌呤位点并提取其基序。
Sci Rep. 2021 May 14;11(1):10357. doi: 10.1038/s41598-021-89850-9.
i4mC-Mouse:使用多种编码方案改进对小鼠基因组中DNA N4-甲基胞嘧啶位点的识别。
Comput Struct Biotechnol J. 2020 Apr 8;18:906-912. doi: 10.1016/j.csbj.2020.04.001. eCollection 2020.
4
CirRNAPL: A web server for the identification of circRNA based on extreme learning machine.CirRNAPL:一个基于极限学习机的环状RNA识别网络服务器。
Comput Struct Biotechnol J. 2020 Apr 2;18:834-842. doi: 10.1016/j.csbj.2020.03.028. eCollection 2020.
5
Early Diagnosis of Hepatocellular Carcinoma Using Machine Learning Method.基于机器学习方法的肝细胞癌早期诊断
Front Bioeng Biotechnol. 2020 Mar 27;8:254. doi: 10.3389/fbioe.2020.00254. eCollection 2020.
6
iDNA-MS: An Integrated Computational Tool for Detecting DNA Modification Sites in Multiple Genomes.iDNA-MS:一种用于检测多个基因组中DNA修饰位点的综合计算工具。
iScience. 2020 Apr 24;23(4):100991. doi: 10.1016/j.isci.2020.100991. Epub 2020 Mar 19.
7
A Deep Neural Network for Identifying DNA N4-Methylcytosine Sites.用于识别DNA N4-甲基胞嘧啶位点的深度神经网络
Front Genet. 2020 Mar 6;11:209. doi: 10.3389/fgene.2020.00209. eCollection 2020.
8
Deep Learning to Predict Protein Backbone Structure from High-Resolution Cryo-EM Density Maps.深度学习从高分辨率冷冻电镜密度图预测蛋白质骨架结构。
Sci Rep. 2020 Mar 9;10(1):4282. doi: 10.1038/s41598-020-60598-y.
9
HLPpred-Fuse: improved and robust prediction of hemolytic peptide and its activity by fusing multiple feature representation.HLPpred-Fuse:通过融合多种特征表示提高和增强溶血肽及其活性的预测
Bioinformatics. 2020 Jun 1;36(11):3350-3356. doi: 10.1093/bioinformatics/btaa160.
10
i6mA-Fuse: improved and robust prediction of DNA 6 mA sites in the Rosaceae genome by fusing multiple feature representation.i6mA-Fuse:通过融合多种特征表示来改进和增强蔷薇科基因组中 DNA 6mA 位点的预测
Plant Mol Biol. 2020 May;103(1-2):225-234. doi: 10.1007/s11103-020-00988-y. Epub 2020 Mar 5.