Huang Guohua, Huang Xiaohong, Luo Wei
School of Information Technology and Administration, Hunan University of Finance and Economics, Changsha, China.
College of Information Science and Engineering, Shaoyang University, Shaoyang, Hunan, 422000, China.
BioData Min. 2023 Nov 27;16(1):34. doi: 10.1186/s13040-023-00348-8.
DNA N6-adenine methylation (N6-methyladenine, 6mA) plays a key regulating role in the cellular processes. Precisely recognizing 6mA sites is of importance to further explore its biological functions. Although there are many developed computational methods for 6mA site prediction over the past decades, there is a large root left to improve. We presented a cross validation-based stacking ensemble model for 6mA site prediction, called 6mA-StackingCV. The 6mA-StackingCV is a type of meta-learning algorithm, which uses output of cross validation as input to the final classifier. The 6mA-StackingCV reached the state of the art performances in the Rosaceae independent test. Extensive tests demonstrated the stability and the flexibility of the 6mA-StackingCV. We implemented the 6mA-StackingCV as a user-friendly web application, which allows one to restrictively choose representations or learning algorithms. This application is freely available at http://www.biolscience.cn/6mA-stackingCV/ . The source code and experimental data is available at https://github.com/Xiaohong-source/6mA-stackingCV .
DNA N6-腺嘌呤甲基化(N6-甲基腺嘌呤,6mA)在细胞过程中发挥关键调控作用。精确识别6mA位点对于进一步探索其生物学功能至关重要。尽管在过去几十年中已经开发出许多用于6mA位点预测的计算方法,但仍有很大的改进空间。我们提出了一种基于交叉验证的堆叠集成模型用于6mA位点预测,称为6mA-StackingCV。6mA-StackingCV是一种元学习算法,它将交叉验证的输出作为最终分类器的输入。6mA-StackingCV在蔷薇科独立测试中达到了当前的最优性能。广泛的测试证明了6mA-StackingCV的稳定性和灵活性。我们将6mA-StackingCV实现为一个用户友好的网络应用程序,它允许用户有选择地选择特征表示或学习算法。该应用程序可在http://www.biolscience.cn/6mA-stackingCV/免费获取。源代码和实验数据可在https://github.com/Xiaohong-source/6mA-stackingCV获取。