Zeng Rao, Cheng Song, Liao Minghong
Department of Software Engineering, School of Informatics, Xiamen University, Xiamen, China.
Department of Thoracic Surgery, Heilongjiang Province Land Reclamation Headquarters General Hospital, Harbin, China.
Front Cell Dev Biol. 2021 May 10;9:664669. doi: 10.3389/fcell.2021.664669. eCollection 2021.
DNA methylation is one of the most extensive epigenetic modifications. DNA 4mC modification plays a key role in regulating chromatin structure and gene expression. In this study, we proposed a generic 4mC computational predictor, namely, 4mCPred-MTL using multi-task learning coupled with Transformer to predict 4mC sites in multiple species. In this predictor, we utilize a multi-task learning framework, in which each task is to train species-specific data based on Transformer. Extensive experimental results show that our multi-task predictive model can significantly improve the performance of the model based on single task and outperform existing methods on benchmarking comparison. Moreover, we found that our model can sufficiently capture better characteristics of 4mC sites as compared to existing commonly used feature descriptors, demonstrating the strong feature learning ability of our model. Therefore, based on the above results, it can be expected that our 4mCPred-MTL can be a useful tool for research communities of interest.
DNA甲基化是最广泛的表观遗传修饰之一。DNA 4mC修饰在调节染色质结构和基因表达中起关键作用。在本研究中,我们提出了一种通用的4mC计算预测器,即4mCPred-MTL,它使用多任务学习结合Transformer来预测多个物种中的4mC位点。在这个预测器中,我们使用了一个多任务学习框架,其中每个任务是基于Transformer训练特定物种的数据。大量实验结果表明,我们的多任务预测模型可以显著提高基于单任务的模型性能,并且在基准比较中优于现有方法。此外,我们发现与现有的常用特征描述符相比,我们的模型能够充分捕捉4mC位点的更好特征,证明了我们模型强大的特征学习能力。因此,基于上述结果,可以预期我们的4mCPred-MTL能够成为相关研究群体的有用工具。