IEEE/ACM Trans Comput Biol Bioinform. 2023 Nov-Dec;20(6):3600-3608. doi: 10.1109/TCBB.2023.3305992. Epub 2023 Dec 25.
The accurate annotation of miRNA promoters is critical for the mechanistic understanding of miRNA gene regulation. Various computational methods have been developed for the prediction of miRNA promoters solely employing a single classifier. Most of these computational methods extract either sequence features or one-sided signal features, and the accuracy and reliability of predictions need to be improved. To address these issues, we present miPTP, a three-level prediction method that combines SVM, RF, and correlation coefficients. It is capable of identifying miRNA promoters based on both DNA sequence and ChIP-Seq data (RPol II). By sequentially integrating these two types of information sources with the three methods selected, miPTP can identify miRNA promoters with higher accuracy and sensitivity compared to specific existing methods. Finally, the reliability of miPTP is validated by examining the conservation, CpG content, and activating histone marks in the identified miRNA promoters.
miRNA 启动子的准确注释对于理解 miRNA 基因调控的机制至关重要。已经开发了各种计算方法,仅使用单个分类器来预测 miRNA 启动子。这些计算方法大多数提取序列特征或单边信号特征,并且预测的准确性和可靠性需要提高。为了解决这些问题,我们提出了 miPTP,这是一种基于 SVM、RF 和相关系数的三级预测方法。它能够基于 DNA 序列和 ChIP-Seq 数据(RPol II)识别 miRNA 启动子。通过依次将这两种类型的信息源与三种选定的方法相结合,miPTP 可以比特定的现有方法更准确和敏感地识别 miRNA 启动子。最后,通过检查鉴定的 miRNA 启动子中的保守性、CpG 含量和激活的组蛋白标记来验证 miPTP 的可靠性。