School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China.
Key Laboratory of Agricultural Sensors, Ministry of Agriculture and Rural Affairs, Hefei, 230036, China.
BMC Genomics. 2024 May 30;25(1):531. doi: 10.1186/s12864-024-10439-3.
Non-coding RNAs (ncRNAs) are recognized as pivotal players in the regulation of essential physiological processes such as nutrient homeostasis, development, and stress responses in plants. Common methods for predicting ncRNAs are susceptible to significant effects of experimental conditions and computational methods, resulting in the need for significant investment of time and resources. Therefore, we constructed an ncRNA predictor(MFPINC), to predict potential ncRNA in plants which is based on the PINC tool proposed by our previous studies. Specifically, sequence features were carefully refined using variance thresholding and F-test methods, while deep features were extracted and feature fusion were performed by applying the GRU model. The comprehensive evaluation of multiple standard datasets shows that MFPINC not only achieves more comprehensive and accurate identification of gene sequences, but also significantly improves the expressive and generalization performance of the model, and MFPINC significantly outperforms the existing competing methods in ncRNA identification. In addition, it is worth mentioning that our tool can also be found on Github ( https://github.com/Zhenj-Nie/MFPINC ) the data and source code can also be downloaded for free.
非编码 RNA(ncRNAs)被认为是植物中营养稳态、发育和应激反应等重要生理过程调节的关键因素。常见的 ncRNA 预测方法容易受到实验条件和计算方法的显著影响,因此需要大量的时间和资源投入。因此,我们构建了一个 ncRNA 预测器(MFPINC),用于预测植物中的潜在 ncRNA,该预测器是基于我们之前研究中提出的 PINC 工具。具体来说,序列特征使用方差阈值和 F 检验方法进行了仔细的细化,而深度特征则通过应用 GRU 模型进行了提取和特征融合。对多个标准数据集的综合评估表明,MFPINC 不仅实现了对基因序列更全面、更准确的识别,而且显著提高了模型的表达和泛化性能,在 ncRNA 识别方面,MFPINC 显著优于现有的竞争方法。此外,值得一提的是,我们的工具也可以在 Github(https://github.com/Zhenj-Nie/MFPINC)上找到,数据和源代码也可以免费下载。