Huiwen Jiang, Kai Song
School of Mathematics and Statistics, Qingdao University, Qingdao, Shandong, China.
Microrna. 2024;13(2):155-165. doi: 10.2174/0122115366288068240322064431.
Long non-coding RNA (lncRNA) plays a crucial role in various biological processes, and mutations or imbalances of lncRNAs can lead to several diseases, including cancer, Prader-Willi syndrome, autism, Alzheimer's disease, cartilage-hair hypoplasia, and hearing loss. Understanding lncRNA-protein interactions (LPIs) is vital for elucidating basic cellular processes, human diseases, viral replication, transcription, and plant pathogen resistance. Despite the development of several LPI calculation methods, predicting LPI remains challenging, with the selection of variables and deep learning structure being the focus of LPI research.
We propose a deep learning framework called AR-LPI, which extracts sequence and secondary structure features of proteins and lncRNAs. The framework utilizes an auto-encoder for feature extraction and employs SE-ResNet for prediction. Additionally, we apply transfer learning to the deep neural network SE-ResNet for predicting small-sample datasets.
Through comprehensive experimental comparison, we demonstrate that the AR-LPI architecture performs better in LPI prediction. Specifically, the accuracy of AR-LPI increases by 2.86% to 94.52%, while the F-value of AR-LPI increases by 2.71% to 94.73%.
Our experimental results show that the overall performance of AR-LPI is better than that of other LPI prediction tools.
长链非编码RNA(lncRNA)在各种生物过程中起着至关重要的作用,lncRNA的突变或失衡会导致多种疾病,包括癌症、普拉德-威利综合征、自闭症、阿尔茨海默病、软骨毛发发育不全和听力损失。了解lncRNA-蛋白质相互作用(LPI)对于阐明基本细胞过程、人类疾病、病毒复制、转录和植物病原体抗性至关重要。尽管已经开发了几种LPI计算方法,但预测LPI仍然具有挑战性,变量的选择和深度学习结构是LPI研究的重点。
我们提出了一个名为AR-LPI的深度学习框架,该框架提取蛋白质和lncRNA的序列和二级结构特征。该框架利用自动编码器进行特征提取,并采用SE-ResNet进行预测。此外,我们将迁移学习应用于深度神经网络SE-ResNet以预测小样本数据集。
通过全面的实验比较,我们证明AR-LPI架构在LPI预测中表现更好。具体而言,AR-LPI的准确率提高了2.86%,达到94.52%,而AR-LPI的F值提高了2.71%,达到94.73%。
我们的实验结果表明,AR-LPI的整体性能优于其他LPI预测工具。