School of Information and Computer, Anhui Agricultural University, Hefei, 230036, Anhui, China.
BMC Bioinformatics. 2023 Oct 30;24(1):410. doi: 10.1186/s12859-023-05536-1.
LncRNA-protein interactions are ubiquitous in organisms and play a crucial role in a variety of biological processes and complex diseases. Many computational methods have been reported for lncRNA-protein interaction prediction. However, the experimental techniques to detect lncRNA-protein interactions are laborious and time-consuming. Therefore, to address this challenge, this paper proposes a reweighting boosting feature selection (RBFS) method model to select key features. Specially, a reweighted apporach can adjust the contribution of each observational samples to learning model fitting; let higher weights are given more influence samples than those with lower weights. Feature selection with boosting can efficiently rank to iterate over important features to obtain the optimal feature subset. Besides, in the experiments, the RBFS method is applied to the prediction of lncRNA-protein interactions. The experimental results demonstrate that our method achieves higher accuracy and less redundancy with fewer features.
lncRNA-蛋白质相互作用在生物体中普遍存在,在多种生物过程和复杂疾病中发挥着关键作用。已经有许多计算方法被报道用于 lncRNA-蛋白质相互作用预测。然而,用于检测 lncRNA-蛋白质相互作用的实验技术既费力又耗时。因此,为了解决这一挑战,本文提出了一种重新加权的增强特征选择(RBFS)方法模型来选择关键特征。特别地,重新加权方法可以调整每个观测样本对学习模型拟合的贡献;让更高权重的样本比低权重的样本具有更大的影响力。通过增强的特征选择可以有效地对重要特征进行排序,从而获得最优的特征子集。此外,在实验中,该方法被应用于 lncRNA-蛋白质相互作用的预测。实验结果表明,我们的方法在使用较少特征的情况下,能够获得更高的准确性和更少的冗余。