Andola Priyanka, Doble Mukesh
University of Hyderabad, Hyderabad-500046, India.
Department of Cariology, Saveetha Dental College and Hospitals, SIMATS, Chennai, 600077, India.
Curr Top Med Chem. 2025;25(2):209-227. doi: 10.2174/0115680266331755241008061915.
Epilepsy remains the most common and chronic disorder demanding longterm management. The impact of epilepsy disease is a cause of great concern and has resulted in efforts to develop treatment for epilepsy. It occurs due to an increase in neuronal excitability produced by changes affecting the voltage-dependent properties of Voltage-gated Sodium Channels (VGSCs).
Weka, a popular suite for machine learning techniques, was used on a dataset comprising 1781 chemical compounds, showing inhibition activity for sodium channel protein IX alpha subunit. After the analysis of the dataset obtained from ChEMBL, molecular fingerprints were computed for the molecules by the ChemDes server. Different classifiers available in the Weka software were explored to find out the algorithm that could be more suitable for the dataset or produce the highest accuracy for the classification of molecules as active or inactive.
In this work, a comprehensive comparison of different classifiers in the Weka suite for the prediction of active, inactive, and intermediate classes of molecules showing inhibition against human NaV1.7 protein was made. The prediction accuracy of these classifiers was assessed based on performance measures, including accuracy, Root Mean Squared Error (RMSE), Receiver Operating Characteristic (ROC), precision, Mathews Correlation Coefficient (MCC), recall, and Fmeasure. The comparison of results for model performance demonstrated that the OneR classifier performed best over others when validated using percentage split, cross-validation, and supplied test methods. J48 and Bagging also performed equally well in the prediction of different classes with an MCC value of 1, ROC area equal to 1, and RMSE close to 0.
Machine Learning (ML) tools provide a fast, reliable, and cost-effective approach required to identify or predict inhibitory molecules for the treatment of a disease. This study shows that the ML methods, particularly OneR, J48, and Bagging have the ability to identify active and inactive classes of compounds for the human NaV1.7 protein target. Such predictive models may provide a reliable and time-saving approach that can aid in the design of potential inhibitors for the treatment of epilepsy disease.
癫痫仍然是最常见的慢性疾病,需要长期管理。癫痫疾病的影响令人高度关注,促使人们努力研发癫痫治疗方法。癫痫的发生是由于影响电压门控钠通道(VGSCs)电压依赖性特性的变化导致神经元兴奋性增加所致。
Weka是一套流行的机器学习技术软件包,用于一个包含1781种化合物的数据集,这些化合物对钠通道蛋白IXα亚基具有抑制活性。在分析从ChEMBL获得的数据集后,通过ChemDes服务器为这些分子计算分子指纹。探索了Weka软件中可用的不同分类器,以找出更适合该数据集或对分子活性或非活性分类产生最高准确率的算法。
在这项工作中,对Weka软件包中不同分类器针对显示对人NaV1.7蛋白有抑制作用的分子的活性、非活性和中间类别预测进行了全面比较。基于包括准确率、均方根误差(RMSE)、受试者工作特征曲线(ROC)、精确率、马修斯相关系数(MCC)、召回率和F值等性能指标评估了这些分类器的预测准确率。模型性能结果的比较表明,当使用百分比分割、交叉验证和提供的测试方法进行验证时,OneR分类器的表现优于其他分类器。J48和Bagging在不同类别的预测中表现同样出色,MCC值为1,ROC面积等于1,RMSE接近0。
机器学习(ML)工具提供了一种快速、可靠且具有成本效益的方法,用于识别或预测治疗疾病的抑制性分子。本研究表明,ML方法,特别是OneR、J48和Bagging,有能力识别针对人NaV1.7蛋白靶点的活性和非活性化合物类别。这种预测模型可能提供一种可靠且节省时间的方法,有助于设计治疗癫痫疾病的潜在抑制剂。