Alhalaweh Amjad, Alzghoul Ahmad, Kaialy Waseem
Department of Health Science, Luleå University of Technology , Luleå , Sweden .
Drug Dev Ind Pharm. 2014 Jul;40(7):904-9. doi: 10.3109/03639045.2013.789906. Epub 2013 Apr 29.
Abstract Computational data mining is of interest in the pharmaceutical arena for the analysis of massive amounts of data and to assist in the management and utilization of the data. In this study, a data mining approach was used to predict the miscibility of a drug and several excipients, using Hansen solubility parameters (HSPs) as the data set. The K-means clustering algorithm was applied to predict the miscibility of indomethacin with a set of more than 30 compounds based on their partial solubility parameters [dispersion forces (δd), polar forces (δp) and hydrogen bonding (δh)]. The miscibility of the compounds was determined experimentally, using differential scanning calorimetry (DSC), in a separate study. The results of the K-means algorithm and DSC were compared to evaluate the K-means clustering prediction performance using the HSPs three-dimensional parameters, the two-dimensional parameters such as volume-dependent solubility (δv) and hydrogen bonding (δh) and selected single (one-dimensional) parameters. Using HSPs, the prediction of miscibility by the K-means algorithm correlated well with the DSC results, with an overall accuracy of 94%. The prediction accuracy was the same (94%) when the two-dimensional parameters or the hydrogen-bonding (one-dimensional) parameter were used. The hydrogen-bonding parameter was thus a determining factor in predicting miscibility in such set of compounds, whereas the dispersive and polar parameters had only a weak correlation. The results show that data mining approach is a valuable tool for predicting drug-excipient miscibility because it is easy to use, is time and cost-effective, and is material sparing.
摘要 计算数据挖掘在制药领域备受关注,可用于分析大量数据,并协助数据的管理和利用。在本研究中,采用数据挖掘方法,以 Hansen 溶解度参数(HSPs)作为数据集,预测药物与几种辅料的混溶性。应用 K 均值聚类算法,根据吲哚美辛与 30 多种化合物的部分溶解度参数[色散力(δd)、极性力(δp)和氢键(δh)]预测其混溶性。在另一项研究中,使用差示扫描量热法(DSC)通过实验确定化合物的混溶性。比较 K 均值算法和 DSC 的结果,以评估使用 HSPs 三维参数、二维参数(如体积依赖性溶解度(δv)和氢键(δh))以及选定的单一(一维)参数时 K 均值聚类的预测性能。使用 HSPs 时,K 均值算法对混溶性的预测与 DSC 结果相关性良好,总体准确率为 94%。使用二维参数或氢键(一维)参数时,预测准确率相同(94%)。因此,氢键参数是预测此类化合物混溶性的决定性因素,而色散和极性参数的相关性较弱。结果表明,数据挖掘方法是预测药物-辅料混溶性的有价值工具,因为它易于使用、具有时间和成本效益且节省材料。