Laboratoire de Chémoinformatique UMR 7140 CNRS, Institut Le Bel, University of Strasbourg, 4 Rue Blaise Pascal, 67081 Strasbourg, France.
Institut de Pharmacologie et de Biologie Structurale, Université de Toulouse CNRS, UPS, 205 Route de Narbonne, 31077 Toulouse, France.
Molecules. 2021 Jun 28;26(13):3950. doi: 10.3390/molecules26133950.
In this paper, we report comprehensive experimental and chemoinformatics analyses of the solubility of small organic molecules ("fragments") in dimethyl sulfoxide (DMSO) in the context of their ability to be tested in screening experiments. Here, DMSO solubility of 939 fragments has been measured experimentally using an NMR technique. A Support Vector Classification model was built on the obtained data using the ISIDA fragment descriptors. The analysis revealed 34 outliers: experimental issues were retrospectively identified for 28 of them. The updated model performs well in 5-fold cross-validation (balanced accuracy = 0.78). The datasets are available on the Zenodo platform (DOI:10.5281/zenodo.4767511) and the model is available on the website of the Laboratory of Chemoinformatics.
在本文中,我们报告了对小有机分子(“片段”)在二甲亚砜(DMSO)中的溶解度的综合实验和化学信息学分析,以评估它们在筛选实验中的可测试性。在这里,我们使用 NMR 技术实验测量了 939 个片段在 DMSO 中的溶解度。我们使用 ISIDA 片段描述符在获得的数据上构建了一个支持向量分类模型。分析揭示了 34 个异常值:其中 28 个异常值的实验问题已经被追溯识别。更新后的模型在 5 倍交叉验证中表现良好(平衡准确性 = 0.78)。数据集可在 Zenodo 平台上获得(DOI:10.5281/zenodo.4767511),模型可在化学信息学实验室的网站上获得。