Birolo Rebecca, Özçelik Rıza, Aramini Andrea, Gobetto Roberto, Chierotti Michele R, Grisoni Francesca
Institute for Complex Molecular Systems, Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands.
Department of Chemistry and NIS Centre, University of Torino, Torino, Italy.
Angew Chem Int Ed Engl. 2025 Jul;64(29):e202507835. doi: 10.1002/anie.202507835. Epub 2025 May 30.
Approximately 40% of marketed drugs exhibit suboptimal pharmacokinetic profiles. Co-crystallization, where pairs of molecules form a multicomponent crystal, constitutes a promising strategy to enhance physicochemical properties without compromising the pharmacological activity. However, finding promising co-crystal pairs is resource-intensive due to the large and diverse range of possible molecular combinations. We present DeepCocrystal, a novel deep learning approach designed to predict co-crystal formation by processing the "chemical language" from a supramolecular vantage point. Rigorous validation of DeepCocrystal showed a balanced accuracy of 78% in realistic scenarios, outperforming existing models. Explainable AI approaches uncovered the decision-making process of DeepCocrystal, showing its capability to learn chemically relevant aspects of the "supramolecular language" that match experimental co-crystallization patterns. By leveraging properties of molecular string representations, DeepCocrystal can also estimate the uncertainty of its predictions. We harnessed this capability in a challenging prospective study and successfully discovered two novel co-crystals of diflunisal, an anti-inflammatory drug. This study underscores the potential of deep learning-and in particular of chemical language processing-to accelerate co-crystallization and ultimately drug development, in both academic and industrial contexts. DeepCocrystal is available as an easy-to-use web application at https://deepcocrystal.streamlit.app/.
大约40%的已上市药物表现出不理想的药代动力学特征。共结晶是指分子对形成多组分晶体,是一种在不影响药理活性的情况下增强物理化学性质的有前景的策略。然而,由于可能的分子组合范围广泛且多样,寻找有前景的共结晶对需要大量资源。我们提出了DeepCocrystal,这是一种新颖的深度学习方法,旨在从超分子角度处理“化学语言”来预测共结晶的形成。对DeepCocrystal的严格验证表明,在实际场景中其平衡准确率达到78%,优于现有模型。可解释人工智能方法揭示了DeepCocrystal的决策过程,显示出它能够学习与实验共结晶模式相匹配的“超分子语言”的化学相关方面。通过利用分子字符串表示的特性,DeepCocrystal还可以估计其预测的不确定性。我们在一项具有挑战性的前瞻性研究中利用了这一能力,成功发现了抗炎药物双氯芬酸的两种新型共晶体。这项研究强调了深度学习——尤其是化学语言处理——在加速共结晶以及最终在学术和工业环境中的药物开发方面的潜力。DeepCocrystal可作为一个易于使用的网络应用程序在https://deepcocrystal.streamlit.app/上获取。