Suppr超能文献

当前基于口袋的3D生成模型有多好?:基于蛋白质口袋的3D分子生成模型的基准集与评估

How Good are Current Pocket-Based 3D Generative Models?: The Benchmark Set and Evaluation of Protein Pocket-Based 3D Molecular Generative Models.

作者信息

Liu Haoyang, Qin Yifei, Niu Zhangming, Xu Mingyuan, Wu Jiaqiang, Xiao Xianglu, Lei Jinping, Ran Ting, Chen Hongming

机构信息

State Key Laboratory of Medicinal Chemical Biology and College of Life Sciences, Nankai University, 94 Weijin Road, Tianjin 300071, China.

Division of Drug and Vaccine Research, Guangzhou National Laboratory, Guangzhou 510005, Guangdong, China.

出版信息

J Chem Inf Model. 2024 Dec 23;64(24):9260-9275. doi: 10.1021/acs.jcim.4c01598. Epub 2024 Dec 4.

Abstract

The development of a three-dimensional (3D) molecular generative model based on protein pockets has recently attracted a lot of attention. This type of model aims to achieve the simultaneous generation of molecular graphs and 3D binding conformation under the constraint of protein binding. Various pocket-based generative models have been proposed; however, currently, there is a lack of systematic and objective evaluation metrics for these models. To address this issue, a comprehensive benchmark data set, named POKMOL-3D, is proposed to evaluate protein pocket-based 3D molecular generative models. It includes 32 protein targets together with their known active compounds as a test set to evaluate the versatility of generation models to mimic the real-world scenario. Additionally, a series of two-dimensional (2D) and 3D evaluation metrics with some newly created ones was integrated to assess the quality of generated molecular structures and their binding conformations. It is expected that this work can enhance our comprehension of the effectiveness and weakness of current 3D generative models and stimulate the discussion on challenges and useful guidance for developing the next wave of molecular generative models.

摘要

基于蛋白质口袋的三维(3D)分子生成模型的发展近来备受关注。这类模型旨在在蛋白质结合的约束下实现分子图和3D结合构象的同时生成。已经提出了各种基于口袋的生成模型;然而,目前对于这些模型缺乏系统且客观的评估指标。为解决这一问题,提出了一个名为POKMOL - 3D的综合基准数据集,用于评估基于蛋白质口袋的3D分子生成模型。它包括32个蛋白质靶点及其已知的活性化合物作为测试集,以评估生成模型模拟真实场景的通用性。此外,整合了一系列二维(2D)和3D评估指标以及一些新创建的指标,以评估生成的分子结构及其结合构象的质量。期望这项工作能够增进我们对当前3D生成模型的有效性和弱点的理解,并激发关于开发下一代分子生成模型的挑战和有用指导的讨论。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验