Suppr超能文献

通过清洗正、负化学相关性预测配体的生物活性。

Ligand biological activity predicted by cleaning positive and negative chemical correlations.

机构信息

Cavendish Laboratory, University of Cambridge, Cambridge CB3 0HE, United Kingdom;

Medicine Design, Pfizer Inc., Cambridge, MA 02139.

出版信息

Proc Natl Acad Sci U S A. 2019 Feb 26;116(9):3373-3378. doi: 10.1073/pnas.1810847116. Epub 2019 Feb 11.

Abstract

Predicting ligand biological activity is a key challenge in drug discovery. Ligand-based statistical approaches are often hampered by noise due to undersampling: The number of molecules known to be active or inactive is vastly less than the number of possible chemical features that might determine binding. We derive a statistical framework inspired by random matrix theory and combine the framework with high-quality negative data to discover important chemical differences between active and inactive molecules by disentangling undersampling noise. Our model outperforms standard benchmarks when tested against a set of challenging retrospective tests. We prospectively apply our model to the human muscarinic acetylcholine receptor M1, finding four experimentally confirmed agonists that are chemically dissimilar to all known ligands. The hit rate of our model is significantly higher than the state of the art. Our model can be interpreted and visualized to offer chemical insights about the molecular motifs that are synergistic or antagonistic to M1 agonism, which we have prospectively experimentally verified.

摘要

预测配体的生物活性是药物发现中的一个关键挑战。基于配体的统计方法通常受到采样不足的噪声的阻碍:已知具有活性或无活性的分子数量远远少于可能决定结合的可能化学特征的数量。我们从随机矩阵理论中得到了一个统计框架,并将该框架与高质量的负数据相结合,通过分离采样不足的噪声来发现活性和非活性分子之间的重要化学差异。我们的模型在一系列具有挑战性的回顾性测试中表现优于标准基准。我们前瞻性地将我们的模型应用于人类毒蕈碱乙酰胆碱受体 M1,发现了四种实验证实的与所有已知配体在化学上不同的激动剂。我们模型的命中率明显高于现有技术水平。我们的模型可以进行解释和可视化,提供关于与 M1 激动作用协同或拮抗的分子基序的化学见解,我们已经前瞻性地进行了实验验证。

相似文献

6
Homology modeling of human muscarinic acetylcholine receptors.人毒蕈碱型乙酰胆碱受体的同源建模。
J Chem Inf Model. 2014 Jan 27;54(1):243-53. doi: 10.1021/ci400502u. Epub 2013 Dec 23.
8
Predicting protein-ligand affinity with a random matrix framework.使用随机矩阵框架预测蛋白质-配体亲和力。
Proc Natl Acad Sci U S A. 2016 Nov 29;113(48):13564-13569. doi: 10.1073/pnas.1611138113. Epub 2016 Nov 16.

引用本文的文献

4
Exploring chemical compound space with quantum-based machine learning.利用基于量子的机器学习探索化合物空间。
Nat Rev Chem. 2020 Jul;4(7):347-358. doi: 10.1038/s41570-020-0189-9. Epub 2020 Jun 12.
6
Ab Initio Machine Learning in Chemical Compound Space.从头开始的化合物空间中的机器学习。
Chem Rev. 2021 Aug 25;121(16):10001-10036. doi: 10.1021/acs.chemrev.0c01303. Epub 2021 Aug 13.
10
Analyzing Learned Molecular Representations for Property Prediction.分析用于性质预测的学习分子表示。
J Chem Inf Model. 2019 Aug 26;59(8):3370-3388. doi: 10.1021/acs.jcim.9b00237. Epub 2019 Aug 13.

本文引用的文献

1
MoleculeNet: a benchmark for molecular machine learning.分子网络:分子机器学习的一个基准
Chem Sci. 2017 Oct 31;9(2):513-530. doi: 10.1039/c7sc02664a. eCollection 2018 Jan 14.
3
Optimal Design of Experiments by Combining Coarse and Fine Measurements.通过结合粗测量和精细测量实现实验的优化设计
Phys Rev Lett. 2017 Nov 17;119(20):208101. doi: 10.1103/PhysRevLett.119.208101. Epub 2017 Nov 16.
5
The ChEMBL database in 2017.2017年的ChEMBL数据库。
Nucleic Acids Res. 2017 Jan 4;45(D1):D945-D954. doi: 10.1093/nar/gkw1074. Epub 2016 Nov 28.
6
Predicting protein-ligand affinity with a random matrix framework.使用随机矩阵框架预测蛋白质-配体亲和力。
Proc Natl Acad Sci U S A. 2016 Nov 29;113(48):13564-13569. doi: 10.1073/pnas.1611138113. Epub 2016 Nov 16.
8
Molecular graph convolutions: moving beyond fingerprints.分子图卷积:超越指纹图谱
J Comput Aided Mol Des. 2016 Aug;30(8):595-608. doi: 10.1007/s10822-016-9938-8. Epub 2016 Aug 24.
9
Practical Aspects of Free-Energy Calculations: A Review.自由能计算的实际应用:综述
J Chem Theory Comput. 2014 Jul 8;10(7):2632-47. doi: 10.1021/ct500161f. Epub 2014 Jun 6.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验