基于机器学习的化学结合相似性，利用靶基因的进化关系。

Machine learning-based chemical binding similarity using evolutionary relationships of target genes.

机构信息

Natural Product Informatics Research Center, KIST Gangneung Institute of Natural Products, Gangneung 25451, Republic of Korea.

Department of Bioinformatics and Life Science, Soongsil University, Seoul 06978, Republic of Korea.

出版信息

Nucleic Acids Res. 2019 Nov 18;47(20):e128. doi: 10.1093/nar/gkz743.

DOI:10.1093/nar/gkz743

PMID:31504818

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6846180/

Abstract

Chemical similarity searching is a basic research tool that can be used to find small molecules which are similar in shape to known active molecules. Despite its popularity, the retrieval of local molecular features that are critical to functional activity related to target binding often fails. To overcome this limitation, we developed a novel machine learning-based chemical binding similarity score by using various evolutionary relationships of binding targets. The chemical similarity was defined by the probability of chemical compounds binding to identical targets. Comprehensive and heterogeneous multiple target-binding chemical data were integrated into a paired data format and processed using multiple classification similarity-learning models with various levels of target evolutionary information. Encoding evolutionary information to chemical compounds through their binding targets substantially expanded available chemical-target interaction data and significantly improved model performance. The output probability of our integrated model, referred to as ensemble evolutionary chemical binding similarity (ensECBS), was effective for finding hidden chemical relationships. The developed method can serve as a novel chemical similarity tool that uses evolutionarily conserved target binding information.

摘要

化学相似性搜索是一种基本的研究工具，可用于寻找与已知活性分子形状相似的小分子。尽管它很受欢迎，但经常无法检索到与靶标结合相关的功能活性关键的局部分子特征。为了克服这一限制，我们利用结合靶标的各种进化关系，开发了一种基于新型机器学习的化学结合相似性评分。化学相似性通过化合物与相同靶标结合的概率来定义。综合且异构的多种靶标结合化学数据被整合到配对数据格式中，并使用具有不同靶标进化信息水平的多种分类相似性学习模型进行处理。通过结合靶标对化合物进行编码进化信息，极大地扩展了可用的化学-靶标相互作用数据，并显著提高了模型性能。我们的集成模型的输出概率，称为集成进化化学结合相似性（ensECBS），对于发现隐藏的化学关系非常有效。所开发的方法可以作为一种新的化学相似性工具，利用进化上保守的靶标结合信息。

相似文献

Machine learning-based chemical binding similarity using evolutionary relationships of target genes.

Nucleic Acids Res. 2019 Nov 18;47(20):e128. doi: 10.1093/nar/gkz743.

Evolutionary chemical binding similarity approach integrated with 3D-QSAR method for effective virtual screening.

BMC Bioinformatics. 2020 Jul 14;21(1):309. doi: 10.1186/s12859-020-03643-x.

Redefining the Protein Kinase Conformational Space with Machine Learning.

Cell Chem Biol. 2018 Jul 19;25(7):916-924.e2. doi: 10.1016/j.chembiol.2018.05.002. Epub 2018 May 31.

Data structures for computational compound promiscuity analysis and exemplary applications to inhibitors of the human kinome.

J Comput Aided Mol Des. 2020 Jan;34(1):1-10. doi: 10.1007/s10822-019-00266-0. Epub 2019 Dec 2.

Kinome-Wide Profiling Prediction of Small Molecules.

ChemMedChem. 2018 Mar 20;13(6):495-499. doi: 10.1002/cmdc.201700180. Epub 2017 Jun 26.

A multimodal Transformer Network for protein-small molecule interactions enhances predictions of kinase inhibition and enzyme-substrate relationships.

PLoS Comput Biol. 2024 May 20;20(5):e1012100. doi: 10.1371/journal.pcbi.1012100. eCollection 2024 May.

Quantitative proteomics of kinase inhibitor targets and mechanisms.

ACS Chem Biol. 2015 Jan 16;10(1):201-12. doi: 10.1021/cb5008794. Epub 2014 Dec 17.

Prospects for pharmacological targeting of pseudokinases.

Nat Rev Drug Discov. 2019 Jul;18(7):501-526. doi: 10.1038/s41573-019-0018-3.

Assessing protein kinase target similarity: Comparing sequence, structure, and cheminformatics approaches.

Biochim Biophys Acta. 2015 Oct;1854(10 Pt B):1605-16. doi: 10.1016/j.bbapap.2015.05.004. Epub 2015 May 19.

Inhibitors paradoxically prime kinases.

Nat Chem Biol. 2009 Jul;5(7):448-9. doi: 10.1038/nchembio.f.11.

引用本文的文献

Developing a quantitative structure-property relationships (QSPR) model using Caco-2 cell bioavailability indicators (BA) to predict the BA of phytochemicals.

J Sci Food Agric. 2025 Sep;105(12):6850-6861. doi: 10.1002/jsfa.14400. Epub 2025 May 30.

Novel target identification towards drug repurposing based on biological activity profiles.

PLoS One. 2025 May 6;20(5):e0319865. doi: 10.1371/journal.pone.0319865. eCollection 2025.

New Small-Molecule SERCA Inhibitors Enhance Treatment Efficacy in Lenvatinib-Resistant Papillary Thyroid Cancer.

Int J Mol Sci. 2024 Oct 3;25(19):10646. doi: 10.3390/ijms251910646.

Iterative machine learning-based chemical similarity search to identify novel chemical inhibitors.

J Cheminform. 2023 Sep 23;15(1):86. doi: 10.1186/s13321-023-00760-6.

Ecdysteroids from the Korean Endemic Species with Activities against Glucocorticoid Receptors and 11β-Hydroxysteroid Dehydrogenase Type 1.

ACS Omega. 2023 Jul 13;8(29):26191-26200. doi: 10.1021/acsomega.3c02421. eCollection 2023 Jul 25.

Ginsenoside Rd ameliorates muscle wasting by suppressing the signal transducer and activator of transcription 3 pathway.

J Cachexia Sarcopenia Muscle. 2022 Dec;13(6):3149-3162. doi: 10.1002/jcsm.13084. Epub 2022 Sep 20.

Drug Discovery Using Evolutionary Similarities in Chemical Binding to Inhibit Patient-Derived Hepatocellular Carcinoma.

Int J Mol Sci. 2022 Jul 19;23(14):7971. doi: 10.3390/ijms23147971.

Bioactivity assessment of natural compounds using machine learning models trained on target similarity between drugs.

PLoS Comput Biol. 2022 Apr 25;18(4):e1010029. doi: 10.1371/journal.pcbi.1010029. eCollection 2022 Apr.

Identification of Tyrosinase Inhibitors and Their Structure-Activity Relationships via Evolutionary Chemical Binding Similarity and Structure-Based Methods.

Molecules. 2021 Jan 22;26(3):566. doi: 10.3390/molecules26030566.

Artificial Intelligence in Drug Discovery: A Comprehensive Review of Data-driven and Machine Learning Approaches.

Biotechnol Bioprocess Eng. 2020;25(6):895-930. doi: 10.1007/s12257-020-0049-y. Epub 2021 Jan 7.

本文引用的文献

Chemical Control of Mammalian Circadian Behavior through Dual Inhibition of Casein Kinase Iα and δ.

J Med Chem. 2019 Feb 28;62(4):1989-1998. doi: 10.1021/acs.jmedchem.8b01541. Epub 2019 Feb 15.

DrugBank 5.0: a major update to the DrugBank database for 2018.

Nucleic Acids Res. 2018 Jan 4;46(D1):D1074-D1082. doi: 10.1093/nar/gkx1037.

Gene3D: Extensive prediction of globular domains in proteins.

Nucleic Acids Res. 2018 Jan 4;46(D1):D435-D439. doi: 10.1093/nar/gkx1069.

20 years of the SMART protein domain annotation resource.

Nucleic Acids Res. 2018 Jan 4;46(D1):D493-D496. doi: 10.1093/nar/gkx922.

From machine learning to deep learning: progress in machine intelligence for rational drug discovery.

Drug Discov Today. 2017 Nov;22(11):1680-1685. doi: 10.1016/j.drudis.2017.08.010. Epub 2017 Sep 4.

Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery.

Sci Rep. 2017 May 25;7(1):2403. doi: 10.1038/s41598-017-02114-3.

Selecting high-quality negative samples for effectively predicting protein-RNA interactions.

BMC Syst Biol. 2017 Mar 14;11(Suppl 2):9. doi: 10.1186/s12918-017-0390-8.

Development of Potent, Selective SRPK1 Inhibitors as Potential Topical Therapeutics for Neovascular Eye Disease.

ACS Chem Biol. 2017 Mar 17;12(3):825-832. doi: 10.1021/acschembio.6b01048. Epub 2017 Feb 6.

InterPro in 2017-beyond protein family and domain annotations.

Nucleic Acids Res. 2017 Jan 4;45(D1):D190-D199. doi: 10.1093/nar/gkw1107. Epub 2016 Nov 29.

UniProt: the universal protein knowledgebase.

Nucleic Acids Res. 2017 Jan 4;45(D1):D158-D169. doi: 10.1093/nar/gkw1099. Epub 2016 Nov 29.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于机器学习的化学结合相似性，利用靶基因的进化关系。

Machine learning-based chemical binding similarity using evolutionary relationships of target genes.

机构信息

Natural Product Informatics Research Center, KIST Gangneung Institute of Natural Products, Gangneung 25451, Republic of Korea.

Department of Bioinformatics and Life Science, Soongsil University, Seoul 06978, Republic of Korea.

出版信息

Nucleic Acids Res. 2019 Nov 18;47(20):e128. doi: 10.1093/nar/gkz743.

DOI:10.1093/nar/gkz743

PMID:31504818

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6846180/

Abstract

摘要

基于机器学习的化学结合相似性，利用靶基因的进化关系。

Machine learning-based chemical binding similarity using evolutionary relationships of target genes.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于机器学习的化学结合相似性，利用靶基因的进化关系。

Machine learning-based chemical binding similarity using evolutionary relationships of target genes.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献