• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用无监督学习对已批准药物进行分子描述符分析,以实现药物再利用。

Molecular descriptor analysis of approved drugs using unsupervised learning for drug repurposing.

机构信息

Centre for Molecular Modeling, CSIR-Indian Institute of Chemical Technology, Hyderabad, 500007, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India.

Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India; Advanced Computation and Data Sciences Division, CSIR - North East Institute of Science and Technology, Jorhat, Assam, 785 006, India.

出版信息

Comput Biol Med. 2021 Nov;138:104856. doi: 10.1016/j.compbiomed.2021.104856. Epub 2021 Sep 10.

DOI:10.1016/j.compbiomed.2021.104856
PMID:34555571
Abstract

Machine learning and data-driven approaches are currently being widely used in drug discovery and development due to their potential advantages in decision-making based on the data leveraged from existing sources. Applying these approaches to drug repurposing (DR) studies can identify new relationships between drug molecules, therapeutic targets and diseases that will eventually help in generating new insights for developing novel therapeutics. In the current study, a dataset of 1671 approved drugs is analyzed using a combined approach involving unsupervised Machine Learning (ML) techniques (Principal Component Analysis (PCA) followed by k-means clustering) and Structure-Activity Relationships (SAR) predictions for DR. PCA is applied on all the two dimensional (2D) molecular descriptors of the dataset and the first five Principal Components (PC) were subsequently used to cluster the drugs into nine well separated clusters using k-means algorithm. We further predicted the biological activities for the drug-dataset using the PASS (Predicted Activities Spectra of Substances) tool. These predicted activity values are analyzed systematically to identify repurposable drugs for various diseases. Clustering patterns obtained from k-means showed that every cluster contains subgroups of structurally similar drugs that may or may not have similar therapeutic indications. We hypothesized that such structurally similar but therapeutically different drugs can be repurposed for the native indications of other drugs of the same cluster based on their high predicted biological activities obtained from PASS analysis. In line with this, we identified 66 drugs from the nine clusters which are structurally similar but have different therapeutic uses and can therefore be repurposed for one or more native indications of other drugs of the same cluster. Some of these drugs not only share a common substructure but also bind to the same target and may have a similar mechanism of action, further supporting our hypothesis. Furthermore, based on the analysis of predicted biological activities, we identified 1423 drugs that can be repurposed for 366 new indications against several diseases. In this study, an integrated approach of unsupervised ML and SAR analysis have been used to identify new indications for approved drugs and the study provides novel insights into clustering patterns generated through descriptor level analysis of approved drugs.

摘要

机器学习和数据驱动方法由于其在基于现有数据源进行决策方面的潜在优势,目前在药物发现和开发中得到了广泛应用。将这些方法应用于药物重定位 (DR) 研究,可以在药物分子、治疗靶点和疾病之间发现新的关系,最终有助于为开发新的治疗方法提供新的见解。在本研究中,使用一种综合方法分析了包含 1671 种已批准药物的数据集,该方法涉及无监督机器学习 (ML) 技术(主成分分析 (PCA) 后接 k-均值聚类)和结构-活性关系 (SAR) 预测用于 DR。PCA 应用于数据集的所有二维 (2D) 分子描述符,随后使用前五个主成分 (PC) 使用 k-均值算法将药物聚类为九个分离良好的簇。我们进一步使用 PASS(预测物质的活动谱)工具预测药物-数据集的生物活性。系统地分析这些预测的活性值,以确定用于各种疾病的可重定位药物。k-均值聚类模式表明,每个簇都包含结构相似药物的亚组,这些药物可能具有相似的治疗适应症,也可能没有。我们假设,基于 PASS 分析获得的高预测生物活性,可以将具有相似结构但治疗作用不同的药物重新用于同一簇中其他药物的天然适应症。根据这一假设,我们从九个簇中确定了 66 种药物,这些药物结构相似但具有不同的治疗用途,因此可以重新用于同一簇中其他药物的一种或多种天然适应症。其中一些药物不仅共享一个共同的亚结构,而且还与同一靶点结合,可能具有相似的作用机制,进一步支持我们的假设。此外,基于预测生物活性的分析,我们确定了 1423 种药物可重新用于针对 366 种新适应症的多种疾病。在这项研究中,使用了无监督 ML 和 SAR 分析的综合方法来确定已批准药物的新适应症,该研究为通过已批准药物的描述符级分析生成的聚类模式提供了新的见解。

相似文献

1
Molecular descriptor analysis of approved drugs using unsupervised learning for drug repurposing.使用无监督学习对已批准药物进行分子描述符分析,以实现药物再利用。
Comput Biol Med. 2021 Nov;138:104856. doi: 10.1016/j.compbiomed.2021.104856. Epub 2021 Sep 10.
2
Exploration of the mechanism of traditional Chinese medicine by AI approach using unsupervised machine learning for cellular functional similarity of compounds in heterogeneous networks, XiaoErFuPi granules as an example.基于无监督机器学习的 AI 方法探索中药作用机制——以小二扶脾颗粒为例,研究化合物在异质网络中细胞功能相似性。
Pharmacol Res. 2020 Oct;160:105077. doi: 10.1016/j.phrs.2020.105077. Epub 2020 Jul 17.
3
A New Approach to Drug Repurposing with Two-Stage Prediction, Machine Learning, and Unsupervised Clustering of Gene Expression.一种新的药物重定位方法,采用两阶段预测、机器学习和基因表达无监督聚类。
OMICS. 2022 Jun;26(6):339-347. doi: 10.1089/omi.2022.0026. Epub 2022 Jun 3.
4
A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration.一种通过异构数据集成进行药物重定位的两层无监督聚类方法。
BMC Bioinformatics. 2018 Apr 11;19(1):129. doi: 10.1186/s12859-018-2123-4.
5
Sheep's coping style can be identified by unsupervised machine learning from unlabeled data.通过对无标签数据进行无监督机器学习,可以识别出绵羊的应对方式。
Behav Processes. 2022 Jan;194:104559. doi: 10.1016/j.beproc.2021.104559. Epub 2021 Nov 25.
6
Predicting associations among drugs, targets and diseases by tensor decomposition for drug repositioning.通过张量分解预测药物、靶点和疾病之间的关联,实现药物重定位。
BMC Bioinformatics. 2019 Dec 16;20(Suppl 26):628. doi: 10.1186/s12859-019-3283-6.
7
A new computational drug repurposing method using established disease-drug pair knowledge.一种利用已建立的疾病-药物对知识的新型计算药物再利用方法。
Bioinformatics. 2019 Oct 1;35(19):3672-3678. doi: 10.1093/bioinformatics/btz156.
8
Drug repositioning of herbal compounds via a machine-learning approach.基于机器学习的中草药化合物的药物再定位。
BMC Bioinformatics. 2019 May 29;20(Suppl 10):247. doi: 10.1186/s12859-019-2811-8.
9
Exploration of critical care data by using unsupervised machine learning.使用无监督机器学习探索重症监护数据。
Comput Methods Programs Biomed. 2020 Oct;194:105507. doi: 10.1016/j.cmpb.2020.105507. Epub 2020 Apr 28.
10
How good are publicly available web services that predict bioactivity profiles for drug repurposing?现有的用于药物重定位的生物活性预测的公共网络服务有多好?
SAR QSAR Environ Res. 2017 Oct;28(10):843-862. doi: 10.1080/1062936X.2017.1399448.

引用本文的文献

1
Multidimensional in silico evaluation of fluorine-18 radiopharmaceuticals: integrating pharmacokinetics, ADMET, and clustering for diagnostic stratification.氟-18放射性药物的多维计算机模拟评估:整合药代动力学、药物代谢及毒性预测和聚类分析用于诊断分层
J Comput Aided Mol Des. 2025 Sep 3;39(1):75. doi: 10.1007/s10822-025-00655-8.
2
Hydrogen-centric machine learning approach for analyzing properties of tricyclic anti-depressant drugs.用于分析三环类抗抑郁药物性质的以氢为中心的机器学习方法。
Front Chem. 2025 Jun 3;13:1603948. doi: 10.3389/fchem.2025.1603948. eCollection 2025.
3
Cancer Drug Sensitivity Prediction Based on Deep Transfer Learning.
基于深度迁移学习的癌症药物敏感性预测
Int J Mol Sci. 2025 Mar 10;26(6):2468. doi: 10.3390/ijms26062468.
4
approaches supporting drug repurposing for Leishmaniasis: a scoping review.支持利什曼病药物再利用的方法:一项范围综述。
EXCLI J. 2024 Sep 3;23:1117-1169. doi: 10.17179/excli2024-7552. eCollection 2024.
5
Leveraging Artificial Intelligence for Synergies in Drug Discovery: From Computers to Clinics.利用人工智能实现药物发现协同增效:从计算机到临床。
Curr Pharm Des. 2024;30(28):2187-2205. doi: 10.2174/0113816128308066240529121148.
6
Molecular Property Diagnostic Suite Compound Library (MPDS-CL): a structure-based classification of the chemical space.分子性质诊断套件化合物库(MPDS-CL):化学空间的基于结构的分类
Mol Divers. 2024 Oct;28(5):3243-3259. doi: 10.1007/s11030-023-10752-1. Epub 2023 Oct 30.
7
Revolutionizing Medicinal Chemistry: The Application of Artificial Intelligence (AI) in Early Drug Discovery.变革药物化学:人工智能在早期药物发现中的应用。
Pharmaceuticals (Basel). 2023 Sep 6;16(9):1259. doi: 10.3390/ph16091259.
8
Reconstructing the cytokine view for the multi-view prediction of COVID-19 mortality.重建细胞因子视角,用于 COVID-19 死亡率的多视角预测。
BMC Infect Dis. 2023 Sep 21;23(1):622. doi: 10.1186/s12879-023-08291-z.
9
The Chemical Space of Terpenes: Insights from Data Science and AI.萜类化合物的化学空间:数据科学与人工智能的见解
Pharmaceuticals (Basel). 2023 Jan 29;16(2):202. doi: 10.3390/ph16020202.
10
Towards systematic exploration of chemical space: building the fragment library module in molecular property diagnostic suite.迈向化学空间的系统探索:构建分子性质诊断套件中的片段库模块。
Mol Divers. 2023 Jun;27(3):1459-1468. doi: 10.1007/s11030-022-10506-5. Epub 2022 Aug 4.