• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过支持向量机从大型化合物库中鉴定小分子聚集物。

Identification of small molecule aggregators from large compound libraries by support vector machines.

机构信息

College of Chemistry, Sichuan University, Chengdu 610064, People's Republic of China.

出版信息

J Comput Chem. 2010 Mar;31(4):752-63. doi: 10.1002/jcc.21347.

DOI:10.1002/jcc.21347
PMID:19569201
Abstract

Small molecule aggregators non-specifically inhibit multiple unrelated proteins, rendering them therapeutically useless. They frequently appear as false hits and thus need to be eliminated in high-throughput screening campaigns. Computational methods have been explored for identifying aggregators, which have not been tested in screening large compound libraries. We used 1319 aggregators and 128,325 non-aggregators to develop a support vector machines (SVM) aggregator identification model, which was tested by four methods. The first is five fold cross-validation, which showed comparable aggregator and significantly improved non-aggregator identification rates against earlier studies. The second is the independent test of 17 aggregators discovered independently from the training aggregators, 71% of which were correctly identified. The third is retrospective screening of 13M PUBCHEM and 168K MDDR compounds, which predicted 97.9% and 98.7% of the PUBCHEM and MDDR compounds as non-aggregators. The fourth is retrospective screening of 5527 MDDR compounds similar to the known aggregators, 1.14% of which were predicted as aggregators. SVM showed slightly better overall performance against two other machine learning methods based on five fold cross-validation studies of the same settings. Molecular features of aggregation, extracted by a feature selection method, are consistent with published profiles. SVM showed substantial capability in identifying aggregators from large libraries at low false-hit rates.

摘要

小分子聚集物非特异性地抑制多种不相关的蛋白质,使其在治疗上变得无用。它们经常作为假阳性出现,因此需要在高通量筛选中消除。已经探索了计算方法来识别聚集物,但这些方法尚未在筛选大型化合物库中进行测试。我们使用了 1319 种聚集物和 128325 种非聚集物来开发支持向量机(SVM)聚集物识别模型,该模型通过四种方法进行了测试。第一种是五重交叉验证,与早期研究相比,该方法显示出可比的聚集物和显著提高的非聚集物识别率。第二种是对从训练聚集物中独立发现的 17 种聚集物的独立测试,其中 71%被正确识别。第三种是对 13M PUBCHEM 和 168K MDDR 化合物的回溯筛选,预测了 97.9%和 98.7%的 PUBCHEM 和 MDDR 化合物是非聚集物。第四种是对与已知聚集物相似的 5527 种 MDDR 化合物的回溯筛选,其中 1.14%被预测为聚集物。SVM 在五重交叉验证研究中对两种其他机器学习方法的整体性能略好,这些研究具有相同的设置。通过特征选择方法提取的聚集分子特征与已发表的特征一致。SVM 在以低假阳性率从大型文库中识别聚集物方面具有很强的能力。

相似文献

1
Identification of small molecule aggregators from large compound libraries by support vector machines.通过支持向量机从大型化合物库中鉴定小分子聚集物。
J Comput Chem. 2010 Mar;31(4):752-63. doi: 10.1002/jcc.21347.
2
Virtual screening of Abl inhibitors from large compound libraries by support vector machines.利用支持向量机从大型化合物库中虚拟筛选Abl抑制剂
J Chem Inf Model. 2009 Sep;49(9):2101-10. doi: 10.1021/ci900135u.
3
Evaluation of virtual screening performance of support vector machines trained by sparsely distributed active compounds.稀疏分布活性化合物训练的支持向量机虚拟筛选性能评估。
J Chem Inf Model. 2008 Jun;48(6):1227-37. doi: 10.1021/ci800022e. Epub 2008 Jun 6.
4
Data mining PubChem using a support vector machine with the Signature molecular descriptor: classification of factor XIa inhibitors.使用带有特征分子描述符的支持向量机挖掘PubChem数据:凝血因子Xa抑制剂的分类
J Mol Graph Model. 2008 Nov;27(4):466-75. doi: 10.1016/j.jmgm.2008.08.004. Epub 2008 Aug 27.
5
Effect of training data size and noise level on support vector machines virtual screening of genotoxic compounds from large compound libraries.训练数据大小和噪声水平对支持向量机从大型化合物库中虚拟筛选遗传毒性化合物的影响。
J Comput Aided Mol Des. 2011 May;25(5):455-67. doi: 10.1007/s10822-011-9431-3. Epub 2011 May 10.
6
Identifying Novel Type ZBGs and Nonhydroxamate HDAC Inhibitors Through a SVM Based Virtual Screening Approach.通过基于支持向量机的虚拟筛选方法鉴定新型ZBG型和非异羟肟酸类组蛋白去乙酰化酶抑制剂。
Mol Inform. 2010 May 17;29(5):407-20. doi: 10.1002/minf.200900014.
7
Prediction of factor Xa inhibitors by machine learning methods.通过机器学习方法预测凝血因子Xa抑制剂
J Mol Graph Model. 2007 Sep;26(2):505-18. doi: 10.1016/j.jmgm.2007.03.003. Epub 2007 Mar 12.
8
GPU accelerated support vector machines for mining high-throughput screening data.GPU 加速的支持向量机在高通量筛选数据中的应用。
J Chem Inf Model. 2009 Dec;49(12):2718-25. doi: 10.1021/ci900337f.
9
Ligand prediction from protein sequence and small molecule information using support vector machines and fingerprint descriptors.利用支持向量机和指纹描述符从蛋白质序列和小分子信息进行配体预测。
J Chem Inf Model. 2009 Apr;49(4):767-79. doi: 10.1021/ci900004a.
10
Novel inhibitors of human histone deacetylase (HDAC) identified by QSAR modeling of known inhibitors, virtual screening, and experimental validation.通过对已知抑制剂进行定量构效关系建模、虚拟筛选和实验验证鉴定出的新型人类组蛋白去乙酰化酶(HDAC)抑制剂。
J Chem Inf Model. 2009 Feb;49(2):461-76. doi: 10.1021/ci800366f.

引用本文的文献

1
Advancing promiscuous aggregating inhibitor analysis with intelligent machine learning classification.通过智能机器学习分类推进混杂聚集抑制剂分析。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf205.
2
Predictions of Colloidal Molecular Aggregation Using AI/ML Models.使用人工智能/机器学习模型预测胶体分子聚集
ACS Omega. 2024 Jun 18;9(26):28691-28706. doi: 10.1021/acsomega.4c02886. eCollection 2024 Jul 2.
3
Gains from no real PAINS: Where 'Fair Trial Strategy' stands in the development of multi-target ligands.无实际痛苦的收获:“公平试验策略”在多靶点配体开发中的地位。
Acta Pharm Sin B. 2021 Nov;11(11):3417-3432. doi: 10.1016/j.apsb.2021.02.023. Epub 2021 Mar 4.
4
Colloidal aggregation: from screening nuisance to formulation nuance.胶体聚集:从筛选干扰到配方微调。
Nano Today. 2018 Apr;19:188-200. doi: 10.1016/j.nantod.2018.02.011. Epub 2018 Mar 10.
5
Getting the most out of PubChem for virtual screening.充分利用 PubChem 进行虚拟筛选。
Expert Opin Drug Discov. 2016 Sep;11(9):843-55. doi: 10.1080/17460441.2016.1216967. Epub 2016 Aug 5.
6
Mining Chemical Activity Status from High-Throughput Screening Assays.从高通量筛选实验中挖掘化学活性状态
PLoS One. 2015 Dec 14;10(12):e0144426. doi: 10.1371/journal.pone.0144426. eCollection 2015.
7
An Aggregation Advisor for Ligand Discovery.用于配体发现的聚集顾问程序。
J Med Chem. 2015 Sep 10;58(17):7076-87. doi: 10.1021/acs.jmedchem.5b01105. Epub 2015 Aug 28.
8
Exploiting PubChem for Virtual Screening.利用PubChem进行虚拟筛选。
Expert Opin Drug Discov. 2010 Dec;5(12):1205-1220. doi: 10.1517/17460441.2010.524924.