• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高通量筛选数据集的数据挖掘与计算建模

Data Mining and Computational Modeling of High-Throughput Screening Datasets.

作者信息

Ekins Sean, Clark Alex M, Dole Krishna, Gregory Kellan, Mcnutt Andrew M, Spektor Anna Coulon, Weatherall Charlie, Litterman Nadia K, Bunin Barry A

机构信息

Collaborations Pharmaceuticals, Inc., 840 Main Campus Drive, Lab 3510, Raleigh, NC, 27606, USA.

Collaborative Drug Discovery, Inc., Burlingame, CA, USA.

出版信息

Methods Mol Biol. 2018;1755:197-221. doi: 10.1007/978-1-4939-7724-6_14.

DOI:10.1007/978-1-4939-7724-6_14
PMID:29671272
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6181121/
Abstract

We are now seeing the benefit of investments made over the last decade in high-throughput screening (HTS) that is resulting in large structure activity datasets entering public and open databases such as ChEMBL and PubChem. The growth of academic HTS screening centers and the increasing move to academia for early stage drug discovery suggests a great need for the informatics tools and methods to mine such data and learn from it. Collaborative Drug Discovery, Inc. (CDD) has developed a number of tools for storing, mining, securely and selectively sharing, as well as learning from such HTS data. We present a new web based data mining and visualization module directly within the CDD Vault platform for high-throughput drug discovery data that makes use of a novel technology stack following modern reactive design principles. We also describe CDD Models within the CDD Vault platform that enables researchers to share models, share predictions from models, and create models from distributed, heterogeneous data. Our system is built on top of the Collaborative Drug Discovery Vault Activity and Registration data repository ecosystem which allows users to manipulate and visualize thousands of molecules in real time. This can be performed in any browser on any platform. In this chapter we present examples of its use with public datasets in CDD Vault. Such approaches can complement other cheminformatics tools, whether open source or commercial, in providing approaches for data mining and modeling of HTS data.

摘要

我们现在看到了过去十年在高通量筛选(HTS)方面投资的好处,这使得大量的结构活性数据集进入了诸如ChEMBL和PubChem等公共开放数据库。学术性高通量筛选中心的发展以及早期药物发现向学术界的日益转移表明,迫切需要信息学工具和方法来挖掘此类数据并从中学习。协作药物发现公司(CDD)已经开发了许多工具,用于存储、挖掘、安全且有选择地共享以及从此类高通量筛选数据中学习。我们在CDD Vault平台中直接展示了一个新的基于网络的数据挖掘和可视化模块,用于高通量药物发现数据,该模块采用了遵循现代响应式设计原则的新技术栈。我们还描述了CDD Vault平台中的CDD模型,该模型使研究人员能够共享模型、共享模型预测结果,并从分布式的异构数据创建模型。我们的系统建立在协作药物发现Vault活性和注册数据存储库生态系统之上,允许用户实时操作和可视化数千种分子。这可以在任何平台的任何浏览器中进行。在本章中,我们展示了其在CDD Vault中与公共数据集一起使用的示例。此类方法可以补充其他化学信息学工具,无论是开源的还是商业的,为高通量筛选数据的数据挖掘和建模提供方法。

相似文献

1
Data Mining and Computational Modeling of High-Throughput Screening Datasets.高通量筛选数据集的数据挖掘与计算建模
Methods Mol Biol. 2018;1755:197-221. doi: 10.1007/978-1-4939-7724-6_14.
2
Open Source Bayesian Models. 1. Application to ADME/Tox and Drug Discovery Datasets.开源贝叶斯模型。1. 在药物代谢动力学/药物毒性及药物发现数据集方面的应用
J Chem Inf Model. 2015 Jun 22;55(6):1231-45. doi: 10.1021/acs.jcim.5b00143. Epub 2015 Jun 3.
3
Drug discovery applications for KNIME: an open source data mining platform.KNIME 在药物发现中的应用:一个开源的数据挖掘平台。
Curr Top Med Chem. 2012;12(18):1965-79. doi: 10.2174/156802612804910331.
4
A collaborative database and computational models for tuberculosis drug discovery.用于结核病药物研发的协作数据库和计算模型。
Mol Biosyst. 2010 May;6(5):840-51. doi: 10.1039/b917766c. Epub 2010 Feb 9.
5
Enalos Suite: New Cheminformatics Platform for Drug Discovery and Computational Toxicology.Enalos套件:用于药物发现和计算毒理学的新化学信息学平台。
Methods Mol Biol. 2018;1800:287-311. doi: 10.1007/978-1-4939-7899-1_14.
6
Collaborative drug discovery for More Medicines for Tuberculosis (MM4TB).用于治疗更多结核病患者的合作药物研发(MM4TB)。
Drug Discov Today. 2017 Mar;22(3):555-565. doi: 10.1016/j.drudis.2016.10.009. Epub 2016 Nov 22.
7
The Collaborative Drug Discovery (CDD) database.协作药物发现(CDD)数据库。
Methods Mol Biol. 2013;993:139-54. doi: 10.1007/978-1-62703-342-8_10.
8
What Can We Learn from Bioactivity Data? Chemoinformatics Tools and Applications in Chemical Biology Research.我们能从生物活性数据中学到什么?化学生物学研究中的化学信息学工具与应用。
ACS Chem Biol. 2017 Jan 20;12(1):23-35. doi: 10.1021/acschembio.6b00706. Epub 2016 Dec 5.
9
Open Source Bayesian Models. 2. Mining a "Big Dataset" To Create and Validate Models with ChEMBL.开源贝叶斯模型。2. 使用 ChEMBL 挖掘“大数据集”来创建和验证模型。
J Chem Inf Model. 2015 Jun 22;55(6):1246-60. doi: 10.1021/acs.jcim.5b00144. Epub 2015 Jun 3.
10
Selection of Informative Examples in Chemogenomic Datasets.化学基因组学数据集中信息性示例的选择
Methods Mol Biol. 2018;1825:369-410. doi: 10.1007/978-1-4939-8639-2_13.

引用本文的文献

1
Estimation of the Prevalence of Nonalcoholic Fatty Liver Disease in an Adult Population in Northern China Using the Data Mining Approach.利用数据挖掘方法估算中国北方成年人群中非酒精性脂肪性肝病的患病率
Diabetes Metab Syndr Obes. 2021 Jul 28;14:3437-3445. doi: 10.2147/DMSO.S320808. eCollection 2021.
2
Estimation of Diabetes in a High-Risk Adult Chinese Population Using J48 Decision Tree Model.使用J48决策树模型对中国高危成年人群中的糖尿病进行评估。
Diabetes Metab Syndr Obes. 2020 Nov 26;13:4621-4630. doi: 10.2147/DMSO.S279329. eCollection 2020.
3
Extremely Low Hit Rate in a Diverse Chemical Drug Screen Targeting Mycobacterium abscessus.

本文引用的文献

1
Academic partnerships 2015.2015年学术合作
Nat Biotechnol. 2016 Apr;34(4):372. doi: 10.1038/nbt.3522.
2
Machine learning models identify molecules active against the Ebola virus .机器学习模型识别出对埃博拉病毒有活性的分子。
F1000Res. 2015 Oct 20;4:1091. doi: 10.12688/f1000research.7217.3. eCollection 2015.
3
Modelling the Tox21 10 K chemical profiles for in vivo toxicity prediction and mechanism characterization.为体内毒性预测和作用机制表征对Tox21 10 K化学图谱进行建模。
针对脓肿分枝杆菌的多样化化学药物筛选中极低的命中率
Antimicrob Agents Chemother. 2019 Oct 22;63(11). doi: 10.1128/AAC.01008-19. Print 2019 Nov.
Nat Commun. 2016 Jan 26;7:10425. doi: 10.1038/ncomms10425.
4
Open Source Bayesian Models. 3. Composite Models for Prediction of Binned Responses.开源贝叶斯模型。3. 用于预测分箱响应的复合模型。
J Chem Inf Model. 2016 Feb 22;56(2):275-85. doi: 10.1021/acs.jcim.5b00555. Epub 2016 Jan 19.
5
Mining Chemical Activity Status from High-Throughput Screening Assays.从高通量筛选实验中挖掘化学活性状态
PLoS One. 2015 Dec 14;10(12):e0144426. doi: 10.1371/journal.pone.0144426. eCollection 2015.
6
The ELF Honest Data Broker: informatics enabling public-private collaboration in a precompetitive arena.ELF 诚实数据经纪人:信息学在竞争前领域促进公私合作。
Drug Discov Today. 2016 Jan;21(1):97-102. doi: 10.1016/j.drudis.2015.11.005. Epub 2015 Nov 19.
7
Predicting Mouse Liver Microsomal Stability with "Pruned" Machine Learning Models and Public Data.使用“精简”机器学习模型和公共数据预测小鼠肝脏微粒体稳定性
Pharm Res. 2016 Feb;33(2):433-49. doi: 10.1007/s11095-015-1800-5. Epub 2015 Sep 28.
8
Thermodynamic Proxies to Compensate for Biases in Drug Discovery Methods.用于补偿药物发现方法中偏差的热力学替代指标
Pharm Res. 2016 Jan;33(1):194-205. doi: 10.1007/s11095-015-1779-y. Epub 2015 Aug 27.
9
Making Transporter Models for Drug-Drug Interaction Prediction Mobile.构建用于药物相互作用预测的移动转运体模型。
Drug Metab Dispos. 2015 Oct;43(10):1642-5. doi: 10.1124/dmd.115.064956. Epub 2015 Jul 21.
10
The promise and peril of chemical probes.化学探针的前景与风险
Nat Chem Biol. 2015 Aug;11(8):536-41. doi: 10.1038/nchembio.1867.