自动定量构效关系建模填补高通量筛选中的数据空白。

Automatic Quantitative Structure-Activity Relationship Modeling to Fill Data Gaps in High-Throughput Screening.

机构信息

Center for Computational and Integrative Biology, Rutgers University, Camden, NJ, USA.

Department of Chemistry, Rutgers University, Camden, NJ, USA.

出版信息

Methods Mol Biol. 2022;2474:169-187. doi: 10.1007/978-1-0716-2213-1_16.

DOI:10.1007/978-1-0716-2213-1_16

PMID:35294765

Abstract

Advances in high-throughput screening (HTS) revolutionized the environmental and health sciences data landscape. However, new compounds still need to be experimentally synthesized and tested to obtain HTS data, which will still be costly and time-consuming when a large set of new compounds need to be studied against many tests. Quantitative structure-activity relationship (QSAR) modeling is a standard method to fill data gaps for new compounds. The major challenge for many toxicologists, especially those with limited computational backgrounds, is efficiently developing optimized QSAR models for each assay with missing data for certain test compounds. This chapter aims to introduce a freely available and user-friendly QSAR modeling workflow, which trains and optimizes models using five algorithms without the need for a programming background.

摘要

高通量筛选 (HTS) 的进展彻底改变了环境和健康科学数据领域。然而，为了获得 HTS 数据，仍然需要对新化合物进行实验合成和测试，当需要对大量新化合物进行许多测试时，这仍然是昂贵和耗时的。定量构效关系 (QSAR) 建模是填补新化合物数据空白的标准方法。对于许多毒理学家来说，尤其是那些计算背景有限的毒理学家来说，主要的挑战是有效地为每个具有某些测试化合物缺失数据的测定法开发经过优化的 QSAR 模型。本章旨在介绍一个免费且易于使用的 QSAR 建模工作流程，该流程使用五种算法进行训练和优化模型，而无需编程背景。

相似文献

Automatic Quantitative Structure-Activity Relationship Modeling to Fill Data Gaps in High-Throughput Screening.自动定量构效关系建模填补高通量筛选中的数据空白。

Methods Mol Biol. 2022;2474:169-187. doi: 10.1007/978-1-0716-2213-1_16.

Curating and Preparing High-Throughput Screening Data for Quantitative Structure-Activity Relationship Modeling.为定量构效关系建模整理和准备高通量筛选数据

Methods Mol Biol. 2016;1473:161-72. doi: 10.1007/978-1-4939-6346-1_17.

QSAR modeling of imbalanced high-throughput screening data in PubChem.基于PubChem中不平衡高通量筛选数据的定量构效关系建模

J Chem Inf Model. 2014 Mar 24;54(3):705-12. doi: 10.1021/ci400737s. Epub 2014 Feb 28.

Profile-QSAR: a novel meta-QSAR method that combines activities across the kinase family to accurately predict affinity, selectivity, and cellular activity.谱定量构效关系（Profile-QSAR）：一种新型的元定量构效关系方法，它结合了激酶家族的各项活性，可准确预测亲和力、选择性和细胞活性。

J Chem Inf Model. 2011 Aug 22;51(8):1942-56. doi: 10.1021/ci1005004. Epub 2011 Jul 19.

Use of cell viability assay data improves the prediction accuracy of conventional quantitative structure-activity relationship models of animal carcinogenicity.细胞活力测定数据的使用提高了动物致癌性传统定量构效关系模型的预测准确性。

Environ Health Perspect. 2008 Apr;116(4):506-13. doi: 10.1289/ehp.10573.

High-Throughput Screening Assay Profiling for Large Chemical Databases.高通量筛选测定大化学数据库。

Methods Mol Biol. 2022;2474:125-132. doi: 10.1007/978-1-0716-2213-1_12.

CurveP Method for Rendering High-Throughput Screening Dose-Response Data into Digital Fingerprints.CurveP 方法将高通量筛选剂量反应数据转化为数字指纹。

Methods Mol Biol. 2022;2474:147-154. doi: 10.1007/978-1-0716-2213-1_14.

In Silico Study of In Vitro GPCR Assays by QSAR Modeling.通过定量构效关系（QSAR）建模对体外G蛋白偶联受体（GPCR）分析进行计算机模拟研究。

Methods Mol Biol. 2016;1425:361-81. doi: 10.1007/978-1-4939-3609-0_16.

A Distance-Based Boolean Applicability Domain for Classification of High Throughput Screening Data.基于距离的布尔适域在高通量筛选数据分类中的应用。

J Chem Inf Model. 2019 Jan 28;59(1):463-476. doi: 10.1021/acs.jcim.8b00499. Epub 2019 Jan 8.

Cluster based SVR-QSAR modelling for HTS records: an implementation for anticancer leads against human breast cancer.基于聚类的高通量筛选记录的支持向量回归定量构效关系建模：针对人类乳腺癌抗癌先导物的一种实现方法

Comb Chem High Throughput Screen. 2013 Jun 28;16(7):511-21. doi: 10.2174/1386207311316070002.

引用本文的文献

Application of Machine Learning and Mechanistic Modeling to Predict Intravenous Pharmacokinetic Profiles in Humans.机器学习与机理建模在预测人体静脉药代动力学特征中的应用

J Med Chem. 2025 Apr 10;68(7):7737-7750. doi: 10.1021/acs.jmedchem.5c00340. Epub 2025 Mar 27.

Predicting Chemical Immunotoxicity through Data-Driven QSAR Modeling of Aryl Hydrocarbon Receptor Agonism and Related Toxicity Mechanisms.通过基于数据驱动的芳烃受体激动作用及相关毒性机制的定量构效关系模型预测化学物质的免疫毒性

Environ Health (Wash). 2024 May 28;2(7):474-485. doi: 10.1021/envhealth.4c00026. eCollection 2024 Jul 19.

Hybrid non-animal modeling: A mechanistic approach to predict chemical hepatotoxicity.混合非动物模型：预测化学肝毒性的一种机制方法。

J Hazard Mater. 2024 Jun 5;471:134297. doi: 10.1016/j.jhazmat.2024.134297. Epub 2024 Apr 12.

Data-Driven Quantitative Structure-Activity Relationship Modeling for Human Carcinogenicity by Chronic Oral Exposure.基于数据的定量构效关系模型在人类经口慢性暴露致癌性研究中的应用。

Environ Sci Technol. 2023 Apr 25;57(16):6573-6588. doi: 10.1021/acs.est.3c00648. Epub 2023 Apr 11.

本文引用的文献

Construction of a Virtual Opioid Bioprofile: A Data-Driven QSAR Modeling Study to Identify New Analgesic Opioids.虚拟阿片类生物图谱的构建：一项用于识别新型镇痛阿片类药物的数据驱动定量构效关系建模研究

ACS Sustain Chem Eng. 2021 Mar 15;9(10):3909-3919. doi: 10.1021/acssuschemeng.0c09139. Epub 2021 Mar 4.

Predictive modeling of estrogen receptor agonism, antagonism, and binding activities using machine- and deep-learning approaches.使用机器学习和深度学习方法预测雌激素受体激动剂、拮抗剂和结合活性。

Lab Invest. 2021 Apr;101(4):490-502. doi: 10.1038/s41374-020-00477-2. Epub 2020 Aug 10.

Advancing computer-aided drug discovery (CADD) by big data and data-driven machine learning modeling.通过大数据和数据驱动的机器学习建模来推动计算机辅助药物发现 (CADD)。

Drug Discov Today. 2020 Sep;25(9):1624-1638. doi: 10.1016/j.drudis.2020.07.005. Epub 2020 Jul 11.

Advancing Computational Toxicology in the Big Data Era by Artificial Intelligence: Data-Driven and Mechanism-Driven Modeling for Chemical Toxicity.推动大数据时代计算毒理学的发展：化学毒性的人工智能数据驱动和机制驱动建模。

Chem Res Toxicol. 2019 Apr 15;32(4):536-547. doi: 10.1021/acs.chemrestox.8b00393. Epub 2019 Mar 25.

Comparing Multiple Machine Learning Algorithms and Metrics for Estrogen Receptor Binding Prediction.比较多种机器学习算法和指标进行雌激素受体结合预测。

Mol Pharm. 2018 Oct 1;15(10):4361-4370. doi: 10.1021/acs.molpharmaceut.8b00546. Epub 2018 Aug 28.

Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Data Sets.使用多种药物发现数据集比较深度学习与多种机器学习方法和指标。

Mol Pharm. 2017 Dec 4;14(12):4462-4475. doi: 10.1021/acs.molpharmaceut.7b00578. Epub 2017 Nov 13.

Extended-connectivity fingerprints.扩展连接指纹。

J Chem Inf Model. 2010 May 24;50(5):742-54. doi: 10.1021/ci100050t.

An overview of the PubChem BioAssay resource.PubChem 生物测定资源概述。

Nucleic Acids Res. 2010 Jan;38(Database issue):D255-66. doi: 10.1093/nar/gkp965. Epub 2009 Nov 19.

A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction.一种使用多因素降维方法在不平衡数据集中进行上位性建模的平衡准确率函数。

Genet Epidemiol. 2007 May;31(4):306-15. doi: 10.1002/gepi.20211.

Diagnostic tests. 1: Sensitivity and specificity.诊断测试。1：敏感性和特异性。

BMJ. 1994 Jun 11;308(6943):1552. doi: 10.1136/bmj.308.6943.1552.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

自动定量构效关系建模填补高通量筛选中的数据空白。

Automatic Quantitative Structure-Activity Relationship Modeling to Fill Data Gaps in High-Throughput Screening.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献