• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SCRAMBLE'N'GAMBLE:一种用于快速轻松生成随机数据以对QSAR模型进行统计评估的工具。

SCRAMBLE'N'GAMBLE: a tool for fast and facile generation of random data for statistical evaluation of QSAR models.

作者信息

Lipiński Piotr F J, Szurmak Przemysław

机构信息

Department of Neuropeptides, Mossakowski Medical Research Centre Polish Academy of Sciences, 02-106 Warsaw, Poland.

ChemPharmSoft, 01-926 Warsaw, Poland.

出版信息

Chem Zvesti. 2017;71(11):2217-2232. doi: 10.1007/s11696-017-0215-7. Epub 2017 Jun 5.

DOI:10.1007/s11696-017-0215-7
PMID:29104352
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5655615/
Abstract

A common practice in modern QSAR modelling is to derive models by variable selection methods working on large descriptor pools. As pointed out previously, this is intrinsically burdened with the risk of finding random correlations. Therefore it is desirable to perform tests showing the performance of models built on random data. In this contribution, we introduce a simple and freely available software tool SCRAMBLE'N'GAMBLE that is aimed at facilitating data preparation for -randomization and pseudo-descriptors tests. Then, four close-to-real-world modelling situations are analysed. The tests indicate what the quality of obtained QSAR models is like in comparison to chance models derived from random data. The non-randomness is not the only requirement for a good QSAR model, however, it is a good practice to consider it together with internal statistical parameters and possible physical interpretations of a model.

摘要

现代定量构效关系(QSAR)建模中的一个常见做法是通过对大量描述符库进行变量选择方法来推导模型。如前所述,这本质上存在发现随机相关性的风险。因此,进行测试以展示基于随机数据构建的模型的性能是很有必要的。在本论文中,我们介绍了一个简单且免费可用的软件工具“SCRAMBLE'N'GAMBLE”,其目的是便于为随机化和伪描述符测试进行数据准备。然后,分析了四种接近实际情况的建模情形。这些测试表明,与从随机数据导出的机会模型相比,所获得的QSAR模型质量如何。然而,非随机性并不是一个好的QSAR模型的唯一要求,将其与内部统计参数以及模型可能的物理解释一起考虑是一种良好的做法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e9f/5655615/beacd2a30229/11696_2017_215_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e9f/5655615/861a1dca0c45/11696_2017_215_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e9f/5655615/ab84fe9f7189/11696_2017_215_Fig2a_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e9f/5655615/b682af3cf071/11696_2017_215_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e9f/5655615/04e4b9e08fda/11696_2017_215_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e9f/5655615/7bfaac80b12d/11696_2017_215_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e9f/5655615/beacd2a30229/11696_2017_215_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e9f/5655615/861a1dca0c45/11696_2017_215_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e9f/5655615/ab84fe9f7189/11696_2017_215_Fig2a_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e9f/5655615/b682af3cf071/11696_2017_215_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e9f/5655615/04e4b9e08fda/11696_2017_215_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e9f/5655615/7bfaac80b12d/11696_2017_215_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e9f/5655615/beacd2a30229/11696_2017_215_Fig6_HTML.jpg

相似文献

1
SCRAMBLE'N'GAMBLE: a tool for fast and facile generation of random data for statistical evaluation of QSAR models.SCRAMBLE'N'GAMBLE:一种用于快速轻松生成随机数据以对QSAR模型进行统计评估的工具。
Chem Zvesti. 2017;71(11):2217-2232. doi: 10.1007/s11696-017-0215-7. Epub 2017 Jun 5.
2
y-Randomization and its variants in QSPR/QSAR.定量构效关系/定量结构活性关系中的γ-随机化及其变体。
J Chem Inf Model. 2007 Nov-Dec;47(6):2345-57. doi: 10.1021/ci700157b. Epub 2007 Sep 20.
3
Beware of External Validation! - A Comparative Study of Several Validation Techniques used in QSAR Modelling.谨防外部验证!——QSAR建模中几种验证技术的比较研究。
Curr Comput Aided Drug Des. 2018;14(4):284-291. doi: 10.2174/1573409914666180426144304.
4
Legitimate utilization of large descriptor pools for QSPR/QSAR models.将大型描述符库合理用于定量构效关系/定量结构活性关系模型。
J Chem Inf Model. 2008 Nov;48(11):2207-13. doi: 10.1021/ci8002073.
5
Combinatorial QSAR of ambergris fragrance compounds.龙涎香香料化合物的组合定量构效关系
J Chem Inf Comput Sci. 2004 Mar-Apr;44(2):582-95. doi: 10.1021/ci034203t.
6
Consensus QSAR modelling of SIRT1 activators using simplex representation of molecular structure.基于分子结构单形表示的 SIRT1 激活剂共识定量构效关系建模。
SAR QSAR Environ Res. 2018 Apr;29(4):277-294. doi: 10.1080/1062936X.2018.1426626. Epub 2018 Feb 1.
7
Visual analytics in cheminformatics: user-supervised descriptor selection for QSAR methods.化学信息学中的可视化分析:用于定量构效关系方法的用户监督描述符选择
J Cheminform. 2015 Aug 19;7:39. doi: 10.1186/s13321-015-0092-4. eCollection 2015.
8
Application of GA-MLR for QSAR Modeling of the Arylthioindole Class of Tubulin Polymerization Inhibitors as Anticancer Agents.遗传算法-多元线性回归在作为抗癌剂的芳基硫代吲哚类微管蛋白聚合抑制剂定量构效关系建模中的应用。
Anticancer Agents Med Chem. 2017;17(4):552-565. doi: 10.2174/1871520616666160811162105.
9
Evaluation of a novel molecular vibration-based descriptor (EVA) for QSAR studies: 2. Model validation using a benchmark steroid dataset.用于定量构效关系(QSAR)研究的新型基于分子振动的描述符(EVA)的评估:2. 使用基准类固醇数据集进行模型验证。
J Comput Aided Mol Des. 1999 May;13(3):271-96. doi: 10.1023/a:1008012732081.
10
QSAR--how good is it in practice? Comparison of descriptor sets on an unbiased cross section of corporate data sets.定量构效关系——在实际应用中效果如何?企业数据集无偏横截面描述符集的比较。
J Chem Inf Model. 2006 Sep-Oct;46(5):1924-36. doi: 10.1021/ci050413p.

引用本文的文献

1
Stacking Ensemble Neural Network for Chemical Safety Assessment: A Case Study of Thyroid Peroxidase and Natural Product Screening.用于化学安全评估的堆叠集成神经网络:以甲状腺过氧化物酶和天然产物筛选为例
ACS Omega. 2025 Jul 10;10(28):30450-30466. doi: 10.1021/acsomega.5c02188. eCollection 2025 Jul 22.
2
Understanding the Enzyme ()-Norcoclaurine Synthase Promiscuity to Aldehydes and Ketones.理解酶()-诺卡屈嗪合成酶对醛和酮的混杂性。
J Chem Inf Model. 2024 Jun 10;64(11):4462-4474. doi: 10.1021/acs.jcim.3c01773. Epub 2024 May 22.
3
Protein characteristics substantially influence the propensity of activity cliffs among kinase inhibitors.

本文引用的文献

1
The drug situation in Europe: an overview of data available on illicit drugs and new psychoactive substances from European monitoring in 2015.欧洲毒品形势:对 2015 年欧洲毒品监测中非法药物和新精神活性物质相关数据的概述。
Addiction. 2016 Jan;111(1):34-48. doi: 10.1111/add.13056. Epub 2015 Sep 30.
2
Towards engineering hormone-binding globulins as drug delivery agents.致力于将激素结合球蛋白工程化为药物递送剂。
PLoS One. 2014 Nov 26;9(11):e113402. doi: 10.1371/journal.pone.0113402. eCollection 2014.
3
Human sex hormone-binding globulin binding affinities of 125 structurally diverse chemicals and comparison with their binding to androgen receptor, estrogen receptor, and α-fetoprotein.
蛋白质特性极大地影响了激酶抑制剂中活性峰的倾向。
Sci Rep. 2024 Apr 20;14(1):9058. doi: 10.1038/s41598-024-59501-w.
4
Viral Immunogenicity Prediction by Machine Learning Methods.基于机器学习方法的病毒免疫原性预测。
Int J Mol Sci. 2024 Mar 3;25(5):2949. doi: 10.3390/ijms25052949.
5
Combining machine learning and structure-based approaches to develop oncogene PIM kinase inhibitors.结合机器学习和基于结构的方法来开发致癌基因PIM激酶抑制剂。
Front Chem. 2023 Mar 10;11:1137444. doi: 10.3389/fchem.2023.1137444. eCollection 2023.
6
Cell Morphological Profiling Enables High-Throughput Screening for PROteolysis TArgeting Chimera (PROTAC) Phenotypic Signature.细胞形态特征分析可实现高通量筛选 PROteolysis TArgeting Chimera(PROTAC)表型特征。
ACS Chem Biol. 2022 Jul 15;17(7):1733-1744. doi: 10.1021/acschembio.2c00076. Epub 2022 Jul 6.
7
Docking-generated multiple ligand poses for bootstrapping bioactivity classifying Machine Learning: Repurposing covalent inhibitors for COVID-19-related TMPRSS2 as case study.对接生成的多个配体构象用于引导生物活性分类机器学习:以将共价抑制剂重新用于治疗与 COVID-19 相关的跨膜丝氨酸蛋白酶 2 为例进行研究。
Comput Struct Biotechnol J. 2021;19:4790-4824. doi: 10.1016/j.csbj.2021.08.023. Epub 2021 Aug 19.
8
Development of Machine Learning Models for Accurately Predicting and Ranking the Activity of Lead Molecules to Inhibit PRC2 Dependent Cancer.用于准确预测和排序先导分子抑制PRC2依赖性癌症活性的机器学习模型的开发
Pharmaceuticals (Basel). 2021 Jul 20;14(7):699. doi: 10.3390/ph14070699.
9
Molecular insights on ABL kinase activation using tree-based machine learning models and molecular docking.基于树的机器学习模型和分子对接的 ABL 激酶激活的分子见解。
Mol Divers. 2021 Aug;25(3):1301-1314. doi: 10.1007/s11030-021-10261-z. Epub 2021 Jun 30.
10
Prediction and mechanistic analysis of drug-induced liver injury (DILI) based on chemical structure.基于化学结构预测和机制分析药物性肝损伤(DILI)。
Biol Direct. 2021 Jan 18;16(1):6. doi: 10.1186/s13062-020-00285-0.
125种结构各异的化学物质与人性激素结合球蛋白的结合亲和力及其与雄激素受体、雌激素受体和甲胎蛋白结合情况的比较
Toxicol Sci. 2015 Feb;143(2):333-48. doi: 10.1093/toxsci/kfu231. Epub 2014 Oct 27.
4
Modelling the binding affinity of steroids to zebrafish sex hormone-binding globulin.建立类固醇与斑马鱼性激素结合球蛋白结合亲和力的模型。
SAR QSAR Environ Res. 2014;25(5):407-21. doi: 10.1080/1062936X.2014.909197. Epub 2014 May 29.
5
Fentanyl-related compounds and derivatives: current status and future prospects for pharmaceutical applications.芬太尼相关化合物及衍生物:药物应用的现状与未来前景
Future Med Chem. 2014 Mar;6(4):385-412. doi: 10.4155/fmc.13.215.
6
Sex hormone binding globulin and corticosteroid binding globulin as major effectors of steroid action.性激素结合球蛋白和皮质类固醇结合球蛋白作为类固醇作用的主要效应器。
Steroids. 2014 Mar;81:13-6. doi: 10.1016/j.steroids.2013.11.010. Epub 2013 Nov 20.
7
Fentanyl-associated fatalities among illicit drug users in Wayne County, Michigan (July 2005-May 2006).密歇根州韦恩县非法药物使用者中与芬太尼相关的死亡案例(2005 年 7 月-2006 年 5 月)。
J Med Toxicol. 2013 Mar;9(1):106-15. doi: 10.1007/s13181-012-0285-4.
8
Corticosteroid-binding globulin: structure-function implications from species differences.皮质类固醇结合球蛋白:种属差异的结构-功能意义。
PLoS One. 2012;7(12):e52759. doi: 10.1371/journal.pone.0052759. Epub 2012 Dec 26.
9
Directory of useful decoys, enhanced (DUD-E): better ligands and decoys for better benchmarking.有用诱饵目录增强版(DUD-E):更好的配体和诱饵,用于更好的基准测试。
J Med Chem. 2012 Jul 26;55(14):6582-94. doi: 10.1021/jm300687e. Epub 2012 Jul 5.
10
Chirality measures of α-amino acids.α-氨基酸的手性测量。
J Chem Inf Model. 2012 Jun 25;52(6):1462-79. doi: 10.1021/ci300057h. Epub 2012 May 29.