• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自一致性分类器的合成孔径雷达

SAR based on self consistent classifier.

作者信息

Stolbov L A, Filimonov D A, Poroikov V V

机构信息

Laboratory of Structure-Function Based Drug Design, Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow, Russian Federation.

出版信息

SAR QSAR Environ Res. 2022 Oct;33(10):793-804. doi: 10.1080/1062936X.2022.2139751.

DOI:10.1080/1062936X.2022.2139751
PMID:36369710
Abstract

The accuracy and performance of (Q)SAR models depend significantly on the data used for training. Datasets prepared on the basis of publicly available databases contain structures belonging to different chemical classes and have a highly imbalanced actives/inactives ratio. Currently, hundreds of structural descriptors are used in (Q)SAR studies. The abundance of structural descriptors gives rise to the problem of the constructed (Q)SAR models stability. The methods frequently used for the selection of a small fraction of the 'best' descriptors usually do not have sufficient mathematical justification. We propose a new approach to a self-consistent classifier for SAR analysis in order to overcome these problems. Logistic (SCLC) and extreme (SCEC) extensions of self-consistent regression (SCR) were implemented to enhance the classification capabilities of SCR. The approach was applied to classification models' development for inhibiting activity endpoints in HIV-1-related data and toxicity endpoints with subsequent fivefold cross-validation to estimate the models' performance. Comparison of the proposed SCLC and SCEC models with those developed using the original SCR and support vector machine demonstrated the comparable accuracy. Advantages in feature selection using our approach provide more generalizable (Q)SAR models. In particular, the crucial factors responsible for the observed value are determined unambiguously.

摘要

(定量)构效关系(QSAR)模型的准确性和性能在很大程度上取决于用于训练的数据。基于公开可用数据库制备的数据集包含属于不同化学类别的结构,并且活性/非活性比例高度失衡。目前,(定量)构效关系研究中使用了数百种结构描述符。大量的结构描述符导致了所构建的(定量)构效关系模型稳定性的问题。常用于选择一小部分“最佳”描述符的方法通常没有充分的数学依据。为了克服这些问题,我们提出了一种用于SAR分析的自洽分类器的新方法。实施了自洽回归(SCR)的逻辑(SCLC)和极端(SCEC)扩展,以增强SCR的分类能力。该方法应用于HIV-1相关数据中抑制活性终点和毒性终点的分类模型开发,并随后进行五重交叉验证以评估模型的性能。将所提出的SCLC和SCEC模型与使用原始SCR和支持向量机开发的模型进行比较,结果表明准确性相当。使用我们的方法进行特征选择的优势提供了更具通用性的(定量)构效关系模型。特别是,明确确定了导致观测值的关键因素。

相似文献

1
SAR based on self consistent classifier.基于自一致性分类器的合成孔径雷达
SAR QSAR Environ Res. 2022 Oct;33(10):793-804. doi: 10.1080/1062936X.2022.2139751.
2
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.机器学习算法在(放化疗)治疗结果预测中的应用:分类器的实证比较。
Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.
3
Comparison of Quantitative and Qualitative (Q)SAR Models Created for the Prediction of K and IC Values of Antitarget Inhibitors.为预测非靶向抑制剂的K值和IC值而创建的定量和定性(Q)SAR模型的比较。
Front Pharmacol. 2018 Oct 10;9:1136. doi: 10.3389/fphar.2018.01136. eCollection 2018.
4
A comprehensive support vector machine binary hERG classification model based on extensive but biased end point hERG data sets.基于广泛但存在偏倚的终点 hERG 数据集的全面支持向量机二进制 hERG 分类模型。
Chem Res Toxicol. 2011 Jun 20;24(6):934-49. doi: 10.1021/tx200099j. Epub 2011 May 6.
5
A support vector machine classifier reduces interscanner variation in the HRCT classification of regional disease pattern in diffuse lung disease: comparison to a Bayesian classifier.支持向量机分类器减少了弥漫性肺疾病中区域性疾病模式 HRCT 分类中的扫描仪间变异性:与贝叶斯分类器的比较。
Med Phys. 2013 May;40(5):051912. doi: 10.1118/1.4802214.
6
(Q)SAR Models of HIV-1 Protein Inhibition by Drug-Like Compounds.基于药物类似物的 HIV-1 蛋白抑制的定量构效关系模型。
Molecules. 2019 Dec 25;25(1):87. doi: 10.3390/molecules25010087.
7
A Machine Learning-Based QSAR Model for Benzimidazole Derivatives as Corrosion Inhibitors by Incorporating Comprehensive Feature Selection.基于机器学习的苯并咪唑衍生物作为缓蚀剂的 QSAR 模型,综合特征选择。
Interdiscip Sci. 2019 Dec;11(4):738-747. doi: 10.1007/s12539-019-00346-7. Epub 2019 Sep 4.
8
QSAR Models for Active Substances against Using Disk-Diffusion Test Data.基于纸片扩散试验数据的抗活性物质 QSAR 模型
Molecules. 2021 Mar 19;26(6):1734. doi: 10.3390/molecules26061734.
9
Mixture classification model based on clinical markers for breast cancer prognosis.基于临床标志物的乳腺癌预后混合分类模型。
Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.
10
The utility of structure-activity relationship (SAR) models for prediction and covariate selection in developmental toxicity: comparative analysis of logistic regression and decision tree models.结构-活性关系(SAR)模型在发育毒性预测和协变量选择中的效用:逻辑回归模型与决策树模型的比较分析
SAR QSAR Environ Res. 2004 Feb;15(1):1-18. doi: 10.1080/1062936032000169633.

引用本文的文献

1
Quantitative Prediction of Human Immunodeficiency Virus Drug Resistance.定量预测人类免疫缺陷病毒耐药性。
Viruses. 2024 Jul 15;16(7):1132. doi: 10.3390/v16071132.