• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于ADME预测的新型自适应集成分类框架。

A novel adaptive ensemble classification framework for ADME prediction.

作者信息

Yang Ming, Chen Jialei, Xu Liwen, Shi Xiufeng, Zhou Xin, Xi Zhijun, An Rui, Wang Xinhong

机构信息

Department of Pharmacy, Longhua Hospital Affiliated to Shanghai University of TCM Shanghai People's Republic of China.

Department of Chemistry, College of Pharmacy, Shanghai University of Traditional Chinese Medicine Shanghai People's Republic of China

出版信息

RSC Adv. 2018 Mar 26;8(21):11661-11683. doi: 10.1039/c8ra01206g. eCollection 2018 Mar 21.

DOI:10.1039/c8ra01206g
PMID:35542768
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9079056/
Abstract

It has now become clear that prediction of ADME (absorption, distribution, metabolism, and elimination) characteristics is an important component of the drug discovery process. Therefore, there has been considerable interest in the development of modeling of ADME prediction in recent years. Despite the advances in this field, there remains challenges when facing the unbalanced and high dimensionality problems simultaneously. In this work, we introduce a novel adaptive ensemble classification framework named as AECF to deal with the above issues. AECF includes four components which are (1) data balancing, (2) generating individual models, (3) combining individual models, and (4) optimizing the ensemble. We considered five sampling methods, seven base modeling techniques, and ten ensemble rules to build a choice pool. The proper route of constructing predictive models was determined automatically according to the imbalance ratio (IR). With the adaptive characteristics of AECF, it can be used to work on the different kinds of ADME data, and the balanced data is a special case in AECF. We evaluated the performance of our approach using five extensive ADME datasets concerning Caco-2 cell permeability (CacoP), human intestinal absorption (HIA), oral bioavailability (OB), and P-glycoprotein (P-gp) binders (substrates/inhibitors, PS/PI). The performance of AECF was evaluated on two independent datasets, and the average AUC values were 0.8574-0.8602, 0.8968-0.9182, 0.7821-0.7981, 0.8139-0.8311, and 0.8874-0.8898 for CacoP, HIA, OB, PS and PI, respectively. Our results show that AECF can provide better performance and generality compared with individual models and two representative ensemble methods bagging and boosting. Furthermore, the degree of complementarity among the AECF ensemble members was investigated for the purpose of elucidating the potential advantages of our framework. We found that AECF can effectively select complementary members to construct predictive models by our auto-adaptive optimization approach, and the additional diversity in both sample and feature space mainly contribute to the complementarity of ensemble members.

摘要

现已明确,预测药物的吸收、分布、代谢和排泄(ADME)特性是药物研发过程的一个重要组成部分。因此,近年来人们对ADME预测模型的开发颇感兴趣。尽管该领域取得了进展,但在同时面对不平衡和高维问题时仍存在挑战。在这项工作中,我们引入了一种名为AECF的新型自适应集成分类框架来处理上述问题。AECF包括四个组件,即(1)数据平衡,(2)生成个体模型,(3)组合个体模型,以及(4)优化集成。我们考虑了五种采样方法、七种基础建模技术和十种集成规则来构建一个选择池。根据不平衡率(IR)自动确定构建预测模型的合适途径。凭借AECF的自适应特性,它可用于处理不同类型的ADME数据,平衡数据是AECF中的一种特殊情况。我们使用五个关于Caco - 2细胞通透性(CacoP)、人体肠道吸收(HIA)、口服生物利用度(OB)和P - 糖蛋白(P - gp)结合物(底物/抑制剂,PS/PI)的广泛ADME数据集评估了我们方法的性能。在两个独立数据集上评估了AECF的性能,CacoP、HIA、OB、PS和PI的平均AUC值分别为0.8574 - 0.8602、0.8968 - 0.9182、0.7821 - 0.7981、0.8139 - 0.8311和0.8874 - 0.8898。我们的结果表明,与个体模型以及两种代表性的集成方法装袋法和提升法相比,AECF能提供更好的性能和通用性。此外,为了阐明我们框架的潜在优势,研究了AECF集成成员之间的互补程度。我们发现,AECF可以通过我们的自适应优化方法有效地选择互补成员来构建预测模型,样本和特征空间中的额外多样性主要促成了集成成员的互补性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/3c555b25bfd7/c8ra01206g-f16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/6b06630d6da3/c8ra01206g-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/e6dd55a7254e/c8ra01206g-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/5b6a0c003761/c8ra01206g-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/5b58bdab0eb4/c8ra01206g-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/e63adcb53fbd/c8ra01206g-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/a9de53ed7636/c8ra01206g-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/1884e3c424f4/c8ra01206g-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/9cc91b22c5a9/c8ra01206g-f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/6168e4dc3ed9/c8ra01206g-f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/04311b1601d2/c8ra01206g-f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/ae06bcb00ce7/c8ra01206g-f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/5c5dec42031e/c8ra01206g-f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/ec28a8fe1016/c8ra01206g-f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/09155a4f9001/c8ra01206g-f14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/1a9ec7acda92/c8ra01206g-f15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/3c555b25bfd7/c8ra01206g-f16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/6b06630d6da3/c8ra01206g-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/e6dd55a7254e/c8ra01206g-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/5b6a0c003761/c8ra01206g-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/5b58bdab0eb4/c8ra01206g-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/e63adcb53fbd/c8ra01206g-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/a9de53ed7636/c8ra01206g-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/1884e3c424f4/c8ra01206g-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/9cc91b22c5a9/c8ra01206g-f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/6168e4dc3ed9/c8ra01206g-f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/04311b1601d2/c8ra01206g-f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/ae06bcb00ce7/c8ra01206g-f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/5c5dec42031e/c8ra01206g-f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/ec28a8fe1016/c8ra01206g-f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/09155a4f9001/c8ra01206g-f14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/1a9ec7acda92/c8ra01206g-f15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f63/9079056/3c555b25bfd7/c8ra01206g-f16.jpg

相似文献

1
A novel adaptive ensemble classification framework for ADME prediction.一种用于ADME预测的新型自适应集成分类框架。
RSC Adv. 2018 Mar 26;8(21):11661-11683. doi: 10.1039/c8ra01206g. eCollection 2018 Mar 21.
2
In Silico Assessment of ADME Properties: Advances in Caco-2 Cell Monolayer Permeability Modeling.基于计算的药物代谢动力学性质评估:Caco-2 细胞单层渗透模型的研究进展。
Curr Top Med Chem. 2018;18(26):2209-2229. doi: 10.2174/1568026619666181130140350.
3
Development of in Silico Models for Predicting P-Glycoprotein Inhibitors Based on a Two-Step Approach for Feature Selection and Its Application to Chinese Herbal Medicine Screening.基于两步特征选择法的P-糖蛋白抑制剂计算机模拟模型的开发及其在中药筛选中的应用
Mol Pharm. 2015 Oct 5;12(10):3691-713. doi: 10.1021/acs.molpharmaceut.5b00465. Epub 2015 Sep 23.
4
Advances in computationally modeling human oral bioavailability.人体口服生物利用度计算建模的进展
Adv Drug Deliv Rev. 2015 Jun 23;86:11-6. doi: 10.1016/j.addr.2015.01.001. Epub 2015 Jan 9.
5
Connarus favosus Planch.: An inhibitor of the hemorrhagic activity of Bothrops atrox venom and a potential antioxidant and antibacterial agent.多褶牛栓藤(Connarus favosus Planch.):一种巴西矛头蝮蛇毒出血活性的抑制剂以及一种潜在的抗氧化和抗菌剂。
J Ethnopharmacol. 2016 May 13;183:166-175. doi: 10.1016/j.jep.2016.02.043. Epub 2016 Mar 3.
6
BE-DTI': Ensemble framework for drug target interaction prediction using dimensionality reduction and active learning.BE-DTI': 基于降维和主动学习的药物靶点相互作用预测集成框架。
Comput Methods Programs Biomed. 2018 Oct;165:151-162. doi: 10.1016/j.cmpb.2018.08.011. Epub 2018 Aug 22.
7
Predicting human intestinal absorption of diverse chemicals using ensemble learning based QSAR modeling approaches.使用基于集成学习的定量构效关系(QSAR)建模方法预测多种化学物质在人体肠道中的吸收情况。
Comput Biol Chem. 2016 Apr;61:178-96. doi: 10.1016/j.compbiolchem.2016.01.005. Epub 2016 Jan 29.
8
Estimation of ADME properties with substructure pattern recognition.基于子结构模式识别估算 ADME 性质。
J Chem Inf Model. 2010 Jun 28;50(6):1034-41. doi: 10.1021/ci100104j.
9
EKNN: Ensemble classifier incorporating connectivity and density into kNN with application to cancer diagnosis.EKNN:将连通性和密度纳入k近邻算法的集成分类器及其在癌症诊断中的应用
Artif Intell Med. 2021 Jan;111:101985. doi: 10.1016/j.artmed.2020.101985. Epub 2020 Nov 8.
10
Genetic programming based ensemble system for microarray data classification.基于遗传编程的微阵列数据分类集成系统。
Comput Math Methods Med. 2015;2015:193406. doi: 10.1155/2015/193406. Epub 2015 Feb 25.

引用本文的文献

1
TCMNPAS: a comprehensive analysis platform integrating network formulaology and network pharmacology for exploring traditional Chinese medicine.中医网络方剂学与网络药理学综合分析平台:用于探索中医药的平台
Chin Med. 2024 Mar 22;19(1):50. doi: 10.1186/s13020-024-00924-y.
2
The Efficacy of Ganoderma lucidum Extracts on Treating Endometrial Cancer: A Network Pharmacology Approach.灵芝提取物治疗子宫内膜癌的疗效:网络药理学方法。
Reprod Sci. 2024 Jul;31(7):1881-1894. doi: 10.1007/s43032-024-01500-3. Epub 2024 Mar 6.
3
Caged Polyprenylated Xanthones in and the Biological Activities of Them.

本文引用的文献

1
DrugBank 5.0: a major update to the DrugBank database for 2018.DrugBank 5.0:2018 年 DrugBank 数据库的重大更新。
Nucleic Acids Res. 2018 Jan 4;46(D1):D1074-D1082. doi: 10.1093/nar/gkx1037.
2
The Use of Rule-Based and QSPR Approaches in ADME Profiling: A Case Study on Caco-2 Permeability.基于规则和定量构效关系方法在药物吸收、分布、代谢和排泄特性分析中的应用:以Caco-2细胞通透性为例
Mol Inform. 2013 Jun;32(5-6):459-79. doi: 10.1002/minf.201200166. Epub 2013 May 15.
3
In Silico Prediction of Caco-2 Cell Permeability by a Classification QSAR Approach.
笼状多聚异戊烯基黄酮及其生物活性。
Drug Des Devel Ther. 2023 Dec 5;17:3625-3660. doi: 10.2147/DDDT.S426685. eCollection 2023.
4
ADMET property prediction via multi-task graph learning under adaptive auxiliary task selection.基于自适应辅助任务选择的多任务图学习进行ADMET性质预测
iScience. 2023 Oct 24;26(11):108285. doi: 10.1016/j.isci.2023.108285. eCollection 2023 Nov 17.
5
Anti-Autophagy Mechanism of Zhi Gan Prescription Based on Network Pharmacology in Nonalcoholic Steatohepatitis Rats.基于网络药理学探讨枳甘方对非酒精性脂肪性肝炎大鼠的抗自噬机制
Front Pharmacol. 2021 Jul 19;12:708479. doi: 10.3389/fphar.2021.708479. eCollection 2021.
6
A Study on the Therapeutic Efficacy of San Zi Yang Qin Decoction for Non-Alcoholic Fatty Liver Disease and the Underlying Mechanism Based on Network Pharmacology.基于网络药理学的三子养亲汤治疗非酒精性脂肪性肝病的疗效及作用机制研究
Evid Based Complement Alternat Med. 2021 Jan 8;2021:8819245. doi: 10.1155/2021/8819245. eCollection 2021.
7
Artificial Intelligence in Drug Discovery: A Comprehensive Review of Data-driven and Machine Learning Approaches.药物研发中的人工智能:数据驱动与机器学习方法的全面综述
Biotechnol Bioprocess Eng. 2020;25(6):895-930. doi: 10.1007/s12257-020-0049-y. Epub 2021 Jan 7.
8
Current status and future directions of high-throughput ADME screening in drug discovery.药物研发中高通量药物吸收、分布、代谢和排泄(ADME)筛选的现状与未来方向
J Pharm Anal. 2020 Jun;10(3):201-208. doi: 10.1016/j.jpha.2020.05.004. Epub 2020 May 23.
9
A Network Pharmacology Approach to Uncover the Molecular Mechanisms of Herbal Formula Ban-Xia-Xie-Xin-Tang.一种基于网络药理学的方法揭示中药方剂半夏泻心汤的分子机制
Evid Based Complement Alternat Med. 2018 Oct 16;2018:4050714. doi: 10.1155/2018/4050714. eCollection 2018.
基于分类定量构效关系方法的 Caco-2 细胞渗透率的计算预测。
Mol Inform. 2011 Apr 18;30(4):376-85. doi: 10.1002/minf.201000118. Epub 2011 Mar 31.
4
ADMET Evaluation in Drug Discovery. 16. Predicting hERG Blockers by Combining Multiple Pharmacophores and Machine Learning Approaches.药物发现中的ADMET评估。16. 通过结合多种药效团和机器学习方法预测人醚-à-去极化相关基因(hERG)阻滞剂
Mol Pharm. 2016 Aug 1;13(8):2855-66. doi: 10.1021/acs.molpharmaceut.6b00471. Epub 2016 Jul 18.
5
Predicting human intestinal absorption of diverse chemicals using ensemble learning based QSAR modeling approaches.使用基于集成学习的定量构效关系(QSAR)建模方法预测多种化学物质在人体肠道中的吸收情况。
Comput Biol Chem. 2016 Apr;61:178-96. doi: 10.1016/j.compbiolchem.2016.01.005. Epub 2016 Jan 29.
6
Exploring different strategies for imbalanced ADME data problem: case study on Caco-2 permeability modeling.探索不平衡的药物代谢动力学数据问题的不同策略:以Caco-2渗透性建模为例
Mol Divers. 2016 Feb;20(1):93-109. doi: 10.1007/s11030-015-9649-4. Epub 2015 Dec 7.
7
Development of in Silico Models for Predicting P-Glycoprotein Inhibitors Based on a Two-Step Approach for Feature Selection and Its Application to Chinese Herbal Medicine Screening.基于两步特征选择法的P-糖蛋白抑制剂计算机模拟模型的开发及其在中药筛选中的应用
Mol Pharm. 2015 Oct 5;12(10):3691-713. doi: 10.1021/acs.molpharmaceut.5b00465. Epub 2015 Sep 23.
8
An Improved Ensemble Learning Method for Classifying High-Dimensional and Imbalanced Biomedicine Data.一种用于分类高维不平衡生物医学数据的改进集成学习方法。
IEEE/ACM Trans Comput Biol Bioinform. 2014 Jul-Aug;11(4):657-66. doi: 10.1109/TCBB.2014.2306838.
9
Genetic programming based ensemble system for microarray data classification.基于遗传编程的微阵列数据分类集成系统。
Comput Math Methods Med. 2015;2015:193406. doi: 10.1155/2015/193406. Epub 2015 Feb 25.
10
Large unbalanced credit scoring using Lasso-logistic regression ensemble.使用套索逻辑回归集成的大规模不平衡信用评分
PLoS One. 2015 Feb 23;10(2):e0117844. doi: 10.1371/journal.pone.0117844. eCollection 2015.