• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

化学亚细胞定位的预测 多分类方法

prediction of chemical subcellular localization multi-classification methods.

作者信息

Yang Hongbin, Li Xiao, Cai Yingchun, Wang Qin, Li Weihua, Liu Guixia, Tang Yun

机构信息

Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . Email:

出版信息

Medchemcomm. 2017 Mar 29;8(6):1225-1234. doi: 10.1039/c7md00074j. eCollection 2017 Jun 1.

DOI:10.1039/c7md00074j
PMID:30108833
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6072212/
Abstract

Chemical subcellular localization is closely related to drug distribution in the body and hence important in drug discovery and design. Although many and methods have been developed, methods play key roles in the prediction of chemical subcellular localization due to their low costs and high performance. For that purpose, machine learning-based methods were developed here. At first, 614 unique compounds localized in the lysosome, mitochondria, nucleus and plasma membrane were collected from the literature. 80% of the compounds were used to build the models and the rest as the external validation set. Both fingerprints and molecular descriptors were used to describe the molecules, and six machine learning methods were applied to build the multi-classification models. The performance of the models was measured by 5-fold cross-validation and external validation. We further detected key substructures for each localization and analyzed potential structure-localization relationships, which could be very helpful for molecular design and modification. The key substructures can also be used as features complementary to fingerprints to improve the performance of the models.

摘要

化学亚细胞定位与药物在体内的分布密切相关,因此在药物发现和设计中很重要。尽管已经开发了许多方法,但由于成本低和性能高,机器学习方法在化学亚细胞定位预测中起着关键作用。为此,本文开发了基于机器学习的方法。首先,从文献中收集了614种定位于溶酶体、线粒体、细胞核和质膜的独特化合物。80%的化合物用于构建模型,其余作为外部验证集。指纹和分子描述符都用于描述分子,并应用六种机器学习方法构建多分类模型。通过五折交叉验证和外部验证来衡量模型的性能。我们进一步检测了每种定位的关键子结构,并分析了潜在的结构-定位关系,这对分子设计和修饰非常有帮助。关键子结构也可以用作指纹的补充特征,以提高模型的性能。

相似文献

1
prediction of chemical subcellular localization multi-classification methods.化学亚细胞定位的预测 多分类方法
Medchemcomm. 2017 Mar 29;8(6):1225-1234. doi: 10.1039/c7md00074j. eCollection 2017 Jun 1.
2
prediction of hERG potassium channel blockage by chemical category approaches.通过化学类别方法预测人乙醚 - 去极化激活钾离子通道(hERG)阻滞
Toxicol Res (Camb). 2016 Jan 14;5(2):570-582. doi: 10.1039/c5tx00294j. eCollection 2016 Mar 1.
3
A comprehensive support vector machine binary hERG classification model based on extensive but biased end point hERG data sets.基于广泛但存在偏倚的终点 hERG 数据集的全面支持向量机二进制 hERG 分类模型。
Chem Res Toxicol. 2011 Jun 20;24(6):934-49. doi: 10.1021/tx200099j. Epub 2011 May 6.
4
Direct Comparison of Total Clearance Prediction: Computational Machine Learning Model versus Bottom-Up Approach Using In Vitro Assay.直接比较总清除率预测:基于计算机器学习模型与基于体外测定的自下而上方法。
Mol Pharm. 2020 Jul 6;17(7):2299-2309. doi: 10.1021/acs.molpharmaceut.9b01294. Epub 2020 Jun 12.
5
Prediction on the mutagenicity of nitroaromatic compounds using quantum chemistry descriptors based QSAR and machine learning derived classification methods.基于量子化学描述符的定量构效关系和机器学习衍生分类方法预测硝基芳香族化合物的突变性。
Ecotoxicol Environ Saf. 2019 Dec 30;186:109822. doi: 10.1016/j.ecoenv.2019.109822. Epub 2019 Oct 18.
6
prediction of chemical genotoxicity using machine learning methods and structural alerts.使用机器学习方法和结构警示预测化学物质的遗传毒性
Toxicol Res (Camb). 2017 Dec 15;7(2):211-220. doi: 10.1039/c7tx00259a. eCollection 2018 Mar 1.
7
Drug-likeness analysis of traditional Chinese medicines: prediction of drug-likeness using machine learning approaches.中药类药性分析:基于机器学习方法的类药性预测。
Mol Pharm. 2012 Oct 1;9(10):2875-86. doi: 10.1021/mp300198d. Epub 2012 Sep 20.
8
In Silico Prediction of O⁶-Methylguanine-DNA Methyltransferase Inhibitory Potency of Base Analogs with QSAR and Machine Learning Methods.基于 QSAR 和机器学习方法的碱基类似物对 O⁶-甲基鸟嘌呤-DNA 甲基转移酶抑制活性的计算预测。
Molecules. 2018 Nov 6;23(11):2892. doi: 10.3390/molecules23112892.
9
Machine Learning Models Identify New Inhibitors for Human OATP1B1.机器学习模型鉴定人源有机阴离子转运多肽 1B1 的新型抑制剂
Mol Pharm. 2022 Nov 7;19(11):4320-4332. doi: 10.1021/acs.molpharmaceut.2c00662. Epub 2022 Oct 21.
10
In Silico Prediction of Metabolic Epoxidation for Drug-like Molecules via Machine Learning Methods.基于机器学习方法的药物样分子代谢环氧化的计算预测。
Mol Inform. 2020 Aug;39(8):e1900178. doi: 10.1002/minf.201900178. Epub 2020 Mar 31.

引用本文的文献

1
Piperidine and valproic acid hybrid compound (F2S4--VPA) outperforms methotrexate as anti-proliferative and cells migration inhibition.哌啶与丙戊酸杂化化合物(F2S4--VPA)在抗增殖和抑制细胞迁移方面比甲氨蝶呤表现更优。
RSC Adv. 2025 Jul 16;15(31):25291-25309. doi: 10.1039/d5ra01365h. eCollection 2025 Jul 15.
2
Chimeric Drug Design with a Noncharged Carrier for Mitochondrial Delivery.用于线粒体递送的带非电荷载体的嵌合药物设计。
Pharmaceutics. 2021 Feb 12;13(2):254. doi: 10.3390/pharmaceutics13020254.
3
Prediction of Chemical Toxicity for Drug Design Using Machine Learning Methods and Structural Alerts.使用机器学习方法和结构警示进行药物设计的化学毒性预测
Front Chem. 2018 Feb 20;6:30. doi: 10.3389/fchem.2018.00030. eCollection 2018.

本文引用的文献

1
Predicting antimicrobial peptides with improved accuracy by incorporating the compositional, physico-chemical and structural features into Chou's general PseAAC.通过将组成、物理化学和结构特征纳入到周元的通用 PseAAC 中,提高了抗菌肽预测的准确性。
Sci Rep. 2017 Feb 13;7:42362. doi: 10.1038/srep42362.
2
iATC-mISF: a multi-label classifier for predicting the classes of anatomical therapeutic chemicals.iATC-mISF:一种用于预测解剖治疗化学物质类别的多标签分类器。
Bioinformatics. 2017 Feb 1;33(3):341-346. doi: 10.1093/bioinformatics/btw644.
3
Pse-Analysis: a python package for DNA/RNA and protein/ peptide sequence analysis based on pseudo components and kernel methods.伪分析:一个基于伪组件和核方法用于DNA/RNA以及蛋白质/肽序列分析的Python软件包。
Oncotarget. 2017 Feb 21;8(8):13338-13343. doi: 10.18632/oncotarget.14524.
4
Unb-DPC: Identify mycobacterial membrane protein types by incorporating un-biased dipeptide composition into Chou's general PseAAC.Unb-DPC:通过将无偏差二肽组成纳入周的通用伪氨基酸组成来鉴定分枝杆菌膜蛋白类型。
J Theor Biol. 2017 Feb 21;415:13-19. doi: 10.1016/j.jtbi.2016.12.004. Epub 2016 Dec 8.
5
iRNA-AI: identifying the adenosine to inosine editing sites in RNA sequences.iRNA-AI:识别RNA序列中腺苷到肌苷的编辑位点。
Oncotarget. 2017 Jan 17;8(3):4208-4217. doi: 10.18632/oncotarget.13758.
6
iOri-Human: identify human origin of replication by incorporating dinucleotide physicochemical properties into pseudo nucleotide composition.iOri-Human:通过将二核苷酸物理化学性质纳入伪核苷酸组成来识别人类复制起点。
Oncotarget. 2016 Oct 25;7(43):69783-69793. doi: 10.18632/oncotarget.11975.
7
iRSpot-EL: identify recombination spots with an ensemble learning approach.iRSpot-EL:基于集成学习方法识别重组热点。
Bioinformatics. 2017 Jan 1;33(1):35-41. doi: 10.1093/bioinformatics/btw539. Epub 2016 Aug 16.
8
In Silico Estimation of Chemical Carcinogenicity with Binary and Ternary Classification Methods.采用二元和三元分类方法对化学致癌性进行计算机模拟评估。
Mol Inform. 2015 Apr;34(4):228-35. doi: 10.1002/minf.201400127. Epub 2015 Mar 27.
9
iPTM-mLys: identifying multiple lysine PTM sites and their different types.iPTM-mLys:鉴定多个赖氨酸 PTM 位点及其不同类型。
Bioinformatics. 2016 Oct 15;32(20):3116-3123. doi: 10.1093/bioinformatics/btw380. Epub 2016 Jun 22.
10
iPhos-PseEn: identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier.iPhos-PseEn:通过将不同的伪组分融合到集成分类器中来识别蛋白质中的磷酸化位点。
Oncotarget. 2016 Aug 9;7(32):51270-51283. doi: 10.18632/oncotarget.9987.