• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

主动学习有效地识别出NCI-60细胞系中一组最小的、信息量最大且渐近性能良好的细胞毒性构效模式。

Active learning effectively identifies a minimal set of maximally informative and asymptotically performant cytotoxic structure-activity patterns in NCI-60 cell lines.

作者信息

Nakano Takumi, Takeda Shunichi, Brown J B

机构信息

Kyoto University Graduate School of Medicine , Department of Molecular Biosciences , Life Science Informatics Research Unit , Konoemachi Yoshida Sakyo , Kyoto 606-8501 , Japan . Email:

Kyoto University Graduate School of Medicine , Department of Radiation Genetics , Konoemachi Yoshida Sakyo , Kyoto 606-8501 , Japan.

出版信息

RSC Med Chem. 2020 Jul 20;11(9):1075-1087. doi: 10.1039/d0md00110d. eCollection 2020 Sep 1.

DOI:10.1039/d0md00110d
PMID:33479700
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7513593/
Abstract

The NCI-60 cancer cell line screening panel has provided insights for development of subtype-specific chemical therapies and repurposing. By extracting chemical structure and cytotoxicity patterns, virtual screening potentially complements the availability of high-throughput assay platforms and improves bioactive compound discovery rates by computational prefiltering of candidate compound libraries. Many groups report high prediction performances in computational models of NCI-60 data when using cross-validation or similar techniques, yet prospective therapy development in novel cancers may have little to no such data and further may not have the resources to perform hit identification using large compound libraries. In contrast to bulk screening and analysis, the active learning methodology has demonstrated how to identify compounds for screening in small batches and update computational models iteratively, leading to predictive models with a minimum number of compounds, and importantly clarifying data volumes at which limits in predictive ability are achieved. Here, in replicate per-cell line experiments using 50% of data (∼20 000 compounds) as the external prediction target, predictive limits are reproducibly demonstrated at the stage of systematic selection of 10-30% of the incorporable half. The pattern was consistent across all 60 cell lines. Limits of predictability are found to be correlated to the doubling times of cell lines and the number of cellular response discontinuities (activity cliffs) present per cell line. Organization into chemical scaffolds delineated degrees of predictive challenge. These results provide key insights for strategies in developing new inhibitors in existing cell lines or for future automated therapy selection in personalized oncotherapy.

摘要

NCI - 60癌细胞系筛选小组为亚型特异性化学疗法的开发和药物重新利用提供了见解。通过提取化学结构和细胞毒性模式,虚拟筛选有可能补充高通量检测平台的可用性,并通过对候选化合物库进行计算预筛选来提高生物活性化合物的发现率。许多研究小组报告称,在使用交叉验证或类似技术时,NCI - 60数据的计算模型具有很高的预测性能,然而在新型癌症的前瞻性治疗开发中,可能几乎没有此类数据,而且进一步可能没有资源使用大型化合物库来进行活性化合物鉴定。与批量筛选和分析不同,主动学习方法已经展示了如何小批量识别用于筛选的化合物,并迭代更新计算模型,从而得到使用最少数量化合物的预测模型,并且重要的是明确了达到预测能力极限时的数据量。在此,在每个细胞系实验的重复实验中,使用50%的数据(约20000种化合物)作为外部预测目标,在系统选择可纳入的一半的10 - 30%阶段可重复地证明了预测极限。该模式在所有60个细胞系中都是一致的。发现可预测性的极限与细胞系的倍增时间以及每个细胞系中存在的细胞反应不连续性(活性悬崖)的数量相关。按化学支架进行组织划分了预测挑战的程度。这些结果为在现有细胞系中开发新抑制剂的策略或未来个性化肿瘤治疗中的自动治疗选择提供了关键见解。

相似文献

1
Active learning effectively identifies a minimal set of maximally informative and asymptotically performant cytotoxic structure-activity patterns in NCI-60 cell lines.主动学习有效地识别出NCI-60细胞系中一组最小的、信息量最大且渐近性能良好的细胞毒性构效模式。
RSC Med Chem. 2020 Jul 20;11(9):1075-1087. doi: 10.1039/d0md00110d. eCollection 2020 Sep 1.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Synthesis and Evaluation of 2-Naphthaleno trans-Stilbenes and Cyanostilbenes as Anticancer Agents.2-萘基反式二苯乙烯和氰基二苯乙烯作为抗癌剂的合成与评价
Anticancer Agents Med Chem. 2018;18(4):556-564. doi: 10.2174/1871521409666170412115703.
4
DeepMalaria: Artificial Intelligence Driven Discovery of Potent Antiplasmodials.深度疟疾:人工智能驱动的强效抗疟药物发现
Front Pharmacol. 2020 Jan 15;10:1526. doi: 10.3389/fphar.2019.01526. eCollection 2019.
5
Data mining the NCI cancer cell line compound GI(50) values: identifying quinone subtypes effective against melanoma and leukemia cell classes.挖掘美国国立癌症研究所癌细胞系化合物的半数抑制浓度(GI(50))值:识别对黑色素瘤和白血病细胞类型有效的醌亚型。
J Chem Inf Comput Sci. 2003 Sep-Oct;43(5):1652-67. doi: 10.1021/ci034050+.
6
Deep learning prediction of chemical-induced dose-dependent and context-specific multiplex phenotype responses and its application to personalized alzheimer's disease drug repurposing.深度学习预测化学诱导的剂量依赖性和上下文特异性多重表型反应及其在个性化阿尔茨海默病药物再利用中的应用。
PLoS Comput Biol. 2022 Aug 11;18(8):e1010367. doi: 10.1371/journal.pcbi.1010367. eCollection 2022 Aug.
7
Combining computational methods for hit to lead optimization in Mycobacterium tuberculosis drug discovery.结合计算方法用于结核分枝杆菌药物发现中从活性分子到先导化合物的优化
Pharm Res. 2014 Feb;31(2):414-35. doi: 10.1007/s11095-013-1172-7. Epub 2013 Oct 17.
8
Automated Inference of Chemical Discriminants of Biological Activity.生物活性化学判别因子的自动推断
Methods Mol Biol. 2018;1762:307-338. doi: 10.1007/978-1-4939-7756-7_16.
9
Predictive models for estimating cytotoxicity on the basis of chemical structures.基于化学结构预测细胞毒性的模型。
Bioorg Med Chem. 2020 May 15;28(10):115422. doi: 10.1016/j.bmc.2020.115422. Epub 2020 Mar 12.
10
Profile-QSAR: a novel meta-QSAR method that combines activities across the kinase family to accurately predict affinity, selectivity, and cellular activity.谱定量构效关系(Profile-QSAR):一种新型的元定量构效关系方法,它结合了激酶家族的各项活性,可准确预测亲和力、选择性和细胞活性。
J Chem Inf Model. 2011 Aug 22;51(8):1942-56. doi: 10.1021/ci1005004. Epub 2011 Jul 19.

引用本文的文献

1
Advancing genetic engineering with active learning: theory, implementations and potential opportunities.通过主动学习推进基因工程:理论、实现与潜在机遇
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf286.
2
UMAP-based clustering split for rigorous evaluation of AI models for virtual screening on cancer cell lines.基于UMAP的聚类划分,用于对癌细胞系虚拟筛选的人工智能模型进行严格评估。
J Cheminform. 2025 Jun 10;17(1):94. doi: 10.1186/s13321-025-01039-8.

本文引用的文献

1
SciPy 1.0: fundamental algorithms for scientific computing in Python.SciPy 1.0:Python 中的科学计算基础算法。
Nat Methods. 2020 Mar;17(3):261-272. doi: 10.1038/s41592-019-0686-2. Epub 2020 Feb 3.
2
A Machine Learning Approach for Drug-target Interaction Prediction using Wrapper Feature Selection and Class Balancing.基于包装特征选择和类别平衡的药物-靶标相互作用预测的机器学习方法。
Mol Inform. 2020 May;39(5):e1900062. doi: 10.1002/minf.201900062. Epub 2020 Feb 11.
3
Combination of host immune metabolic biomarkers for the PD-1 blockade cancer immunotherapy.宿主免疫代谢生物标志物联合用于 PD-1 阻断癌症免疫治疗。
JCI Insight. 2020 Jan 30;5(2):133501. doi: 10.1172/jci.insight.133501.
4
Gefitinib Alone Versus Gefitinib Plus Chemotherapy for Non-Small-Cell Lung Cancer With Mutated Epidermal Growth Factor Receptor: NEJ009 Study.吉非替尼单药对比吉非替尼联合化疗用于表皮生长因子受体突变的非小细胞肺癌:NEJ009 研究。
J Clin Oncol. 2020 Jan 10;38(2):115-123. doi: 10.1200/JCO.19.01488. Epub 2019 Nov 4.
5
Applicability Domain of Active Learning in Chemical Probe Identification: Convergence in Learning from Non-Specific Compounds and Decision Rule Clarification.主动学习在化学探针识别中的适用性领域:从非特异性化合物中学习的收敛性和决策规则的阐明。
Molecules. 2019 Jul 26;24(15):2716. doi: 10.3390/molecules24152716.
6
Age-related remodelling of oesophageal epithelia by mutated cancer drivers.由突变的癌症驱动因子引起的食管上皮的年龄相关性重塑。
Nature. 2019 Jan;565(7739):312-317. doi: 10.1038/s41586-018-0811-x. Epub 2019 Jan 2.
7
Predicting tumor cell line response to drug pairs with deep learning.深度学习预测药物组合对肿瘤细胞系的反应
BMC Bioinformatics. 2018 Dec 21;19(Suppl 18):486. doi: 10.1186/s12859-018-2509-3.
8
PubChem 2019 update: improved access to chemical data.PubChem 2019 年更新:改善化学数据获取。
Nucleic Acids Res. 2019 Jan 8;47(D1):D1102-D1109. doi: 10.1093/nar/gky1033.
9
Selection of Informative Examples in Chemogenomic Datasets.化学基因组学数据集中信息性示例的选择
Methods Mol Biol. 2018;1825:369-410. doi: 10.1007/978-1-4939-8639-2_13.
10
Adaptive mining and model building of medicinal chemistry data with a multi-metric perspective.基于多指标视角的药物化学数据自适应挖掘与模型构建
Future Med Chem. 2018 Aug 1;10(16):1885-1887. doi: 10.4155/fmc-2018-0188. Epub 2018 Jul 3.