• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从基因表达数据中学习药物反应的预测误差 - 标签、样本量和机器学习算法的影响。

Prediction errors in learning drug response from gene expression data - influence of labeling, sample size, and machine learning algorithm.

机构信息

Aachen Institute for Advanced Study in Computational Engineering Science (AICES), RWTH Aachen University, Aachen, Germany.

出版信息

PLoS One. 2013 Jul 23;8(7):e70294. doi: 10.1371/journal.pone.0070294. Print 2013.

DOI:10.1371/journal.pone.0070294
PMID:23894636
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3720898/
Abstract

Model-based prediction is dependent on many choices ranging from the sample collection and prediction endpoint to the choice of algorithm and its parameters. Here we studied the effects of such choices, exemplified by predicting sensitivity (as IC50) of cancer cell lines towards a variety of compounds. For this, we used three independent sample collections and applied several machine learning algorithms for predicting a variety of endpoints for drug response. We compared all possible models for combinations of sample collections, algorithm, drug, and labeling to an identically generated null model. The predictability of treatment effects varies among compounds, i.e. response could be predicted for some but not for all. The choice of sample collection plays a major role towards lowering the prediction error, as does sample size. However, we found that no algorithm was able to consistently outperform the other and there was no significant difference between regression and two- or three class predictors in this experimental setting. These results indicate that response-modeling projects should direct efforts mainly towards sample collection and data quality, rather than method adjustment.

摘要

基于模型的预测取决于许多选择,从样本采集和预测终点到算法及其参数的选择。在这里,我们研究了这些选择的影响,以预测癌细胞系对各种化合物的敏感性(作为 IC50)为例。为此,我们使用了三个独立的样本集,并应用了几种机器学习算法来预测药物反应的各种终点。我们将所有可能的模型组合(样本集、算法、药物和标签)与相同生成的空模型进行了比较。化合物之间的治疗效果可预测性存在差异,即有些化合物的反应可以预测,但并非所有化合物的反应都可以预测。样本集的选择对降低预测误差起着重要作用,样本量也是如此。然而,我们发现没有一种算法能够始终优于其他算法,并且在这种实验设置中,回归和两分类或三分类预测器之间没有显著差异。这些结果表明,响应建模项目应主要致力于样本采集和数据质量,而不是方法调整。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/0318041b249a/pone.0070294.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/770265b1316b/pone.0070294.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/01dc40594c87/pone.0070294.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/775c77ad2fc5/pone.0070294.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/837e8dfe32d4/pone.0070294.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/1c8447681c1b/pone.0070294.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/a04609aa773d/pone.0070294.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/0a1db55f722d/pone.0070294.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/0318041b249a/pone.0070294.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/770265b1316b/pone.0070294.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/01dc40594c87/pone.0070294.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/775c77ad2fc5/pone.0070294.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/837e8dfe32d4/pone.0070294.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/1c8447681c1b/pone.0070294.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/a04609aa773d/pone.0070294.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/0a1db55f722d/pone.0070294.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d71/3720898/0318041b249a/pone.0070294.g008.jpg

相似文献

1
Prediction errors in learning drug response from gene expression data - influence of labeling, sample size, and machine learning algorithm.从基因表达数据中学习药物反应的预测误差 - 标签、样本量和机器学习算法的影响。
PLoS One. 2013 Jul 23;8(7):e70294. doi: 10.1371/journal.pone.0070294. Print 2013.
2
Time-dependent AI-Modeling of the anticancer efficacy of synthesized gallic acid analogues.时间依赖性人工智能模拟合成没食子酸类似物的抗癌功效。
Comput Biol Chem. 2019 Apr;79:137-146. doi: 10.1016/j.compbiolchem.2019.02.004. Epub 2019 Feb 16.
3
Prediction of anti-cancer drug response by kernelized multi-task learning.通过核多任务学习预测抗癌药物反应
Artif Intell Med. 2016 Oct;73:70-77. doi: 10.1016/j.artmed.2016.09.004. Epub 2016 Oct 3.
4
Systematic assessment of analytical methods for drug sensitivity prediction from cancer cell line data.基于癌细胞系数据预测药物敏感性的分析方法的系统评估。
Pac Symp Biocomput. 2014:63-74.
5
Genomic approach towards personalized anticancer drug therapy.基因组学方法在个体化抗癌药物治疗中的应用。
Pharmacogenomics. 2012 Jan;13(2):191-9. doi: 10.2217/pgs.11.157.
6
Confident predictability: identifying reliable gene expression patterns for individualized tumor classification using a local minimax kernel algorithm.有信心的可预测性:使用局部极小极大核算法为个体化肿瘤分类确定可靠的基因表达模式。
BMC Med Genomics. 2011 Jan 24;4:10. doi: 10.1186/1755-8794-4-10.
7
The effect of machine learning regression algorithms and sample size on individualized behavioral prediction with functional connectivity features.机器学习回归算法和样本量对功能连接特征的个体化行为预测的影响。
Neuroimage. 2018 Sep;178:622-637. doi: 10.1016/j.neuroimage.2018.06.001. Epub 2018 Jun 2.
8
Open source machine-learning algorithms for the prediction of optimal cancer drug therapies.用于预测最佳癌症药物疗法的开源机器学习算法。
PLoS One. 2017 Oct 26;12(10):e0186906. doi: 10.1371/journal.pone.0186906. eCollection 2017.
9
Learning from examples in the small sample case: face expression recognition.从小样本案例中的示例学习:面部表情识别。
IEEE Trans Syst Man Cybern B Cybern. 2005 Jun;35(3):477-88. doi: 10.1109/tsmcb.2005.846658.
10
Training data selection method for prediction of anticancer drug effects using a genetic algorithm with local search.
Annu Int Conf IEEE Eng Med Biol Soc. 2011;2011:124-8. doi: 10.1109/IEMBS.2011.6089868.

引用本文的文献

1
Machine Learning for Pharmacogenomics and Personalized Medicine: A Ranking Model for Drug Sensitivity Prediction.机器学习在药物基因组学和个性化医学中的应用:药物敏感性预测的排序模型。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):2324-2333. doi: 10.1109/TCBB.2021.3084562. Epub 2022 Aug 8.
2
Identification of Tamoxifen-Resistant Breast Cancer Cell Lines and Drug Response Signature.他莫昔芬耐药乳腺癌细胞系的鉴定及药物反应特征
Front Mol Biosci. 2020 Dec 4;7:564005. doi: 10.3389/fmolb.2020.564005. eCollection 2020.
3
Comprehensive anticancer drug response prediction based on a simple cell line-drug complex network model.

本文引用的文献

1
CellMiner: a web-based suite of genomic and pharmacologic tools to explore transcript and drug patterns in the NCI-60 cell line set.CellMiner:一套基于网络的基因组学和药理学工具套件,用于探索 NCI-60 细胞系集中的转录组和药物模式。
Cancer Res. 2012 Jul 15;72(14):3499-511. doi: 10.1158/0008-5472.CAN-12-1370.
2
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2012 Jan;40(Database issue):D13-25. doi: 10.1093/nar/gkr1184. Epub 2011 Dec 2.
3
Quantifying stability in gene list ranking across microarray derived clinical biomarkers.
基于简单的细胞系-药物复合物网络模型的全面抗癌药物反应预测。
BMC Bioinformatics. 2019 Jan 22;20(1):44. doi: 10.1186/s12859-019-2608-9.
4
CLC-Pred: A freely available web-service for in silico prediction of human cell line cytotoxicity for drug-like compounds.CLC-Pred:一种可免费获取的网络服务,用于对类药物化合物的人细胞系细胞毒性进行计算机模拟预测。
PLoS One. 2018 Jan 25;13(1):e0191838. doi: 10.1371/journal.pone.0191838. eCollection 2018.
5
Computational models for predicting drug responses in cancer research.癌症研究中预测药物反应的计算模型。
Brief Bioinform. 2017 Sep 1;18(5):820-829. doi: 10.1093/bib/bbw065.
6
Asporin Is a Fibroblast-Derived TGF-β1 Inhibitor and a Tumor Suppressor Associated with Good Prognosis in Breast Cancer.阿朴脂蛋白是一种成纤维细胞衍生的转化生长因子-β1抑制剂,也是一种与乳腺癌预后良好相关的肿瘤抑制因子。
PLoS Med. 2015 Sep 1;12(9):e1001871. doi: 10.1371/journal.pmed.1001871. eCollection 2015 Sep.
7
Regulation of epithelial-mesenchymal transition in breast cancer cells by cell contact and adhesion.细胞接触和黏附对乳腺癌细胞上皮-间质转化的调控
Cancer Inform. 2015 Feb 9;14(Suppl 3):1-13. doi: 10.4137/CIN.S18965. eCollection 2015.
量化基因列表在基于微阵列的临床生物标志物中的排序稳定性。
BMC Med Genomics. 2011 Oct 14;4:73. doi: 10.1186/1755-8794-4-73.
4
Predicting in vitro drug sensitivity using Random Forests.利用随机森林预测体外药物敏感性。
Bioinformatics. 2011 Jan 15;27(2):220-4. doi: 10.1093/bioinformatics/btq628. Epub 2010 Dec 5.
5
Regularization Paths for Generalized Linear Models via Coordinate Descent.基于坐标下降法的广义线性模型正则化路径
J Stat Softw. 2010;33(1):1-22.
6
The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models.《基因芯片质量控制(MAQC)-II 研究:基于基因芯片的预测模型的开发和验证的常见实践》。
Nat Biotechnol. 2010 Aug;28(8):827-38. doi: 10.1038/nbt.1665. Epub 2010 Jul 30.
7
ArrayExpress update--from an archive of functional genomics experiments to the atlas of gene expression.ArrayExpress更新——从功能基因组学实验存档到基因表达图谱
Nucleic Acids Res. 2009 Jan;37(Database issue):D868-72. doi: 10.1093/nar/gkn889. Epub 2008 Nov 10.
8
A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification.基于微阵列的癌症分类中随机森林与支持向量机的全面比较
BMC Bioinformatics. 2008 Jul 22;9:319. doi: 10.1186/1471-2105-9-319.
9
Gene selection and classification of microarray data using random forest.使用随机森林进行微阵列数据的基因选择与分类
BMC Bioinformatics. 2006 Jan 6;7:3. doi: 10.1186/1471-2105-7-3.
10
Classification of gene microarrays by penalized logistic regression.基于惩罚逻辑回归的基因微阵列分类
Biostatistics. 2004 Jul;5(3):427-43. doi: 10.1093/biostatistics/5.3.427.