在突变序列空间中选择下一个研究方向：信息丰富的p53癌症拯救突变体的主动学习

Choosing where to look next in a mutation sequence space: Active Learning of informative p53 cancer rescue mutants.

作者信息

Danziger Samuel A, Zeng Jue, Wang Ying, Brachmann Rainer K, Lathrop Richard H

机构信息

Department of Biomedical Engineering, University of California, Irvine, California 92697, USA.

出版信息

Bioinformatics. 2007 Jul 1;23(13):i104-14. doi: 10.1093/bioinformatics/btm166.

DOI:10.1093/bioinformatics/btm166

PMID:17646286

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2811495/

Abstract

Many biomedical projects would benefit from reducing the time and expense of in vitro experimentation by using computer models for in silico predictions. These models may help determine which expensive biological data are most useful to acquire next. Active Learning techniques for choosing the most informative data enable biologists and computer scientists to optimize experimental data choices for rapid discovery of biological function. To explore design choices that affect this desirable behavior, five novel and five existing Active Learning techniques, together with three control methods, were tested on 57 previously unknown p53 cancer rescue mutants for their ability to build classifiers that predict protein function. The best of these techniques, Maximum Curiosity, improved the baseline accuracy of 56-77%. This article shows that Active Learning is a useful tool for biomedical research, and provides a case study of interest to others facing similar discovery challenges.

摘要

许多生物医学项目将受益于通过使用计算机模型进行计算机模拟预测来减少体外实验的时间和费用。这些模型可能有助于确定接下来获取哪些昂贵的生物学数据最为有用。用于选择最具信息性数据的主动学习技术使生物学家和计算机科学家能够优化实验数据选择，以快速发现生物学功能。为了探索影响这种理想行为的设计选择，对5种新颖的和5种现有的主动学习技术以及3种控制方法进行了测试，以考察它们针对57个先前未知的p53癌症拯救突变体构建预测蛋白质功能分类器的能力。其中最好的技术——最大好奇心，将基线准确率提高了56%-77%。本文表明主动学习是生物医学研究的一种有用工具，并为面临类似发现挑战的其他人提供了一个有趣的案例研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e9d/2811495/750e4d8b267d/nihms169168f1.jpg

相似文献

Choosing where to look next in a mutation sequence space: Active Learning of informative p53 cancer rescue mutants.在突变序列空间中选择下一个研究方向：信息丰富的p53癌症拯救突变体的主动学习

Bioinformatics. 2007 Jul 1;23(13):i104-14. doi: 10.1093/bioinformatics/btm166.

Predicting positive p53 cancer rescue regions using Most Informative Positive (MIP) active learning.使用最具信息性阳性（MIP）主动学习预测p53癌症救援阳性区域。

PLoS Comput Biol. 2009 Sep;5(9):e1000498. doi: 10.1371/journal.pcbi.1000498. Epub 2008 Sep 4.

Functional census of mutation sequence spaces: the example of p53 cancer rescue mutants.突变序列空间的功能普查：以p53癌症拯救突变体为例。

IEEE/ACM Trans Comput Biol Bioinform. 2006 Apr-Jun;3(2):114-25. doi: 10.1109/TCBB.2006.22.

Prediction of cancer rescue p53 mutants in silico using Naïve Bayes learning methodology.使用朴素贝叶斯学习方法在计算机上预测癌症拯救性p53突变体。

Protein Pept Lett. 2013 Nov;20(11):1280-91. doi: 10.2174/09298665113209990046.

Identification and analysis of driver missense mutations using rotation forest with feature selection.使用带特征选择的旋转森林算法识别和分析驱动型错义突变

Biomed Res Int. 2014;2014:905951. doi: 10.1155/2014/905951. Epub 2014 Aug 27.

A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis.用于微阵列基因表达癌症诊断的多类别分类方法的综合评估。

Bioinformatics. 2005 Mar 1;21(5):631-43. doi: 10.1093/bioinformatics/bti033. Epub 2004 Sep 16.

Comprehensive detection of germline variants by MSK-IMPACT, a clinical diagnostic platform for solid tumor molecular oncology and concurrent cancer predisposition testing.通过MSK-IMPACT对种系变异进行全面检测，MSK-IMPACT是一种用于实体瘤分子肿瘤学和并发癌症易感性检测的临床诊断平台。

BMC Med Genomics. 2017 May 19;10(1):33. doi: 10.1186/s12920-017-0271-4.

Role of artificial intelligence in cancer detection using protein p53: A Review.人工智能在利用蛋白质p53进行癌症检测中的作用：综述

Mol Biol Rep. 2024 Dec 11;52(1):46. doi: 10.1007/s11033-024-10051-4.

p53--master and commander.p53——掌控者

N Engl J Med. 2007 Dec 20;357(25):2539-41. doi: 10.1056/NEJMp0707422.

Effects of TP53 mutational status on gene expression patterns across 10 human cancer types.TP53 基因突变状态对 10 种人类癌症类型中基因表达模式的影响。

J Pathol. 2014 Apr;232(5):522-33. doi: 10.1002/path.4321. Epub 2014 Jan 29.

引用本文的文献

Roles of computational modelling in understanding p53 structure, biology, and its therapeutic targeting.计算建模在理解 p53 结构、生物学及其治疗靶点中的作用。

J Mol Cell Biol. 2019 Apr 1;11(4):306-316. doi: 10.1093/jmcb/mjz009.

An indicator cell assay for blood-based diagnostics.一种用于基于血液诊断的指示细胞检测法。

PLoS One. 2017 Jun 8;12(6):e0178608. doi: 10.1371/journal.pone.0178608. eCollection 2017.

Deciding when to stop: efficient experimentation to learn to predict drug-target interactions.决定何时停止：通过高效实验学习预测药物-靶点相互作用

BMC Bioinformatics. 2015 Jul 9;16:213. doi: 10.1186/s12859-015-0650-9.

Structures of oncogenic, suppressor and rescued p53 core-domain variants: mechanisms of mutant p53 rescue.致癌性、抑癌性及挽救型p53核心结构域变体的结构：突变型p53挽救机制

Acta Crystallogr D Biol Crystallogr. 2013 Oct;69(Pt 10):2146-56. doi: 10.1107/S0907444913020830. Epub 2013 Sep 20.

Prediction of P53 mutants (multiple sites) transcriptional activity based on structural (2D&3D) properties.基于结构（2D 和 3D）特性预测 P53 突变体（多个位点）的转录活性。

PLoS One. 2013;8(2):e55401. doi: 10.1371/journal.pone.0055401. Epub 2013 Feb 13.

Computational identification of a transiently open L1/S3 pocket for reactivation of mutant p53.计算鉴定突变型 p53 重新激活的 L1/S3 口袋瞬态开放。

Nat Commun. 2013;4:1407. doi: 10.1038/ncomms2361.

Predicting transcriptional activity of multiple site p53 mutants based on hybrid properties.基于混合特性预测多个 p53 突变位点的转录活性。

PLoS One. 2011;6(8):e22940. doi: 10.1371/journal.pone.0022940. Epub 2011 Aug 8.

An active role for machine learning in drug development.机器学习在药物研发中的积极作用。

Nat Chem Biol. 2011 Jun;7(6):327-30. doi: 10.1038/nchembio.576.

An economic framework to prioritize confirmatory tests after a high-throughput screen.一种用于在高通量筛选后对确证性试验进行优先级排序的经济框架。

J Biomol Screen. 2010 Jul;15(6):680-6. doi: 10.1177/1087057110372803. Epub 2010 Jun 14.

Predicting positive p53 cancer rescue regions using Most Informative Positive (MIP) active learning.使用最具信息性阳性（MIP）主动学习预测p53癌症救援阳性区域。

PLoS Comput Biol. 2009 Sep;5(9):e1000498. doi: 10.1371/journal.pcbi.1000498. Epub 2008 Sep 4.

本文引用的文献

Functional census of mutation sequence spaces: the example of p53 cancer rescue mutants.突变序列空间的功能普查：以p53癌症拯救突变体为例。

IEEE/ACM Trans Comput Biol Bioinform. 2006 Apr-Jun;3(2):114-25. doi: 10.1109/TCBB.2006.22.

A global map of p53 transcription-factor binding sites in the human genome.人类基因组中p53转录因子结合位点的全球图谱。

Cell. 2006 Jan 13;124(1):207-19. doi: 10.1016/j.cell.2005.10.043.

The UMD TP53 database and website: update and revisions.UMD TP53数据库及网站：更新与修订

Hum Mutat. 2006 Jan;27(1):14-20. doi: 10.1002/humu.20269.

A unified framework for image retrieval using keyword and visual features.一种使用关键词和视觉特征进行图像检索的统一框架。

IEEE Trans Image Process. 2005 Jul;14(7):979-89. doi: 10.1109/tip.2005.847289.

Stress-induced p53 runs a transcription-independent death program.应激诱导的p53执行一个不依赖转录的死亡程序。

Biochem Biophys Res Commun. 2005 Jun 10;331(3):843-50. doi: 10.1016/j.bbrc.2005.03.187.

Global cancer statistics, 2002.2002年全球癌症统计数据。

CA Cancer J Clin. 2005 Mar-Apr;55(2):74-108. doi: 10.3322/canjclin.55.2.74.

Onvergence and application of online active sampling using orthogonal pillar vectors.使用正交柱向量的在线主动采样的收敛性与应用

IEEE Trans Pattern Anal Mach Intell. 2004 Sep;26(9):1197-207. doi: 10.1109/TPAMI.2004.61.

Binding of Rad51 and other peptide sequences to a promiscuous, highly electrostatic binding site in p53.Rad51及其他肽序列与p53中一个混杂的、高度静电结合位点的结合。

J Biol Chem. 2005 Mar 4;280(9):8051-9. doi: 10.1074/jbc.M411176200. Epub 2004 Dec 20.

The tumor suppressor gene TP53: implications for cancer management and therapy.肿瘤抑制基因TP53：对癌症管理和治疗的启示

Crit Rev Clin Lab Sci. 2004;41(5-6):551-83. doi: 10.1080/10408360490504952.

Active learning with support vector machine applied to gene expression data for cancer classification.应用支持向量机的主动学习在癌症分类基因表达数据中的应用。

J Chem Inf Comput Sci. 2004 Nov-Dec;44(6):1936-41. doi: 10.1021/ci049810a.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验