Suppr超能文献

基于蛋白质伪折叠HP晶格网络静电势的人乳腺癌生物标志物无比对预测的定量构效关系模型

QSAR model for alignment-free prediction of human breast cancer biomarkers based on electrostatic potentials of protein pseudofolding HP-lattice networks.

作者信息

Vilar Santiago, González-Díaz Humberto, Santana Lourdes, Uriarte Eugenio

机构信息

Unit of Bioinformatics and Connectivity Analysis, Institute of Industrial Pharmacy, and Department of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, Santiago de Compostela 15782, Spain.

出版信息

J Comput Chem. 2008 Dec;29(16):2613-22. doi: 10.1002/jcc.21016.

Abstract

Network theory allows relationships to be established between numerical parameters that describe the molecular structure of genes and proteins and their biological properties. These models can be considered as quantitative structure-activity relationships (QSAR) for biopolymers. The work described here concerns the first QSAR model for 122 proteins that are associated with human breast cancer (HBC), as identified experimentally by Sjöblom et al. (Science 2006, 314, 268) from over 10,000 human proteins. In this study, the 122 proteins related to HBC (HBCp) and a control group of 200 proteins that are not related to HBC (non-HBCp) were forced to fold in an HP lattice network. From these networks a series of electrostatic potential parameters (xi(k)) was calculated to describe each protein numerically. The use of xi(k) as an entry point to linear discriminant analysis led to a QSAR model to discriminate between HBCp and non-HBCp, and this model could help to predict the involvement of a certain gene and/or protein in HBC. In addition, validation procedures were carried out on the model and these included an external prediction series and evaluation of an additional series of 1000 non-HBCp. In all cases good levels of classification were obtained with values above 80%. This study represents the first example of a QSAR model for the computational chemistry inspired search of potential HBC protein biomarkers.

摘要

网络理论允许在描述基因和蛋白质分子结构及其生物学特性的数值参数之间建立关系。这些模型可被视为生物聚合物的定量构效关系(QSAR)。本文所述工作涉及122种与人类乳腺癌(HBC)相关蛋白质的首个QSAR模型,这些蛋白质由Sjöblom等人(《科学》,2006年,314卷,268页)从一万多种人类蛋白质中通过实验鉴定出来。在本研究中,将122种与HBC相关的蛋白质(HBCp)和200种与HBC不相关的蛋白质组成的对照组(非HBCp)在HP晶格网络中进行强制折叠。从这些网络中计算出一系列静电势参数(xi(k))以对每种蛋白质进行数值描述。将xi(k)用作线性判别分析的切入点,得到了一个区分HBCp和非HBCp的QSAR模型,该模型有助于预测某个基因和/或蛋白质在HBC中的参与情况。此外,对该模型进行了验证程序,包括外部预测系列以及对另外1000种非HBCp的评估。在所有情况下,分类水平都很高,值超过80%。这项研究代表了用于计算化学启发式搜索潜在HBC蛋白质生物标志物的QSAR模型的首个实例。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验