Suppr超能文献

用于预测人类结直肠癌遗传成分生物标志物的网络定量构效关系模型。

A network-QSAR model for prediction of genetic-component biomarkers in human colorectal cancer.

机构信息

Department of Organic Chemistry, Faculty of Pharmacy, and Unit of Bioinformatics and Connectivity Analysis of Systems (UBICAS), Institute of Industrial Pharmacy, University of Santiago de Compostela, Santiago de Compostela 15782, Spain.

出版信息

J Theor Biol. 2009 Dec 7;261(3):449-58. doi: 10.1016/j.jtbi.2009.07.031. Epub 2009 Aug 3.

Abstract

The combination of the network theory and the calculation of topological indices (TIs) allow establishing relationships between the molecular structure of large molecules like the genes and proteins and their properties at a biological level. This type of models can be considered quantitative structure-activity relationships (QSAR) for biopolymers. In the present work a QSAR model is reported for proteins, related to human colorectal cancer (HCC) and codified by different genes that have been identified experimentally by Sjöblom et al. [2006. The consensus coding sequences of human breast and colorectal cancers. Science 314, 268-274] among more than 10000 human genes. The 69 proteins related to human colorectal cancer (HCCp) and a control group of 200 proteins not related to HCC (no-HCCp) were represented through an HP Lattice type Network. Starting from the generated graphs we calculate a set of descriptors of electrostatic potential type (xi(k)) that allow to establish, through a linear discriminant analysis (LDA), a QSAR model of relatively high percentage of good classification (higher than 80%) to differentiate between HCCp and no-HCCp proteins. The purpose of this study is helping to predict the possible implication of a certain gene and/or protein (biomarker) in the colorectal cancer. Different procedures of validation of the obtained model have been carried out in order to corroborate its stability, including cross-validation series (CV) and evaluation of an additional series of 200 no-HCCp. This biostatistic methodology could be applied to predict human colorectal cancer biomarkers and to understand much better the biological aspects of this disease.

摘要

网络理论与拓扑指数(TIs)计算的结合,使得建立大分子(如基因和蛋白质)的分子结构与其在生物水平上的性质之间的关系成为可能。这种类型的模型可以被认为是生物聚合物的定量构效关系(QSAR)。在本工作中,报道了一种与人类结直肠癌(HCC)相关的蛋白质的 QSAR 模型,该模型由 Sjöblom 等人[2006. 人类乳腺癌和结直肠癌的共识编码序列。Science 314, 268-274]通过实验确定的不同基因编码。在超过 10000 个人类基因中。与人类结直肠癌(HCCp)相关的 69 种蛋白质和 200 种与 HCC 无关的对照蛋白质(非 HCCp)通过 HP 格型网络表示。从生成的图形开始,我们计算了一组静电势类型的描述符(xi(k)),通过线性判别分析(LDA),可以建立一个相对较高的分类百分比(高于 80%)的 QSAR 模型,以区分 HCCp 和非 HCCp 蛋白质。本研究的目的是帮助预测某一基因和/或蛋白质(生物标志物)在结直肠癌中的可能作用。为了验证模型的稳定性,进行了不同的验证程序,包括交叉验证系列(CV)和对另外 200 个非 HCCp 的评估。这种生物统计学方法可用于预测人类结直肠癌的生物标志物,并更好地理解该疾病的生物学方面。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验