Cruz-Monteagudo Maykel, Munteanu Cristian Robert, Borges Fernanda, Cordeiro M Natália D S, Uriarte Eugenio, González-Díaz Humberto
Unit of Bioinformatics & Connectivity Analysis, Institute of Industrial Pharmacy, Faculty of Pharmacy, Department of Organic Chemistry, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain.
Bioorg Med Chem. 2008 Nov 15;16(22):9684-93. doi: 10.1016/j.bmc.2008.10.004. Epub 2008 Oct 5.
Numerical parameters of the molecular networks, also referred as Topological Indices or Connectivity Indices (CIs), have been used in Bioorganic and Medicinal Chemistry to find Quantitative Structure-Activity, Property or Toxicity Relationship (QSAR, QSPR and QSTR) models. QSPR models generally use CIs as inputs to predict the biological activity of compounds. However, the literature does not evidence a great effort to find QSAR-like models for other biologically and chemically relevant systems. For instance, blood proteome constitutes a protein-rich information reservoir, since the serum proteome Mass Spectra (MS) represents a potential information source for the early detection of Biomarkers for diseases and/or drug-induced toxicities. The concept of mass spectrum network (MS network) for a single protein is already well-known. However, there are no reported results on the use of CIs for a MS network of a whole proteome to explore MS patterns. In this work, we introduced for the first time a novel network representation and the CIs for the MS of blood proteome samples. The new network bases on Randic's Spiral network have been previously introduced for protein sequences. The new MS CIs, called here Spiral Markov Connectivity (SMC(k)) of the MS Spiral graph can be calculated with the software MARCH-INSIDE, combining network and Markov model theory. The SMC(k) values could be used to seek QSAR-like models, called in this work Quantitative Proteome-Property Relationships (QPPRs). We calculate the SMC(k) values for 62 blood samples and fit a QPPR model by discriminating proteome MS, typical of individuals susceptible to suffer drug-induced cardiotoxicity from control samples. The accuracy, sensitivity, and specificity values of the QPPR model were between 73.08% and 87.5% in training and validation series. This work points to QPPR models as a powerful tool for MS detection of biomarkers in proteomics.
分子网络的数值参数,也被称为拓扑指数或连接性指数(CIs),已被用于生物有机化学和药物化学中,以寻找定量构效关系、性质或毒性关系(QSAR、QSPR和QSTR)模型。QSPR模型通常使用CIs作为输入来预测化合物的生物活性。然而,文献中并没有显示出为其他生物学和化学相关系统寻找类似QSAR模型的巨大努力。例如,血液蛋白质组构成了一个富含蛋白质的信息库,因为血清蛋白质组质谱(MS)代表了疾病和/或药物诱导毒性生物标志物早期检测的潜在信息来源。单个蛋白质的质谱网络(MS网络)概念已经广为人知。然而,目前尚无关于将CIs用于整个蛋白质组的MS网络以探索MS模式的报道结果。在这项工作中,我们首次引入了一种新颖的网络表示方法以及血液蛋白质组样本MS的CIs。基于Randić螺旋网络的新网络先前已被引入用于蛋白质序列。这里称为MS螺旋图的螺旋马尔可夫连接性(SMC(k))的新MS CIs可以使用MARCH-INSIDE软件,结合网络和马尔可夫模型理论来计算。SMC(k)值可用于寻找类似QSAR的模型,在这项工作中称为定量蛋白质组-性质关系(QPPRs)。我们计算了62个血液样本的SMC(k)值,并通过区分易患药物诱导心脏毒性个体的蛋白质组MS与对照样本,拟合了一个QPPR模型。在训练和验证系列中,QPPR模型的准确性、敏感性和特异性值在73.08%至87.5%之间。这项工作表明QPPR模型是蛋白质组学中MS检测生物标志物的有力工具。