新型马尔可夫-香农熵模型评估复杂网络的连接质量：从分子到细胞通路、寄生虫-宿主、神经、工业和法律-社会网络。

Department of Microbiology & Parasitology, Faculty of Pharmacy, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain.

J Theor Biol. 2012 Jan 21;293:174-88. doi: 10.1016/j.jtbi.2011.10.016. Epub 2011 Oct 25.

Graph and Complex Network theory is expanding its application to different levels of matter organization such as molecular, biological, technological, and social networks. A network is a set of items, usually called nodes, with connections between them, which are called links or edges. There are many different experimental and/or theoretical methods to assign node-node links depending on the type of network we want to construct. Unfortunately, the use of a method for experimental reevaluation of the entire network is very expensive in terms of time and resources; thus the development of cheaper theoretical methods is of major importance. In addition, different methods to link nodes in the same type of network are not totally accurate in such a way that they do not always coincide. In this sense, the development of computational methods useful to evaluate connectivity quality in complex networks (a posteriori of network assemble) is a goal of major interest. In this work, we report for the first time a new method to calculate numerical quality scores S(L(ij)) for network links L(ij) (connectivity) based on the Markov-Shannon Entropy indices of order k-th (θ(k)) for network nodes. The algorithm may be summarized as follows: (i) first, the θ(k)(j) values are calculated for all j-th nodes in a complex network already constructed; (ii) A Linear Discriminant Analysis (LDA) is used to seek a linear equation that discriminates connected or linked (L(ij)=1) pairs of nodes experimentally confirmed from non-linked ones (L(ij)=0); (iii) the new model is validated with external series of pairs of nodes; (iv) the equation obtained is used to re-evaluate the connectivity quality of the network, connecting/disconnecting nodes based on the quality scores calculated with the new connectivity function. This method was used to study different types of large networks. The linear models obtained produced the following results in terms of overall accuracy for network reconstruction: Metabolic networks (72.3%), Parasite-Host networks (93.3%), CoCoMac brain cortex co-activation network (89.6%), NW Spain fasciolosis spreading network (97.2%), Spanish financial law network (89.9%) and World trade network for Intelligent & Active Food Packaging (92.8%). In order to seek these models, we studied an average of 55,388 pairs of nodes in each model and a total of 332,326 pairs of nodes in all models. Finally, this method was used to solve a more complicated problem. A model was developed to score the connectivity quality in the Drug-Target network of US FDA approved drugs. In this last model the θ(k) values were calculated for three types of molecular networks representing different levels of organization: drug molecular graphs (atom-atom bonds), protein residue networks (amino acid interactions), and drug-target network (compound-protein binding). The overall accuracy of this model was 76.3%. This work opens a new door to the computational reevaluation of network connectivity quality (collation) for complex systems in molecular, biomedical, technological, and legal-social sciences as well as in world trade and industry.

图论和复杂网络理论正在将其应用扩展到分子、生物、技术和社交网络等不同层次的物质组织。网络是一组通常称为节点的项目，节点之间存在连接，这些连接称为链接或边。根据我们要构建的网络类型，有许多不同的实验和/或理论方法来分配节点-节点链接。不幸的是，使用一种方法对整个网络进行实验重新评估在时间和资源方面非常昂贵；因此，开发更便宜的理论方法非常重要。此外，在同一类型的网络中连接节点的不同方法并不完全准确，它们并不总是一致的。在这种意义上，开发用于评估复杂网络中连接质量的计算方法（网络组装后的后验）是一个主要关注的目标。在这项工作中，我们首次报告了一种新方法，用于根据网络节点的第 k 阶马尔可夫-香农熵指数 (θ(k)) 为网络链接 L(ij) (连接性) 计算数值质量分数 S(L(ij))。该算法可以概括为以下步骤：（i）首先，为已经构建的复杂网络中的所有第 j 个节点计算 θ(k)(j) 值；（ii）使用线性判别分析 (LDA) 来寻找一个线性方程，该方程可以区分实验上确认的连接或链接（L(ij)=1）节点对与非链接（L(ij)=0）节点对；（iii）使用外部节点对系列验证新模型；（iv）使用新的连接函数计算的质量分数重新评估网络的连接质量，根据计算出的质量分数连接/断开节点。该方法用于研究不同类型的大型网络。获得的线性模型在网络重建的整体准确性方面产生了以下结果：代谢网络（72.3%）、寄生虫-宿主网络（93.3%）、CoCoMac 大脑皮层共激活网络（89.6%）、西班牙西北部 fasciolosis 传播网络（97.2%）、西班牙金融法网络（89.9%）和世界智能与主动食品包装贸易网络（92.8%）。为了找到这些模型，我们在每个模型中研究了平均 55,388 对节点，并在所有模型中总共研究了 332,326 对节点。最后，该方法用于解决更复杂的问题。开发了一种模型来评分美国 FDA 批准药物的药物-靶标网络的连接质量。在最后一个模型中，为代表不同组织层次的三种类型的分子网络计算了θ(k)值：药物分子图（原子-原子键）、蛋白质残基网络（氨基酸相互作用）和药物-靶标网络（化合物-蛋白质结合）。该模型的整体准确性为 76.3%。这项工作为计算分子、生物医学、技术和法律社会科学以及世界贸易和工业中的复杂系统的网络连接质量（整理）开辟了新的途径。

相似文献

New Markov-Shannon Entropy models to assess connectivity quality in complex networks: from molecular to cellular pathway, Parasite-Host, Neural, Industry, and Legal-Social networks.

J Theor Biol. 2012 Jan 21;293:174-88. doi: 10.1016/j.jtbi.2011.10.016. Epub 2011 Oct 25.

The Rücker-Markov invariants of complex Bio-Systems: applications in Parasitology and Neuroinformatics.

Biosystems. 2013 Mar;111(3):199-207. doi: 10.1016/j.biosystems.2013.02.006. Epub 2013 Feb 27.

New Markov-autocorrelation indices for re-evaluation of links in chemical and biological complex networks used in metabolomics, parasitology, neurosciences, and epidemiology.

J Chem Inf Model. 2012 Dec 21;52(12):3331-40. doi: 10.1021/ci300321f. Epub 2012 Nov 26.

Modeling complex metabolic reactions, ecological systems, and financial and legal networks with MIANN models based on Markov-Wiener node descriptors.

J Chem Inf Model. 2014 Jan 27;54(1):16-29. doi: 10.1021/ci400280n. Epub 2013 Dec 23.

2D MI-DRAGON: a new predictor for protein-ligands interactions and theoretic-experimental studies of US FDA drug-target network, oxoisoaporphine inhibitors for MAO-A and human parasite proteins.

Eur J Med Chem. 2011 Dec;46(12):5838-51. doi: 10.1016/j.ejmech.2011.09.045. Epub 2011 Oct 1.

Using entropy of drug and protein graphs to predict FDA drug-target network: theoretic-experimental study of MAO inhibitors and hemoglobin peptides from Fasciola hepatica.

Eur J Med Chem. 2011 Apr;46(4):1074-94. doi: 10.1016/j.ejmech.2011.01.023. Epub 2011 Jan 21.

Unified QSAR approach to antimicrobials. Part 3: first multi-tasking QSAR model for input-coded prediction, structural back-projection, and complex networks clustering of antiprotozoal compounds.

Bioorg Med Chem. 2008 Jun 1;16(11):5871-80. doi: 10.1016/j.bmc.2008.04.068. Epub 2008 Apr 29.

Quantitative Proteome-Property Relationships (QPPRs). Part 1: finding biomarkers of organic drugs with mean Markov connectivity indices of spiral networks of blood mass spectra.

Bioorg Med Chem. 2008 Nov 15;16(22):9684-93. doi: 10.1016/j.bmc.2008.10.004. Epub 2008 Oct 5.

Gene expression complex networks: synthesis, identification, and analysis.

J Comput Biol. 2011 Oct;18(10):1353-67. doi: 10.1089/cmb.2010.0118. Epub 2011 May 6.

MISS-Prot: web server for self/non-self discrimination of protein residue networks in parasites; theory and experiments in Fasciola peptides and Anisakis allergens.

Mol Biosyst. 2011 Jun;7(6):1938-55. doi: 10.1039/c1mb05069a. Epub 2011 Apr 6.

引用本文的文献

Fractal Geometry Meets Computational Intelligence: Future Perspectives.

Adv Neurobiol. 2024;36:983-997. doi: 10.1007/978-3-031-47606-8_48.

Alignment-Free Method to Predict Enzyme Classes and Subclasses.

Int J Mol Sci. 2019 Oct 29;20(21):5389. doi: 10.3390/ijms20215389.

Net-Net Auto Machine Learning (AutoML) Prediction of Complex Ecosystems.

Sci Rep. 2018 Aug 17;8(1):12340. doi: 10.1038/s41598-018-30637-w.

Modeling the Sulfur Regulome by Quantifying the Storage and Communication of Information.

mSystems. 2018 Jun 19;3(3). doi: 10.1128/mSystems.00189-17. eCollection 2018 May-Jun.

PClass: Protein Quaternary Structure Classification by Using Bootstrapping Strategy as Model Selection.

Genes (Basel). 2018 Feb 14;9(2):91. doi: 10.3390/genes9020091.

Information entropy-based fitting of the disease trajectory of brain ischemia-induced vascular cognitive impairment.

Neural Regen Res. 2012 Mar 25;7(9):697-702. doi: 10.3969/j.issn.1673-5374.2012.09.010.

Prediction of multi-target networks of neuroprotective compounds with entropy indices and synthesis, assay, and theoretical study of new asymmetric 1,2-rasagiline carbamates.

Int J Mol Sci. 2014 Sep 24;15(9):17035-64. doi: 10.3390/ijms150917035.

Complexity in cancer biology: is systems biology the answer?

Cancer Med. 2013 Apr;2(2):164-77. doi: 10.1002/cam4.62. Epub 2013 Feb 17.

Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review.

Pharmacol Ther. 2013 Jun;138(3):333-408. doi: 10.1016/j.pharmthera.2013.01.016. Epub 2013 Feb 4.

Information properties of naturally-occurring proteins: Fourier analysis and complexity phase plots.

Protein J. 2012 Oct;31(7):550-63. doi: 10.1007/s10930-012-9432-7.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

New Markov-Shannon Entropy models to assess connectivity quality in complex networks: from molecular to cellular pathway, Parasite-Host, Neural, Industry, and Legal-Social networks.

J Theor Biol. 2012 Jan 21;293:174-88. doi: 10.1016/j.jtbi.2011.10.016. Epub 2011 Oct 25.

The Rücker-Markov invariants of complex Bio-Systems: applications in Parasitology and Neuroinformatics.

Biosystems. 2013 Mar;111(3):199-207. doi: 10.1016/j.biosystems.2013.02.006. Epub 2013 Feb 27.

New Markov-autocorrelation indices for re-evaluation of links in chemical and biological complex networks used in metabolomics, parasitology, neurosciences, and epidemiology.

J Chem Inf Model. 2012 Dec 21;52(12):3331-40. doi: 10.1021/ci300321f. Epub 2012 Nov 26.

Modeling complex metabolic reactions, ecological systems, and financial and legal networks with MIANN models based on Markov-Wiener node descriptors.

J Chem Inf Model. 2014 Jan 27;54(1):16-29. doi: 10.1021/ci400280n. Epub 2013 Dec 23.

2D MI-DRAGON: a new predictor for protein-ligands interactions and theoretic-experimental studies of US FDA drug-target network, oxoisoaporphine inhibitors for MAO-A and human parasite proteins.

Eur J Med Chem. 2011 Dec;46(12):5838-51. doi: 10.1016/j.ejmech.2011.09.045. Epub 2011 Oct 1.

Using entropy of drug and protein graphs to predict FDA drug-target network: theoretic-experimental study of MAO inhibitors and hemoglobin peptides from Fasciola hepatica.

Eur J Med Chem. 2011 Apr;46(4):1074-94. doi: 10.1016/j.ejmech.2011.01.023. Epub 2011 Jan 21.

Unified QSAR approach to antimicrobials. Part 3: first multi-tasking QSAR model for input-coded prediction, structural back-projection, and complex networks clustering of antiprotozoal compounds.

Bioorg Med Chem. 2008 Jun 1;16(11):5871-80. doi: 10.1016/j.bmc.2008.04.068. Epub 2008 Apr 29.

Quantitative Proteome-Property Relationships (QPPRs). Part 1: finding biomarkers of organic drugs with mean Markov connectivity indices of spiral networks of blood mass spectra.

Bioorg Med Chem. 2008 Nov 15;16(22):9684-93. doi: 10.1016/j.bmc.2008.10.004. Epub 2008 Oct 5.

Gene expression complex networks: synthesis, identification, and analysis.

J Comput Biol. 2011 Oct;18(10):1353-67. doi: 10.1089/cmb.2010.0118. Epub 2011 May 6.

MISS-Prot: web server for self/non-self discrimination of protein residue networks in parasites; theory and experiments in Fasciola peptides and Anisakis allergens.

Mol Biosyst. 2011 Jun;7(6):1938-55. doi: 10.1039/c1mb05069a. Epub 2011 Apr 6.

引用本文的文献

Fractal Geometry Meets Computational Intelligence: Future Perspectives.

Adv Neurobiol. 2024;36:983-997. doi: 10.1007/978-3-031-47606-8_48.

Alignment-Free Method to Predict Enzyme Classes and Subclasses.

Int J Mol Sci. 2019 Oct 29;20(21):5389. doi: 10.3390/ijms20215389.

Net-Net Auto Machine Learning (AutoML) Prediction of Complex Ecosystems.

Sci Rep. 2018 Aug 17;8(1):12340. doi: 10.1038/s41598-018-30637-w.

Modeling the Sulfur Regulome by Quantifying the Storage and Communication of Information.

mSystems. 2018 Jun 19;3(3). doi: 10.1128/mSystems.00189-17. eCollection 2018 May-Jun.

PClass: Protein Quaternary Structure Classification by Using Bootstrapping Strategy as Model Selection.

Genes (Basel). 2018 Feb 14;9(2):91. doi: 10.3390/genes9020091.

Information entropy-based fitting of the disease trajectory of brain ischemia-induced vascular cognitive impairment.

Neural Regen Res. 2012 Mar 25;7(9):697-702. doi: 10.3969/j.issn.1673-5374.2012.09.010.

Prediction of multi-target networks of neuroprotective compounds with entropy indices and synthesis, assay, and theoretical study of new asymmetric 1,2-rasagiline carbamates.

Int J Mol Sci. 2014 Sep 24;15(9):17035-64. doi: 10.3390/ijms150917035.

Complexity in cancer biology: is systems biology the answer?

Cancer Med. 2013 Apr;2(2):164-77. doi: 10.1002/cam4.62. Epub 2013 Feb 17.

Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review.

Pharmacol Ther. 2013 Jun;138(3):333-408. doi: 10.1016/j.pharmthera.2013.01.016. Epub 2013 Feb 4.

Information properties of naturally-occurring proteins: Fourier analysis and complexity phase plots.

Protein J. 2012 Oct;31(7):550-63. doi: 10.1007/s10930-012-9432-7.

New Markov-Shannon Entropy models to assess connectivity quality in complex networks: from molecular to cellular pathway, Parasite-Host, Neural, Industry, and Legal-Social networks.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献