用于分析生物网络的新型拓扑描述符。

Novel topological descriptors for analyzing biological networks.

作者信息

Dehmer Matthias M, Barbarini Nicola N, Varmuza Kurt K, Graber Armin A

机构信息

Institute for Bioinformatics and Translational Research, UMIT, Eduard Wallnoefer Zentrum 1, Hall in Tyrol, Austria.

出版信息

BMC Struct Biol. 2010 Jun 17;10:18. doi: 10.1186/1472-6807-10-18.

DOI:10.1186/1472-6807-10-18

PMID:20565796

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2906494/

Abstract

BACKGROUND

Topological descriptors, other graph measures, and in a broader sense, graph-theoretical methods, have been proven as powerful tools to perform biological network analysis. However, the majority of the developed descriptors and graph-theoretical methods does not have the ability to take vertex- and edge-labels into account, e.g., atom- and bond-types when considering molecular graphs. Indeed, this feature is important to characterize biological networks more meaningfully instead of only considering pure topological information.

RESULTS

In this paper, we put the emphasis on analyzing a special type of biological networks, namely bio-chemical structures. First, we derive entropic measures to calculate the information content of vertex- and edge-labeled graphs and investigate some useful properties thereof. Second, we apply the mentioned measures combined with other well-known descriptors to supervised machine learning methods for predicting Ames mutagenicity. Moreover, we investigate the influence of our topological descriptors - measures for only unlabeled vs. measures for labeled graphs - on the prediction performance of the underlying graph classification problem.

CONCLUSIONS

Our study demonstrates that the application of entropic measures to molecules representing graphs is useful to characterize such structures meaningfully. For instance, we have found that if one extends the measures for determining the structural information content of unlabeled graphs to labeled graphs, the uniqueness of the resulting indices is higher. Because measures to structurally characterize labeled graphs are clearly underrepresented so far, the further development of such methods might be valuable and fruitful for solving problems within biological network analysis.

摘要

背景

拓扑描述符、其他图测度，以及更广义地说，图论方法，已被证明是进行生物网络分析的有力工具。然而，大多数已开发的描述符和图论方法无法考虑顶点和边的标签，例如在考虑分子图时的原子类型和键类型。实际上，这一特征对于更有意义地表征生物网络很重要，而不仅仅是考虑纯粹的拓扑信息。

结果

在本文中，我们着重分析一种特殊类型的生物网络，即生化结构。首先，我们推导熵测度以计算带顶点和边标签的图的信息含量，并研究其一些有用的性质。其次，我们将上述测度与其他知名描述符相结合，应用于监督机器学习方法来预测埃姆斯致突变性。此外，我们研究了我们的拓扑描述符——仅针对无标签图的测度与针对带标签图的测度——对基础图分类问题预测性能的影响。

结论

我们的研究表明，将熵测度应用于表示图的分子，有助于有意义地表征此类结构。例如，我们发现，如果将用于确定无标签图结构信息含量的测度扩展到带标签图，所得指标的唯一性会更高。由于目前用于在结构上表征带标签图的测度明显不足，进一步开发此类方法可能对解决生物网络分析中的问题有价值且富有成果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f81/2906494/5998e976e60c/1472-6807-10-18-1.jpg

相似文献

Novel topological descriptors for analyzing biological networks.用于分析生物网络的新型拓扑描述符。

BMC Struct Biol. 2010 Jun 17;10:18. doi: 10.1186/1472-6807-10-18.

Structural measures for network biology using QuACN.使用 QuACN 的网络生物学结构措施。

BMC Bioinformatics. 2011 Dec 24;12:492. doi: 10.1186/1471-2105-12-492.

QuBiLS-MAS, open source multi-platform software for atom- and bond-based topological (2D) and chiral (2.5D) algebraic molecular descriptors computations.QuBiLS-MAS，一款用于基于原子和键的拓扑（二维）和手性（2.5维）代数分子描述符计算的开源多平台软件。

J Cheminform. 2017 Jun 7;9(1):35. doi: 10.1186/s13321-017-0211-5.

Chemical graphs, molecular matrices and topological indices in chemoinformatics and quantitative structure-activity relationships.化学信息学与定量构效关系中的化学图、分子矩阵和拓扑指数

Curr Comput Aided Drug Des. 2013 Jun;9(2):153-63. doi: 10.2174/1573409911309020002.

A network-based approach to classify the three domains of life.基于网络的方法对生命的三个域进行分类。

Biol Direct. 2011 Oct 13;6:53. doi: 10.1186/1745-6150-6-53.

Hyperbolic relational graph convolution networks plus: a simple but highly efficient QSAR-modeling method.双曲关系图卷积网络加：一种简单但高效的 QSAR 建模方法。

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab112.

A large scale analysis of information-theoretic network complexity measures using chemical structures.利用化学结构对信息论网络复杂性测度进行大规模分析。

PLoS One. 2009 Dec 15;4(12):e8057. doi: 10.1371/journal.pone.0008057.

Structural discrimination of networks by using distance, degree and eigenvalue-based measures.基于距离、度和特征值的网络结构歧视。

PLoS One. 2012;7(7):e38564. doi: 10.1371/journal.pone.0038564. Epub 2012 Jul 6.

A novel representation of RNA secondary structure based on element-contact graphs.一种基于元素接触图的RNA二级结构新表示法。

BMC Bioinformatics. 2008 Apr 11;9:188. doi: 10.1186/1471-2105-9-188.

On topological indices and entropy measures of beryllonitrene network via logarithmic regression model.基于对数回归模型的铍氮烯网络的拓扑指数与熵测度研究

Sci Rep. 2024 Mar 26;14(1):7187. doi: 10.1038/s41598-024-57601-1.

引用本文的文献

Mathematical modeling and estimation of physicochemical characteristics of pneumonia treatment drugs through a novel approach -Banhatti topological descriptors.通过一种新方法——班哈蒂拓扑描述符对肺炎治疗药物的物理化学特性进行数学建模和估计。

Front Chem. 2025 May 2;13:1564809. doi: 10.3389/fchem.2025.1564809. eCollection 2025.

Modeling and estimation of physiochemical properties of cancer drugs using entropy measures.利用熵测度对癌症药物理化性质进行建模与估计。

Sci Rep. 2025 Feb 8;15(1):4785. doi: 10.1038/s41598-025-87755-5.

A QSPR analysis of physical properties of antituberculosis drugs using neighbourhood degree-based topological indices and support vector regression.基于邻域度拓扑指数和支持向量回归的抗结核药物物理性质的定量构效关系分析

Heliyon. 2024 Mar 20;10(7):e28260. doi: 10.1016/j.heliyon.2024.e28260. eCollection 2024 Apr 15.

Predicting prediction: A systematic workflow to analyze factors affecting the classification performance in genomic biomarker discovery.预测预测：分析影响基因组生物标志物发现分类性能因素的系统工作流程。

PLoS One. 2022 Nov 9;17(11):e0276607. doi: 10.1371/journal.pone.0276607. eCollection 2022.

A Decomposition Method for Global Evaluation of Shannon Entropy and Local Estimations of Algorithmic Complexity.一种用于香农熵全局评估和算法复杂度局部估计的分解方法。

Entropy (Basel). 2018 Aug 15;20(8):605. doi: 10.3390/e20080605.

Mutual information model for link prediction in heterogeneous complex networks.异构复杂网络链路预测的互信息模型。

Sci Rep. 2017 Mar 27;7:44981. doi: 10.1038/srep44981.

The discrimination power of structural SuperIndices.结构超指数的判别能力。

PLoS One. 2013 Jul 25;8(7):e70551. doi: 10.1371/journal.pone.0070551. Print 2013.

RMol: a toolset for transforming SD/Molfile structure information into R objects.RMol：一个用于将SD/Molfile结构信息转换为R对象的工具集。

Source Code Biol Med. 2012 Nov 14;7(1):12. doi: 10.1186/1751-0473-7-12.

Information indices with high discriminative power for graphs.具有高区分能力的图信息指标。

PLoS One. 2012;7(2):e31214. doi: 10.1371/journal.pone.0031214. Epub 2012 Feb 29.

Profiling the human response to physical exercise: a computational strategy for the identification and kinetic analysis of metabolic biomarkers.剖析人类对体育锻炼的反应：一种用于识别和动力学分析代谢生物标志物的计算策略。

J Clin Bioinforma. 2011 Dec 19;1(1):34. doi: 10.1186/2043-9113-1-34.

本文引用的文献

Structural determination of paraffin boiling points.石蜡沸点的结构测定

J Am Chem Soc. 1947 Jan;69(1):17-20. doi: 10.1021/ja01193a005.

Benchmark data set for in silico prediction of Ames mutagenicity.用于计算机模拟预测埃姆斯致突变性的基准数据集。

J Chem Inf Model. 2009 Sep;49(9):2077-81. doi: 10.1021/ci900161g.

On entropy-based molecular descriptors: statistical analysis of real and synthetic chemical structures.基于熵的分子描述符：真实和合成化学结构的统计分析。

J Chem Inf Model. 2009 Jul;49(7):1655-63. doi: 10.1021/ci900060x.

Information processing in the transcriptional regulatory network of yeast: functional robustness.酵母转录调控网络中的信息处理：功能稳健性。

BMC Syst Biol. 2009 Mar 19;3:35. doi: 10.1186/1752-0509-3-35.

Phylogenetic distances are encoded in networks of interacting pathways.系统发生距离编码于相互作用通路的网络中。

Bioinformatics. 2008 Nov 15;24(22):2579-85. doi: 10.1093/bioinformatics/btn503. Epub 2008 Sep 26.

Fitting a geometric graph to a protein-protein interaction network.将几何图拟合到蛋白质-蛋白质相互作用网络。

Bioinformatics. 2008 Apr 15;24(8):1093-9. doi: 10.1093/bioinformatics/btn079. Epub 2008 Mar 14.

Pybel: a Python wrapper for the OpenBabel cheminformatics toolkit.Pybel：用于OpenBabel化学信息学工具包的Python包装器。

Chem Cent J. 2008 Mar 9;2:5. doi: 10.1186/1752-153X-2-5.

A probabilistic approach to classifying metabolic stability.一种用于代谢稳定性分类的概率方法。

J Chem Inf Model. 2008 Apr;48(4):785-96. doi: 10.1021/ci700142c. Epub 2008 Mar 8.

Structural information content of networks: graph entropy based on local vertex functionals.网络的结构信息内容：基于局部顶点泛函的图熵

Comput Biol Chem. 2008 Apr;32(2):131-8. doi: 10.1016/j.compbiolchem.2007.09.007. Epub 2007 Sep 29.

Graphs in molecular biology.分子生物学中的图表。

BMC Bioinformatics. 2007 Sep 27;8 Suppl 6(Suppl 6):S8. doi: 10.1186/1471-2105-8-S6-S8.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于分析生物网络的新型拓扑描述符。

Novel topological descriptors for analyzing biological networks.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献