Suppr超能文献

阈值对基因共表达网络拓扑结构的影响。

Effects of threshold on the topology of gene co-expression networks.

作者信息

Couto Cynthia Martins Villar, Comin César Henrique, Costa Luciano da Fontoura

机构信息

São Carlos Institute of Physics, University of São Paulo, PO Box 369, 13560-970, São Carlos, SP, Brazil.

出版信息

Mol Biosyst. 2017 Sep 26;13(10):2024-2035. doi: 10.1039/c7mb00101k.

Abstract

Several developments regarding the analysis of gene co-expression profiles using complex network theory have been reported recently. Such approaches usually start with the construction of an unweighted gene co-expression network, therefore requiring the selection of a suitable threshold defining which pairs of vertices will be connected. We aimed at addressing such an important problem by suggesting and comparing five different approaches for threshold selection. Each of the methods considers a respective biologically-motivated criterion for electing a potentially suitable threshold. A set of 21 microarray experiments from different biological groups was used to investigate the effect of applying the five proposed criteria to several biological situations. For each experiment, we used the Pearson correlation coefficient to measure the relationship between each gene pair, and the resulting weight matrices were thresholded considering several values, generating respective adjacency matrices (co-expression networks). Each of the five proposed criteria was then applied in order to select the respective threshold value. The effects of these thresholding approaches on the topology of the resulting networks were compared by using several measurements, and we verified that, depending on the database, the impact on the topological properties can be large. However, a group of databases was verified to be similarly affected by most of the considered criteria. Based on such results, it can be suggested that when the generated networks present similar measurements, the thresholding method can be chosen with greater freedom. If the generated networks are markedly different, the thresholding method that better suits the interests of each specific research study represents a reasonable choice.

摘要

最近有报道称,在利用复杂网络理论分析基因共表达谱方面有了一些进展。此类方法通常始于构建一个无权基因共表达网络,因此需要选择一个合适的阈值来确定哪些顶点对将被连接。我们旨在通过提出并比较五种不同的阈值选择方法来解决这一重要问题。每种方法都考虑了各自基于生物学动机的标准来选择一个潜在合适的阈值。使用一组来自不同生物组的21个微阵列实验来研究将这五种提出的标准应用于几种生物学情况的效果。对于每个实验,我们使用皮尔逊相关系数来测量每对基因之间的关系,并考虑多个值对所得的权重矩阵进行阈值处理,生成各自的邻接矩阵(共表达网络)。然后应用所提出的五种标准中的每一种来选择各自的阈值。通过使用几种测量方法比较了这些阈值处理方法对所得网络拓扑结构的影响,并且我们验证了,根据数据库的不同,对拓扑性质的影响可能很大。然而,一组数据库被验证受到大多数所考虑标准的类似影响。基于这些结果,可以建议,当生成的网络呈现相似的测量结果时,可以更自由地选择阈值处理方法。如果生成的网络明显不同,那么最适合每个特定研究兴趣的阈值处理方法是一个合理的选择。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验