Suppr超能文献

使用有向距离相关系数构建稳健的基因共表达网络。

Robust gene coexpression networks using signed distance correlation.

机构信息

Department of Statistics, University of Oxford, Oxford, UK.

Department of Plant Sciences, University of Oxford, Oxford, UK.

出版信息

Bioinformatics. 2021 Aug 4;37(14):1982–1989. doi: 10.1093/bioinformatics/btab041. Epub 2021 Feb 1.

Abstract

MOTIVATION

Even within well studied organisms, many genes lack useful functional annotations. One way to generate such functional information is to infer biological relationships between genes/proteins, using a network of gene coexpression data that includes functional annotations. However, the lack of trustworthy functional annotations can impede the validation of such networks. Hence, there is a need for a principled method to construct gene coexpression networks that capture biological information and are structurally stable even in the absence of functional information.

RESULTS

We introduce the concept of signed distance correlation as a measure of dependency between two variables, and apply it to generate gene coexpression networks. Distance correlation offers a more intuitive approach to network construction than commonly used methods such as Pearson correlation and mutual information. We propose a framework to generate self-consistent networks using signed distance correlation purely from gene expression data, with no additional information. We analyse data from three different organisms to illustrate how networks generated with our method are more stable and capture more biological information compared to networks obtained from Pearson correlation or mutual information.

SUPPLEMENTARY INFORMATION

Supplementary Information and code are available at Bioinformatics and https://github.com/javier-pardodiaz/sdcorGCN online.

摘要

动机

即使在研究充分的生物体内,许多基因仍缺乏有用的功能注释。一种生成此类功能信息的方法是利用包含功能注释的基因共表达数据网络,推断基因/蛋白质之间的生物学关系。然而,缺乏可靠的功能注释会阻碍此类网络的验证。因此,需要一种有原则的方法来构建基因共表达网络,这些网络能够捕捉生物信息,并且在缺乏功能信息的情况下结构也保持稳定。

结果

我们介绍了符号距离相关系数的概念,将其作为衡量两个变量之间依赖性的度量,并将其应用于生成基因共表达网络。距离相关系数为网络构建提供了一种比常用方法(如皮尔逊相关系数和互信息)更直观的方法。我们提出了一个框架,该框架可以使用符号距离相关系数从基因表达数据中生成自洽的网络,而无需其他信息。我们分析了来自三个不同生物体的数据,说明了与使用皮尔逊相关系数或互信息获得的网络相比,我们的方法生成的网络更加稳定且能捕获更多的生物学信息。

补充信息

补充信息和代码可在 Bioinformatics 和 https://github.com/javier-pardodiaz/sdcorGCN 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1102/8557847/c32d9e5e46a8/btab041f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验