基因数据探索分析的图排序。

Graph ranking for exploratory gene data analysis.

机构信息

Department of Mathematics, The University of Mississippi, University, MS 38677, USA.

出版信息

BMC Bioinformatics. 2009 Oct 8;10 Suppl 11(Suppl 11):S19. doi: 10.1186/1471-2105-10-S11-S19.

DOI:10.1186/1471-2105-10-S11-S19

PMID:19811684

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3226190/

Abstract

BACKGROUND

Microarray technology has made it possible to simultaneously monitor the expression levels of thousands of genes in a single experiment. However, the large number of genes greatly increases the challenges of analyzing, comprehending and interpreting the resulting mass of data. Selecting a subset of important genes is inevitable to address the challenge. Gene selection has been investigated extensively over the last decade. Most selection procedures, however, are not sufficient for accurate inference of underlying biology, because biological significance does not necessarily have to be statistically significant. Additional biological knowledge needs to be integrated into the gene selection procedure.

RESULTS

We propose a general framework for gene ranking. We construct a bipartite graph from the Gene Ontology (GO) and gene expression data. The graph describes the relationship between genes and their associated molecular functions. Under a species condition, edge weights of the graph are assigned to be gene expression level. Such a graph provides a mathematical means to represent both species-independent and species-dependent biological information. We also develop a new ranking algorithm to analyze the weighted graph via a kernelized spatial depth (KSD) approach. Consequently, the importance of gene and molecular function can be simultaneously ranked by a real-valued measure, KSD, which incorporates the global and local structure of the graph. Over-expressed and under-regulated genes also can be separately ranked.

CONCLUSION

The gene-function bigraph integrates molecular function annotations into gene expression data. The relevance of genes is described in the graph (through a common function). The proposed method provides an exploratory framework for gene data analysis.

摘要

背景

微阵列技术使得在单个实验中同时监测数千个基因的表达水平成为可能。然而，大量的基因大大增加了分析、理解和解释由此产生的大量数据的挑战。选择一组重要的基因是解决这一挑战的必然选择。在过去的十年中，基因选择已经得到了广泛的研究。然而，大多数选择程序都不足以进行准确的生物学推断，因为生物学意义不一定具有统计学意义。需要将额外的生物学知识整合到基因选择过程中。

结果

我们提出了一种通用的基因排序框架。我们从基因本体论（GO）和基因表达数据构建了一个二分图。该图描述了基因与其相关分子功能之间的关系。在物种条件下，图的边权重被分配为基因表达水平。这样的图提供了一种数学方法来表示既不依赖于物种又依赖于物种的生物学信息。我们还开发了一种新的排序算法，通过核空间深度（KSD）方法分析加权图。因此，基因和分子功能的重要性可以通过一个整合了图的全局和局部结构的实值度量 KSD 来同时排序。过表达和下调的基因也可以分别排序。

结论

基因-功能二分图将分子功能注释集成到基因表达数据中。基因的相关性在图中描述（通过共同的功能）。所提出的方法为基因数据分析提供了一个探索性的框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61cd/3226190/cd00c1edd72e/1471-2105-10-S11-S19-1.jpg

相似文献

Graph ranking for exploratory gene data analysis.基因数据探索分析的图排序。

BMC Bioinformatics. 2009 Oct 8;10 Suppl 11(Suppl 11):S19. doi: 10.1186/1471-2105-10-S11-S19.

Improved scoring of functional groups from gene expression data by decorrelating GO graph structure.通过去相关GO图结构从基因表达数据中改进功能组的评分。

Bioinformatics. 2006 Jul 1;22(13):1600-7. doi: 10.1093/bioinformatics/btl140. Epub 2006 Apr 10.

Feature Subset Selection for Cancer Classification Using Weight Local Modularity.基于权重局部模块度的癌症分类特征子集选择

Sci Rep. 2016 Oct 5;6:34759. doi: 10.1038/srep34759.

Cancer module genes ranking using kernelized score functions.基于核化评分函数的癌症模块基因排序。

BMC Bioinformatics. 2012;13 Suppl 14(Suppl 14):S3. doi: 10.1186/1471-2105-13-S14-S3. Epub 2012 Sep 7.

Graph-based analysis and visualization of experimental results with ONDEX.使用ONDEX对实验结果进行基于图形的分析和可视化。

Bioinformatics. 2006 Jun 1;22(11):1383-90. doi: 10.1093/bioinformatics/btl081. Epub 2006 Mar 13.

Functional Categorization of Disease Genes Based on Spectral Graph Theory and Integrated Biological Knowledge.基于谱图理论和综合生物学知识的疾病基因功能分类。

Interdiscip Sci. 2019 Sep;11(3):460-474. doi: 10.1007/s12539-017-0279-7. Epub 2018 Jan 30.

Comparisons of graph-structure clustering methods for gene expression data.基因表达数据的图结构聚类方法比较。

Acta Biochim Biophys Sin (Shanghai). 2006 Jun;38(6):379-84. doi: 10.1111/j.1745-7270.2006.00175.x.

MICRAT: a novel algorithm for inferring gene regulatory networks using time series gene expression data.MICRAT：一种使用时间序列基因表达数据推断基因调控网络的新算法。

BMC Syst Biol. 2018 Dec 14;12(Suppl 7):115. doi: 10.1186/s12918-018-0635-1.

Analysis of a Gibbs sampler method for model-based clustering of gene expression data.一种基于模型的基因表达数据聚类的吉布斯采样器方法分析。

Bioinformatics. 2008 Jan 15;24(2):176-83. doi: 10.1093/bioinformatics/btm562. Epub 2007 Nov 22.

Detecting intergene correlation changes in microarray analysis: a new approach to gene selection.检测微阵列分析中的基因间相关性变化：一种新的基因选择方法。

BMC Bioinformatics. 2009 Jan 15;10:20. doi: 10.1186/1471-2105-10-20.

引用本文的文献

COmic: convolutional kernel networks for interpretable end-to-end learning on (multi-)omics data.漫画：卷积核网络在（多）组学数据上进行可解释的端到端学习。

Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i76-i85. doi: 10.1093/bioinformatics/btad204.

PIMKL: Pathway-Induced Multiple Kernel Learning.PIMKL：基于通路的多核学习。

NPJ Syst Biol Appl. 2019 Mar 5;5:8. doi: 10.1038/s41540-019-0086-3. eCollection 2019.

Biomarker gene signature discovery integrating network knowledge.整合网络知识的生物标志物基因特征发现。

Biology (Basel). 2012 Feb 27;1(1):5-17. doi: 10.3390/biology1010005.

Network and data integration for biomarker signature discovery via network smoothed T-statistics.通过网络平滑 T 统计量进行生物标志物特征发现的网络和数据集成。

PLoS One. 2013 Sep 3;8(9):e73074. doi: 10.1371/journal.pone.0073074. eCollection 2013.

Prognostic gene signatures for patient stratification in breast cancer: accuracy, stability and interpretability of gene selection approaches using prior knowledge on protein-protein interactions.用于乳腺癌患者分层的预后基因特征：利用蛋白质 - 蛋白质相互作用的先验知识选择基因的方法的准确性、稳定性和可解释性。

BMC Bioinformatics. 2012 May 1;13:69. doi: 10.1186/1471-2105-13-69.

Proceedings of the 2010 MidSouth Computational Biology and Bioinformatics Society (MCBIOS) conference.2010年中南计算生物学与生物信息学学会（MCBIOS）会议论文集

BMC Bioinformatics. 2010 Oct 7;11 Suppl 6(Suppl 6):S1. doi: 10.1186/1471-2105-11-S6-S1.

Proceedings of the 2009 MidSouth Computational Biology and Bioinformatics Society (MCBIOS) conference. Introduction.2009年中南计算生物学与生物信息学学会（MCBIOS）会议论文集。引言。

BMC Bioinformatics. 2009 Oct 8;10 Suppl 11(Suppl 11):S1. doi: 10.1186/1471-2105-10-S11-S1.

本文引用的文献

Outlier detection with the kernelized spatial depth function.基于核空间深度函数的异常值检测

IEEE Trans Pattern Anal Mach Intell. 2009 Feb;31(2):288-305. doi: 10.1109/TPAMI.2008.72.

A novel method incorporating gene ontology information for unsupervised clustering and feature selection.一种结合基因本体信息用于无监督聚类和特征选择的新方法。

PLoS One. 2008;3(12):e3860. doi: 10.1371/journal.pone.0003860. Epub 2008 Dec 4.

M-BISON: microarray-based integration of data sources using networks.M-BISON：基于微阵列的使用网络对数据源进行整合

BMC Bioinformatics. 2008 Apr 25;9:214. doi: 10.1186/1471-2105-9-214.

SEGS: search for enriched gene sets in microarray data.SEGS：在微阵列数据中搜索富集的基因集。

J Biomed Inform. 2008 Aug;41(4):588-601. doi: 10.1016/j.jbi.2007.12.001. Epub 2007 Dec 15.

Hybrid huberized support vector machines for microarray classification and gene selection.用于微阵列分类和基因选择的混合胡贝尔化支持向量机

Bioinformatics. 2008 Feb 1;24(3):412-9. doi: 10.1093/bioinformatics/btm579. Epub 2008 Jan 5.

Robust clustering in high dimensional data using statistical depths.使用统计深度对高维数据进行稳健聚类。

BMC Bioinformatics. 2007 Nov 1;8 Suppl 7(Suppl 7):S8. doi: 10.1186/1471-2105-8-S7-S8.

Improving the performance of SVM-RFE to select genes in microarray data.提高 SVM-RFE 在微阵列数据中选择基因的性能。

BMC Bioinformatics. 2006 Sep 6;7 Suppl 2(Suppl 2):S12. doi: 10.1186/1471-2105-7-S2-S12.

Using GOstats to test gene lists for GO term association.使用GOstats测试基因列表与GO术语的关联性。

Bioinformatics. 2007 Jan 15;23(2):257-8. doi: 10.1093/bioinformatics/btl567. Epub 2006 Nov 10.

CGI: a new approach for prioritizing genes by combining gene expression and protein-protein interaction data.CGI：一种通过整合基因表达和蛋白质-蛋白质相互作用数据对基因进行优先级排序的新方法。

Bioinformatics. 2007 Jan 15;23(2):215-21. doi: 10.1093/bioinformatics/btl569. Epub 2006 Nov 10.

Improved scoring of functional groups from gene expression data by decorrelating GO graph structure.通过去相关GO图结构从基因表达数据中改进功能组的评分。

Bioinformatics. 2006 Jul 1;22(13):1600-7. doi: 10.1093/bioinformatics/btl140. Epub 2006 Apr 10.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基因数据探索分析的图排序。

Graph ranking for exploratory gene data analysis.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献