• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

子空间微分共表达分析:问题定义与通用方法。

Subspace differential coexpression analysis: problem definition and a general approach.

作者信息

Fang Gang, Kuang Rui, Pandey Gaurav, Steinbach Michael, Myers Chad L, Kumar Vipin

机构信息

Department of Computer Science, University of Minnesota, Twin Cities, 200 Union Street SE, Minneapolis, MN 55455, USA.

出版信息

Pac Symp Biocomput. 2010:145-56.

PMID:19908367
Abstract

In this paper, we study methods to identify differential coexpression patterns in case-control gene expression data. A differential coexpression pattern consists of a set of genes that have substantially different levels of coherence of their expression profiles across the two sample-classes, i.e., highly coherent in one class, but not in the other. Biologically, a differential coexpression patterns may indicate the disruption of a regulatory mechanism possibly caused by disregulation of pathways or mutations of transcription factors. A common feature of all the existing approaches for differential coexpression analysis is that the coexpression of a set of genes is measured on all the samples in each of the two classes, i.e., over the full-space of samples. Hence, these approaches may miss patterns that only cover a subset of samples in each class, i.e., subspace patterns, due to the heterogeneity of the subject population and disease causes. In this paper, we extend differential coexpression analysis by defining a subspace differential coexpression pattern, i.e., a set of genes that are coexpressed in a relatively large percent of samples in one class, but in a much smaller percent of samples in the other class. We propose a general approach based upon association analysis framework that allows exhaustive yet efficient discovery of subspace differential coexpression patterns. This approach can be used to adapt a family of biclustering algorithms to obtain their corresponding differential versions that can directly discover differential coexpression patterns. Using a recently developed biclustering algorithm as illustration, we perform experiments on cancer datasets which demonstrates the existence of subspace differential coexpression patterns. Permutation tests demonstrate the statistical significance for a large number of discovered subspace patterns, many of which can not be discovered if they are measured over all the samples in each of the classes. Interestingly, in our experiments, some discovered subspace patterns have significant overlap with known cancer pathways, and some are enriched with the target gene sets of cancer-related microRNA and transcription factors. The source codes and datasets used in this paper are available at http://vk.cs.umn.edu/SDC/.

摘要

在本文中,我们研究了在病例对照基因表达数据中识别差异共表达模式的方法。差异共表达模式由一组基因组成,这些基因在两个样本类别中的表达谱具有显著不同的连贯水平,即在一个类别中高度连贯,而在另一个类别中则不然。从生物学角度来看,差异共表达模式可能表明调节机制受到破坏,这可能是由通路失调或转录因子突变引起的。所有现有差异共表达分析方法的一个共同特点是,一组基因的共表达是在两个类别中每个类别的所有样本上进行测量的,即在样本的全空间上进行测量。因此,由于研究对象群体和疾病病因的异质性,这些方法可能会错过仅覆盖每个类别中一部分样本的模式,即子空间模式。在本文中,我们通过定义子空间差异共表达模式来扩展差异共表达分析,即一组基因在一个类别中相对较大比例的样本中共表达,但在另一个类别中只有小得多比例的样本中共表达。我们提出了一种基于关联分析框架的通用方法,该方法允许详尽而高效地发现子空间差异共表达模式。这种方法可用于调整一类双聚类算法,以获得其相应的差异版本,从而直接发现差异共表达模式。以最近开发的一种双聚类算法为例,我们在癌症数据集上进行了实验,结果表明存在子空间差异共表达模式。置换检验证明了大量发现模式的统计显著性,如果在每个类别中的所有样本上进行测量,其中许多模式是无法发现的。有趣的是,在我们的实验中,一些发现的子空间模式与已知的癌症通路有显著重叠,并且一些模式富含癌症相关微小RNA和转录因子的靶基因集。本文使用的源代码和数据集可在http://vk.cs.umn.edu/SDC/获取。

相似文献

1
Subspace differential coexpression analysis: problem definition and a general approach.子空间微分共表达分析:问题定义与通用方法。
Pac Symp Biocomput. 2010:145-56.
2
An iterative data mining approach for mining overlapping coexpression patterns in noisy gene expression data.一种用于在嘈杂基因表达数据中挖掘重叠共表达模式的迭代数据挖掘方法。
IEEE Trans Nanobioscience. 2009 Sep;8(3):252-8. doi: 10.1109/TNB.2009.2026747. Epub 2009 Jul 14.
3
A new geometric biclustering algorithm based on the Hough transform for analysis of large-scale microarray data.一种基于霍夫变换的新型几何双聚类算法,用于大规模微阵列数据分析。
J Theor Biol. 2008 Mar 21;251(2):264-74. doi: 10.1016/j.jtbi.2007.11.030. Epub 2007 Dec 4.
4
A graph-based approach to systematically reconstruct human transcriptional regulatory modules.一种基于图形的方法来系统地重建人类转录调控模块。
Bioinformatics. 2007 Jul 1;23(13):i577-86. doi: 10.1093/bioinformatics/btm227.
5
Random forests-based differential analysis of gene sets for gene expression data.基于随机森林的基因表达数据基因集差异分析。
Gene. 2013 Apr 10;518(1):179-86. doi: 10.1016/j.gene.2012.11.034. Epub 2012 Dec 6.
6
Towards patterns tree of gene coexpression in eukaryotic species.迈向真核生物物种中基因共表达模式树
Bioinformatics. 2008 Jun 1;24(11):1367-73. doi: 10.1093/bioinformatics/btn134. Epub 2008 Apr 10.
7
Gene expression analysis in clear cell renal cell carcinoma using gene set enrichment analysis for biostatistical management.基于基因集富集分析的 clear cell 肾细胞癌基因表达分析用于生物统计学管理。
BJU Int. 2011 Jul;108(2 Pt 2):E29-35. doi: 10.1111/j.1464-410X.2010.09794.x. Epub 2011 Mar 16.
8
A systematic comparison and evaluation of biclustering methods for gene expression data.基因表达数据双聚类方法的系统比较与评估
Bioinformatics. 2006 May 1;22(9):1122-9. doi: 10.1093/bioinformatics/btl060. Epub 2006 Feb 24.
9
Graph-based consensus clustering for class discovery from gene expression data.基于图的共识聚类用于从基因表达数据中发现类别
Bioinformatics. 2007 Nov 1;23(21):2888-96. doi: 10.1093/bioinformatics/btm463. Epub 2007 Sep 14.
10
PageRank-based identification of signaling crosstalk from transcriptomics data: the case of Arabidopsis thaliana.基于PageRank从转录组学数据中识别信号转导串扰:以拟南芥为例
Mol Biosyst. 2012 Apr;8(4):1121-7. doi: 10.1039/c2mb05365a. Epub 2012 Feb 10.

引用本文的文献

1
Differential Co-Expression Analyses Allow the Identification of Critical Signalling Pathways Altered during Tumour Transformation and Progression.差异共表达分析可识别肿瘤转化和进展过程中改变的关键信号通路。
Int J Mol Sci. 2020 Dec 12;21(24):9461. doi: 10.3390/ijms21249461.
2
Identifying differentially coexpressed module during HIV disease progression: A multiobjective approach.在 HIV 疾病进展过程中识别差异共表达模块:一种多目标方法。
Sci Rep. 2017 Mar 7;7(1):86. doi: 10.1038/s41598-017-00090-2.
3
Differential correlation for sequencing data.
测序数据的差异相关性
BMC Res Notes. 2017 Jan 19;10(1):54. doi: 10.1186/s13104-016-2331-9.
4
DGCA: A comprehensive R package for Differential Gene Correlation Analysis.DGCA:用于差异基因相关性分析的综合R软件包。
BMC Syst Biol. 2016 Nov 15;10(1):106. doi: 10.1186/s12918-016-0349-1.
5
BicNET: Flexible module discovery in large-scale biological networks using biclustering.BicNET:使用双聚类在大规模生物网络中进行灵活的模块发现。
Algorithms Mol Biol. 2016 May 20;11:14. doi: 10.1186/s13015-016-0074-8. eCollection 2016.
6
Graph-based unsupervised feature selection and multiview clustering for microarray data.基于图的无监督特征选择与微阵列数据的多视图聚类
J Biosci. 2015 Oct;40(4):755-67. doi: 10.1007/s12038-015-9559-8.
7
The discordant method: a novel approach for differential correlation.不一致方法:一种用于差异相关性分析的新方法。
Bioinformatics. 2016 Mar 1;32(5):690-6. doi: 10.1093/bioinformatics/btv633. Epub 2015 Oct 31.
8
Extensive decoupling of metabolic genes in cancer.癌症中代谢基因的广泛解偶联。
PLoS Comput Biol. 2015 May 11;11(5):e1004176. doi: 10.1371/journal.pcbi.1004176. eCollection 2015 May.
9
Efficient Mining of Discriminative Co-clusters from Gene Expression Data.从基因表达数据中高效挖掘有判别力的共聚类
Knowl Inf Syst. 2014 Dec;41(3):667-696. doi: 10.1007/s10115-013-0684-0.
10
Pathway mapping and development of disease-specific biomarkers: protein-based network biomarkers.疾病特异性生物标志物的通路映射与开发:基于蛋白质的网络生物标志物
J Cell Mol Med. 2015 Feb;19(2):297-314. doi: 10.1111/jcmm.12447. Epub 2015 Jan 5.