• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从基因表达数据中高效挖掘有判别力的共聚类

Efficient Mining of Discriminative Co-clusters from Gene Expression Data.

作者信息

Odibat Omar, Reddy Chandan K

机构信息

Department of Computer Science, Wayne State University, Detroit, MI, 48202.

出版信息

Knowl Inf Syst. 2014 Dec;41(3):667-696. doi: 10.1007/s10115-013-0684-0.

DOI:10.1007/s10115-013-0684-0
PMID:25642010
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4308820/
Abstract

Discriminative models are used to analyze the differences between two classes and to identify class-specific patterns. Most of the existing discriminative models depend on using the entire feature space to compute the discriminative patterns for each class. Co-clustering has been proposed to capture the patterns that are correlated in a subset of features, but it cannot handle discriminative patterns in labeled datasets. In certain biological applications such as gene expression analysis, it is critical to consider the discriminative patterns that are correlated only in a subset of the feature space. The objective of this paper is two-fold: first, it presents an algorithm to efficiently find arbitrarily positioned co-clusters from complex data. Second, it extends this co-clustering algorithm to discover discriminative co-clusters by incorporating the class information into the co-cluster search process. In addition, we also characterize the discriminative co-clusters and propose three novel measures that can be used to evaluate the performance of any discriminative subspace pattern mining algorithm. We evaluated the proposed algorithms on several synthetic and real gene expression datasets, and our experimental results showed that the proposed algorithms outperformed several existing algorithms available in the literature.

摘要

判别模型用于分析两类之间的差异并识别特定类别的模式。大多数现有的判别模型依赖于使用整个特征空间来计算每个类别的判别模式。协同聚类已被提出用于捕获在特征子集中相关的模式,但它无法处理标记数据集中的判别模式。在某些生物应用中,如基因表达分析,考虑仅在特征空间子集中相关的判别模式至关重要。本文的目标有两个:首先,提出一种算法,用于从复杂数据中高效地找到任意位置的协同聚类。其次,通过将类信息纳入协同聚类搜索过程,扩展此协同聚类算法以发现判别协同聚类。此外,我们还对判别协同聚类进行了表征,并提出了三种新颖的度量,可用于评估任何判别子空间模式挖掘算法的性能。我们在几个合成和真实的基因表达数据集上评估了所提出的算法,实验结果表明所提出的算法优于文献中现有的几种算法。

相似文献

1
Efficient Mining of Discriminative Co-clusters from Gene Expression Data.从基因表达数据中高效挖掘有判别力的共聚类
Knowl Inf Syst. 2014 Dec;41(3):667-696. doi: 10.1007/s10115-013-0684-0.
2
Noise-robust unsupervised spike sorting based on discriminative subspace learning with outlier handling.基于具有异常值处理的判别子空间学习的抗噪声无监督尖峰排序。
J Neural Eng. 2017 Jun;14(3):036003. doi: 10.1088/1741-2552/aa6089. Epub 2017 Feb 15.
3
Discriminative sparse subspace learning and its application to unsupervised feature selection.判别式稀疏子空间学习及其在无监督特征选择中的应用。
ISA Trans. 2016 Mar;61:104-118. doi: 10.1016/j.isatra.2015.12.011. Epub 2016 Jan 20.
4
Subspace Weighting Co-Clustering of Gene Expression Data.基于基因表达数据的子空间加权协同聚类。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Mar-Apr;16(2):352-364. doi: 10.1109/TCBB.2017.2705686. Epub 2017 May 18.
5
Efficiently mining time-delayed gene expression patterns.高效挖掘时间延迟基因表达模式。
IEEE Trans Syst Man Cybern B Cybern. 2010 Apr;40(2):400-11. doi: 10.1109/TSMCB.2009.2025564. Epub 2009 Oct 30.
6
Unsupervised fuzzy pattern discovery in gene expression data.基于基因表达数据的无监督模糊模式发现。
BMC Bioinformatics. 2011;12 Suppl 5(Suppl 5):S5. doi: 10.1186/1471-2105-12-S5-S5. Epub 2011 Jul 27.
7
Discriminative Feature Selection for Uncertain Graph Classification.用于不确定图分类的判别特征选择
Proc SIAM Int Conf Data Min. 2013;2013:82-93. doi: 10.1137/1.9781611972832.10.
8
Microarray data mining using landmark gene-guided clustering.使用标志性基因引导聚类的微阵列数据挖掘
BMC Bioinformatics. 2008 Feb 11;9:92. doi: 10.1186/1471-2105-9-92.
9
Integrating biological knowledge based on functional annotations for biclustering of gene expression data.基于功能注释整合生物学知识以进行基因表达数据的双聚类分析。
Comput Methods Programs Biomed. 2015 May;119(3):163-80. doi: 10.1016/j.cmpb.2015.02.010. Epub 2015 Mar 18.
10
Discovering biclusters in gene expression data based on high-dimensional linear geometries.基于高维线性几何在基因表达数据中发现双簇。
BMC Bioinformatics. 2008 Apr 23;9:209. doi: 10.1186/1471-2105-9-209.

引用本文的文献

1
Biclustering data analysis: a comprehensive survey.双聚类数据分析:全面综述。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae342.
2
Methylation differences reveal heterogeneity in preterm pathophysiology: results from bipartite network analyses.甲基化差异揭示早产病理生理学中的异质性:二分网络分析结果
J Perinat Med. 2018 Jul 26;46(5):509-521. doi: 10.1515/jpm-2017-0126.
3
BicNET: Flexible module discovery in large-scale biological networks using biclustering.BicNET:使用双聚类在大规模生物网络中进行灵活的模块发现。
Algorithms Mol Biol. 2016 May 20;11:14. doi: 10.1186/s13015-016-0074-8. eCollection 2016.
4
A composite model for subgroup identification and prediction via bicluster analysis.一种通过双聚类分析进行亚组识别和预测的复合模型。
PLoS One. 2014 Oct 27;9(10):e111318. doi: 10.1371/journal.pone.0111318. eCollection 2014.

本文引用的文献

1
DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach.DeBi:使用频繁项集方法发现差异表达的双聚类
Algorithms Mol Biol. 2011 Jun 23;6(1):18. doi: 10.1186/1748-7188-6-18.
2
From 'differential expression' to 'differential networking' - identification of dysfunctional regulatory networks in diseases.从“差异表达”到“差异网络”——疾病中失调调控网络的识别。
Trends Genet. 2010 Jul;26(7):326-33. doi: 10.1016/j.tig.2010.05.001.
3
Identification of differentially expressed gene modules between two-class DNA microarray data.两类DNA微阵列数据之间差异表达基因模块的识别。
Bioinformation. 2009 Oct 11;4(4):134-7. doi: 10.6026/97320630004134.
4
Subspace differential coexpression analysis: problem definition and a general approach.子空间微分共表达分析:问题定义与通用方法。
Pac Symp Biocomput. 2010:145-56.
5
Coclustering of human cancer microarrays using Minimum Sum-Squared Residue coclustering.使用最小平方残差共聚类法对人类癌症微阵列进行共聚类分析。
IEEE/ACM Trans Comput Biol Bioinform. 2008 Jul-Sep;5(3):385-400. doi: 10.1109/TCBB.2007.70268.
6
TRUST-TECH-based expectation maximization for learning finite mixture models.基于TRUST-TECH的期望最大化算法用于学习有限混合模型
IEEE Trans Pattern Anal Mach Intell. 2008 Jul;30(7):1146-57. doi: 10.1109/TPAMI.2007.70775.
7
Biclustering algorithms for biological data analysis: a survey.用于生物数据分析的双聚类算法:一项综述。
IEEE/ACM Trans Comput Biol Bioinform. 2004 Jan-Mar;1(1):24-45. doi: 10.1109/TCBB.2004.2.
8
A systematic comparison and evaluation of biclustering methods for gene expression data.基因表达数据双聚类方法的系统比较与评估
Bioinformatics. 2006 May 1;22(9):1122-9. doi: 10.1093/bioinformatics/btl060. Epub 2006 Feb 24.
9
Biclustering in gene expression data by tendency.基于趋势的基因表达数据双聚类分析
Proc IEEE Comput Syst Bioinform Conf. 2004:182-93. doi: 10.1109/csb.2004.1332431.
10
Defining transcription modules using large-scale gene expression data.利用大规模基因表达数据定义转录模块。
Bioinformatics. 2004 Sep 1;20(13):1993-2003. doi: 10.1093/bioinformatics/bth166. Epub 2004 Mar 25.