• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于知识的聚类集成在生物分子数据中的癌症发现。

Knowledge based cluster ensemble for cancer discovery from biomolecular data.

机构信息

School of Computer Science and Engineering, South China University of Technology, Guangzhou, China.

出版信息

IEEE Trans Nanobioscience. 2011 Jun;10(2):76-85. doi: 10.1109/TNB.2011.2144997. Epub 2011 Jul 7.

DOI:10.1109/TNB.2011.2144997
PMID:21742574
Abstract

The adoption of microarray techniques in biological and medical research provides a new way for cancer diagnosis and treatment. In order to perform successful diagnosis and treatment of cancer, discovering and classifying cancer types correctly is essential. Class discovery is one of the most important tasks in cancer classification using biomolecular data. Most of the existing works adopt single clustering algorithms to perform class discovery from biomolecular data. However, single clustering algorithms have limitations, which include a lack of robustness, stability, and accuracy. In this paper, we propose a new cluster ensemble approach called knowledge based cluster ensemble (KCE) which incorporates the prior knowledge of the data sets into the cluster ensemble framework. Specifically, KCE represents the prior knowledge of a data set in the form of pairwise constraints. Then, the spectral clustering algorithm (SC) is adopted to generate a set of clustering solutions. Next, KCE transforms pairwise constraints into confidence factors for these clustering solutions. After that, a consensus matrix is constructed by considering all the clustering solutions and their corresponding confidence factors. The final clustering result is obtained by partitioning the consensus matrix. Comparison with single clustering algorithms and conventional cluster ensemble approaches, knowledge based cluster ensemble approaches are more robust, stable and accurate. The experiments on cancer data sets show that: 1) KCE works well on these data sets; 2) KCE not only outperforms most of the state-of-the-art single clustering algorithms, but also outperforms most of the state-of-the-art cluster ensemble approaches.

摘要

微阵列技术在生物和医学研究中的采用为癌症的诊断和治疗提供了新的方法。为了成功地进行癌症的诊断和治疗,正确地发现和分类癌症类型是至关重要的。分类发现是使用生物分子数据进行癌症分类的最重要任务之一。大多数现有的工作采用单一聚类算法从生物分子数据中执行分类发现。然而,单一聚类算法具有缺乏稳健性、稳定性和准确性的局限性。在本文中,我们提出了一种新的聚类集成方法,称为基于知识的聚类集成(KCE),它将数据集的先验知识纳入聚类集成框架中。具体来说,KCE 以成对约束的形式表示数据集的先验知识。然后,采用谱聚类算法(SC)生成一组聚类解决方案。接下来,KCE 将成对约束转换为这些聚类解决方案的置信因子。之后,通过考虑所有聚类解决方案及其相应的置信因子来构建一致矩阵。最后通过分割一致矩阵得到聚类结果。与单一聚类算法和传统聚类集成方法相比,基于知识的聚类集成方法更加稳健、稳定和准确。在癌症数据集上的实验表明:1)KCE 在这些数据集上表现良好;2)KCE 不仅优于大多数最先进的单一聚类算法,而且优于大多数最先进的聚类集成方法。

相似文献

1
Knowledge based cluster ensemble for cancer discovery from biomolecular data.基于知识的聚类集成在生物分子数据中的癌症发现。
IEEE Trans Nanobioscience. 2011 Jun;10(2):76-85. doi: 10.1109/TNB.2011.2144997. Epub 2011 Jul 7.
2
Hybrid fuzzy cluster ensemble framework for tumor clustering from biomolecular data.用于从生物分子数据中进行肿瘤聚类的混合模糊聚类集成框架。
IEEE/ACM Trans Comput Biol Bioinform. 2013 May-Jun;10(3):657-70. doi: 10.1109/TCBB.2013.59.
3
Double Selection Based Semi-Supervised Clustering Ensemble for Tumor Clustering from Gene Expression Profiles.基于双重选择的半监督聚类集成用于从基因表达谱中进行肿瘤聚类
IEEE/ACM Trans Comput Biol Bioinform. 2014 Jul-Aug;11(4):727-40. doi: 10.1109/TCBB.2014.2315996.
4
SC(3): Triple spectral clustering-based consensus clustering framework for class discovery from cancer gene expression profiles.SC(3):基于三重谱聚类的共识聚类框架,用于从癌症基因表达谱中发现类别。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Nov-Dec;9(6):1751-65. doi: 10.1109/TCBB.2012.108.
5
Graph-based consensus clustering for class discovery from gene expression data.基于图的共识聚类用于从基因表达数据中发现类别
Bioinformatics. 2007 Nov 1;23(21):2888-96. doi: 10.1093/bioinformatics/btm463. Epub 2007 Sep 14.
6
Class discovery from gene expression data based on perturbation and cluster ensemble.基于扰动和聚类集成从基因表达数据中发现类别
IEEE Trans Nanobioscience. 2009 Jun;8(2):147-60. doi: 10.1109/TNB.2009.2023321. Epub 2009 Jun 2.
7
Fuzzy ensemble clustering based on random projections for DNA microarray data analysis.基于随机投影的模糊集成聚类用于DNA微阵列数据分析
Artif Intell Med. 2009 Feb-Mar;45(2-3):173-83. doi: 10.1016/j.artmed.2008.07.014. Epub 2008 Sep 17.
8
Adaptive Fuzzy Consensus Clustering Framework for Clustering Analysis of Cancer Data.用于癌症数据聚类分析的自适应模糊共识聚类框架
IEEE/ACM Trans Comput Biol Bioinform. 2015 Jul-Aug;12(4):887-901. doi: 10.1109/TCBB.2014.2359433.
9
LCE: a link-based cluster ensemble method for improved gene expression data analysis.LCE:一种基于链接的聚类集成方法,用于改进基因表达数据分析。
Bioinformatics. 2010 Jun 15;26(12):1513-9. doi: 10.1093/bioinformatics/btq226. Epub 2010 May 5.
10
A new unsupervised feature ranking method for gene expression data based on consensus affinity.基于一致性亲和力的基因表达数据新无监督特征排序方法
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1257-63. doi: 10.1109/TCBB.2012.34.

引用本文的文献

1
Gene Expression-Assisted Cancer Prediction Techniques.基于基因表达的癌症预测技术。
J Healthc Eng. 2021 Aug 19;2021:4242646. doi: 10.1155/2021/4242646. eCollection 2021.
2
Semi-supervised consensus clustering for gene expression data analysis.基于半监督共识聚类的基因表达数据分析。
BioData Min. 2014 May 8;7:7. doi: 10.1186/1756-0381-7-7. eCollection 2014.