• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多视图多表示共识聚类集成(mmcc)方法的增强型网络搜索结果聚类模型。

Enhancing web search result clustering model based on multiview multirepresentation consensus cluster ensemble (mmcc) approach.

作者信息

Sabah Ali, Tiun Sabrina, Sani Nor Samsiah, Ayob Masri, Taha Adil Yaseen

机构信息

Center for Artificial Intelligence Technology (CAIT), Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia.

出版信息

PLoS One. 2021 Jan 15;16(1):e0245264. doi: 10.1371/journal.pone.0245264. eCollection 2021.

DOI:10.1371/journal.pone.0245264
PMID:33449949
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7810326/
Abstract

Existing text clustering methods utilize only one representation at a time (single view), whereas multiple views can represent documents. The multiview multirepresentation method enhances clustering quality. Moreover, existing clustering methods that utilize more than one representation at a time (multiview) use representation with the same nature. Hence, using multiple views that represent data in a different representation with clustering methods is reasonable to create a diverse set of candidate clustering solutions. On this basis, an effective dynamic clustering method must consider combining multiple views of data including semantic view, lexical view (word weighting), and topic view as well as the number of clusters. The main goal of this study is to develop a new method that can improve the performance of web search result clustering (WSRC). An enhanced multiview multirepresentation consensus clustering ensemble (MMCC) method is proposed to create a set of diverse candidate solutions and select a high-quality overlapping cluster. The overlapping clusters are obtained from the candidate solutions created by different clustering methods. The framework to develop the proposed MMCC includes numerous stages: (1) acquiring the standard datasets (MORESQUE and Open Directory Project-239), which are used to validate search result clustering algorithms, (2) preprocessing the dataset, (3) applying multiview multirepresentation clustering models, (4) using the radius-based cluster number estimation algorithm, and (5) employing the consensus clustering ensemble method. Results show an improvement in clustering methods when multiview multirepresentation is used. More importantly, the proposed MMCC model improves the overall performance of WSRC compared with all single-view clustering models.

摘要

现有的文本聚类方法一次仅使用一种表示形式(单视图),而多个视图可以表示文档。多视图多表示方法可提高聚类质量。此外,现有的一次使用多种表示形式(多视图)的聚类方法使用的是性质相同的表示形式。因此,将以不同表示形式表示数据的多个视图与聚类方法结合使用,以创建一组多样化的候选聚类解决方案是合理的。在此基础上,一种有效的动态聚类方法必须考虑结合数据的多个视图,包括语义视图、词汇视图(词加权)和主题视图以及聚类数量。本研究的主要目标是开发一种能够提高网络搜索结果聚类(WSRC)性能的新方法。提出了一种增强的多视图多表示共识聚类集成(MMCC)方法,以创建一组多样化的候选解决方案并选择高质量的重叠聚类。重叠聚类是从由不同聚类方法创建的候选解决方案中获得的。开发所提出的MMCC的框架包括多个阶段:(1)获取标准数据集(MORESQUE和开放目录项目 - 239),用于验证搜索结果聚类算法,(2)对数据集进行预处理,(3)应用多视图多表示聚类模型,(4)使用基于半径的聚类数量估计算法,以及(5)采用共识聚类集成方法。结果表明,使用多视图多表示时聚类方法有改进。更重要的是,与所有单视图聚类模型相比,所提出的MMCC模型提高了WSRC的整体性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/423d61206022/pone.0245264.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/e88591ebdd7f/pone.0245264.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/b39cdb50f446/pone.0245264.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/8e29ebed3ef2/pone.0245264.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/fecc26f1b3b8/pone.0245264.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/4eb85694179f/pone.0245264.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/a391d02fb80e/pone.0245264.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/423d61206022/pone.0245264.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/e88591ebdd7f/pone.0245264.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/b39cdb50f446/pone.0245264.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/8e29ebed3ef2/pone.0245264.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/fecc26f1b3b8/pone.0245264.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/4eb85694179f/pone.0245264.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/a391d02fb80e/pone.0245264.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3ab/7810326/423d61206022/pone.0245264.g007.jpg

相似文献

1
Enhancing web search result clustering model based on multiview multirepresentation consensus cluster ensemble (mmcc) approach.基于多视图多表示共识聚类集成(mmcc)方法的增强型网络搜索结果聚类模型。
PLoS One. 2021 Jan 15;16(1):e0245264. doi: 10.1371/journal.pone.0245264. eCollection 2021.
2
VH: View Variation and View Heredity for Incomplete Multiview Clustering.VH:用于不完全多视图聚类的视图变化与视图遗传
IEEE Trans Artif Intell. 2021 Jan 18;1(3):233-247. doi: 10.1109/TAI.2021.3052425. eCollection 2020 Dec.
3
Consensus Kernel -Means Clustering for Incomplete Multiview Data.一致性核 - 均值聚类算法在不完全多视图数据中的应用。
Comput Intell Neurosci. 2017;2017:3961718. doi: 10.1155/2017/3961718. Epub 2017 Oct 22.
4
Partition level multiview subspace clustering.分区级多视图子空间聚类。
Neural Netw. 2020 Feb;122:279-288. doi: 10.1016/j.neunet.2019.10.010. Epub 2019 Nov 6.
5
Cross-View Representation Learning-Based Deep Multiview Clustering With Adaptive Graph Constraint.基于交叉视图表示学习的具有自适应图约束的深度多视图聚类
IEEE Trans Neural Netw Learn Syst. 2024 Sep 4;PP. doi: 10.1109/TNNLS.2024.3447006.
6
Multiple view clustering using a weighted combination of exemplar-based mixture models.使用基于样本的混合模型加权组合的多视图聚类
IEEE Trans Neural Netw. 2010 Dec;21(12):1925-38. doi: 10.1109/TNN.2010.2081999. Epub 2010 Oct 7.
7
Clustering Ensemble Based on Hybrid Multiview Clustering.基于混合多视图聚类的聚类集成。
IEEE Trans Cybern. 2022 Jul;52(7):6518-6530. doi: 10.1109/TCYB.2020.3034157. Epub 2022 Jul 4.
8
Multiview Consensus Structure Discovery.多视图共识结构发现。
IEEE Trans Cybern. 2022 May;52(5):3469-3482. doi: 10.1109/TCYB.2020.3013136. Epub 2022 May 19.
9
Incomplete Multiview Clustering via Late Fusion.基于后期融合的不完全多视图聚类。
Comput Intell Neurosci. 2018 Oct 1;2018:6148456. doi: 10.1155/2018/6148456. eCollection 2018.
10
Marginalized Multiview Ensemble Clustering.边缘化多视图集成聚类
IEEE Trans Neural Netw Learn Syst. 2020 Feb;31(2):600-611. doi: 10.1109/TNNLS.2019.2906867. Epub 2019 Apr 15.

引用本文的文献

1
Crosstalk of lactate metabolism-related subtypes, establishment of a prognostic signature and immune infiltration characteristics in colon adenocarcinoma.结肠腺癌中乳酸代谢相关亚型的相互作用、预后特征的建立及免疫浸润特征
Sci Rep. 2025 Apr 26;15(1):14599. doi: 10.1038/s41598-025-98735-0.
2
A novel machine learning-driven immunogenic cell death signature for predicting ovarian cancer prognosis.一种用于预测卵巢癌预后的新型机器学习驱动的免疫原性细胞死亡特征。
Transl Cancer Res. 2025 Feb 28;14(2):1359-1374. doi: 10.21037/tcr-2025-118. Epub 2025 Feb 26.
3
The interplay between angiogenesis-associated genes and molecular, clinical, and immune features in bladder cancer.

本文引用的文献

1
Multi-View Cluster Analysis with Incomplete Data to Understand Treatment Effects.用于理解治疗效果的具有不完整数据的多视图聚类分析。
Inf Sci (N Y). 2019 Aug;494:278-293. doi: 10.1016/j.ins.2019.04.039. Epub 2019 Apr 22.
2
BioWordVec, improving biomedical word embeddings with subword information and MeSH.BioWordVec,利用子词信息和 MeSH 改进生物医学词向量。
Sci Data. 2019 May 10;6(1):52. doi: 10.1038/s41597-019-0055-0.
3
EDISON-WMW: Exact Dynamic Programing Solution of the Wilcoxon-Mann-Whitney Test.爱迪生 - 威尔科克森 - 曼 - 惠特尼检验的精确动态规划解决方案
膀胱癌中血管生成相关基因与分子、临床及免疫特征之间的相互作用。
Discov Oncol. 2025 Mar 5;16(1):265. doi: 10.1007/s12672-025-01966-w.
4
Mechanistic insights into PROS1 inhibition of bladder cancer progression and angiogenesis via the AKT/GSK3β/β-catenin pathway.关于PROS1通过AKT/GSK3β/β-连环蛋白途径抑制膀胱癌进展和血管生成的机制性见解。
Sci Rep. 2025 Feb 8;15(1):4748. doi: 10.1038/s41598-025-89217-4.
5
The complement C3a/C3aR pathway is associated with treatment resistance to gemcitabine-based neoadjuvant therapy in pancreatic cancer.补体C3a/C3aR途径与胰腺癌中基于吉西他滨的新辅助治疗的耐药性相关。
Comput Struct Biotechnol J. 2024 Oct 5;23:3634-3650. doi: 10.1016/j.csbj.2024.09.032. eCollection 2024 Dec.
6
Vasculogenic mimicry-related gene prognostic index for predicting prognosis, immune microenvironment in clear cell renal cell carcinoma.用于预测透明细胞肾细胞癌预后及免疫微环境的血管生成拟态相关基因预后指数
Heliyon. 2024 Aug 14;10(16):e36235. doi: 10.1016/j.heliyon.2024.e36235. eCollection 2024 Aug 30.
7
A multi-view representation technique based on principal component analysis for enhanced short text clustering.基于主成分分析的多视图表示技术增强短文本聚类。
PLoS One. 2024 Aug 23;19(8):e0309206. doi: 10.1371/journal.pone.0309206. eCollection 2024.
8
A novel angiogenesis-associated risk score predicts prognosis and characterizes the tumor microenvironment in colon cancer.一种新型血管生成相关风险评分可预测结肠癌的预后并描绘其肿瘤微环境特征。
Transl Cancer Res. 2024 May 31;13(5):2094-2107. doi: 10.21037/tcr-23-2048. Epub 2024 May 28.
9
Identification of an angiogenesis-related risk score model for survival prediction and immunosubtype screening in multiple myeloma.鉴定多发性骨髓瘤生存预测和免疫亚型筛选的血管生成相关风险评分模型。
Aging (Albany NY). 2024 Feb 5;16(3):2657-2678. doi: 10.18632/aging.205502.
10
Machine learning-derived identification of prognostic signature for improving prognosis and drug response in patients with ovarian cancer.机器学习衍生的卵巢癌患者预后和药物反应的预后特征识别。
J Cell Mol Med. 2024 Jan;28(1):e18021. doi: 10.1111/jcmm.18021. Epub 2023 Nov 23.
Genomics Proteomics Bioinformatics. 2016 Feb;14(1):55-61. doi: 10.1016/j.gpb.2015.11.004. Epub 2016 Jan 29.
4
Locally Embedding Autoencoders: A Semi-Supervised Manifold Learning Approach of Document Representation.局部嵌入自动编码器:一种文档表示的半监督流形学习方法。
PLoS One. 2016 Jan 19;11(1):e0146672. doi: 10.1371/journal.pone.0146672. eCollection 2016.
5
Paradigm of tunable clustering using Binarization of Consensus Partition Matrices (Bi-CoPaM) for gene discovery.使用共识分区矩阵二值化(Bi-CoPaM)进行可调聚类的范例,用于基因发现。
PLoS One. 2013;8(2):e56432. doi: 10.1371/journal.pone.0056432. Epub 2013 Feb 11.