• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于倒置的复杂聚类聚类方法。

An inversion-based clustering approach for complex clusters.

机构信息

Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran.

Department of Pathology, Microbiology, and Immunology, School Of Veterinary Medicine, University of California, Davis, USA.

出版信息

BMC Res Notes. 2024 May 12;17(1):133. doi: 10.1186/s13104-024-06791-y.

DOI:10.1186/s13104-024-06791-y
PMID:38735941
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11089746/
Abstract

BACKGROUND

The choice of an appropriate similarity measure plays a pivotal role in the effectiveness of clustering algorithms. However, many conventional measures rely solely on feature values to evaluate the similarity between objects to be clustered. Furthermore, the assumption of feature independence, while valid in certain scenarios, does not hold true for all real-world problems. Hence, considering alternative similarity measures that account for inter-dependencies among features can enhance the effectiveness of clustering in various applications.

METHODS

In this paper, we present the Inv measure, a novel similarity measure founded on the concept of inversion. The Inv measure considers the significance of features, the values of all object features, and the feature values of other objects, leading to a comprehensive and precise evaluation of similarity. To assess the performance of our proposed clustering approach that incorporates the Inv measure, we evaluate it on simulated data using the adjusted Rand index.

RESULTS

The simulation results strongly indicate that inversion-based clustering outperforms other methods in scenarios where clusters are complex, i.e., apparently highly overlapped. This showcases the practicality and effectiveness of the proposed approach, making it a valuable choice for applications that involve complex clusters across various domains.

CONCLUSIONS

The inversion-based clustering approach may hold significant value in the healthcare industry, offering possible benefits in tasks like hospital ranking, treatment improvement, and high-risk patient identification. In social media analysis, it may prove valuable for trend detection, sentiment analysis, and user profiling. E-commerce may be able to utilize the approach for product recommendation and customer segmentation. The manufacturing sector may benefit from improved quality control, process optimization, and predictive maintenance. Additionally, the approach may be applied to traffic management and fleet optimization in the transportation domain. Its versatility and effectiveness make it a promising solution for diverse fields, providing valuable insights and optimization opportunities for complex and dynamic data analysis tasks.

摘要

背景

在聚类算法的有效性中,选择适当的相似度度量起着关键作用。然而,许多传统的度量方法仅依赖于特征值来评估要聚类的对象之间的相似性。此外,特征独立性的假设虽然在某些场景下是有效的,但并不适用于所有真实世界的问题。因此,考虑考虑特征之间相互依赖的替代相似性度量可以提高聚类在各种应用中的效果。

方法

在本文中,我们提出了 Inv 度量,这是一种基于反转概念的新相似性度量。Inv 度量考虑了特征的重要性、所有对象特征的值以及其他对象的特征值,从而对相似性进行了全面而精确的评估。为了评估我们提出的聚类方法的性能,我们在模拟数据上使用调整后的兰德指数进行了评估。

结果

模拟结果强烈表明,基于反转的聚类在聚类复杂的情况下表现优于其他方法,即明显高度重叠的情况下。这展示了所提出方法的实用性和有效性,使其成为涉及各种领域复杂聚类的应用的有价值的选择。

结论

基于反转的聚类方法在医疗保健行业可能具有重要价值,在医院排名、治疗改进和高危患者识别等任务中可能具有潜在优势。在社交媒体分析中,它可能对趋势检测、情感分析和用户画像有用。电子商务可能能够利用该方法进行产品推荐和客户细分。制造业可能受益于改进的质量控制、过程优化和预测性维护。此外,该方法可应用于交通领域的交通管理和车队优化。其多功能性和有效性使其成为各种领域的有前途的解决方案,为复杂和动态数据分析任务提供有价值的见解和优化机会。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7946/11089746/ea5a5683526e/13104_2024_6791_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7946/11089746/8a3a725bf31b/13104_2024_6791_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7946/11089746/3f1fe3d468eb/13104_2024_6791_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7946/11089746/ea5a5683526e/13104_2024_6791_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7946/11089746/8a3a725bf31b/13104_2024_6791_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7946/11089746/3f1fe3d468eb/13104_2024_6791_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7946/11089746/ea5a5683526e/13104_2024_6791_Fig3_HTML.jpg

相似文献

1
An inversion-based clustering approach for complex clusters.基于倒置的复杂聚类聚类方法。
BMC Res Notes. 2024 May 12;17(1):133. doi: 10.1186/s13104-024-06791-y.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Cross-over between discrete and continuous protein structure space: insights into automatic classification and networks of protein structures.离散与连续蛋白质结构空间之间的交叉:对蛋白质结构自动分类及网络的见解。
PLoS Comput Biol. 2009 Mar;5(3):e1000331. doi: 10.1371/journal.pcbi.1000331. Epub 2009 Mar 27.
4
Genetic Programming for Evolving Similarity Functions for Clustering: Representations and Analysis.遗传编程用于聚类的相似性函数进化:表示和分析。
Evol Comput. 2020 Winter;28(4):531-561. doi: 10.1162/evco_a_00264. Epub 2019 Oct 10.
5
Understanding and enhancement of internal clustering validation measures.理解和增强内部聚类验证措施。
IEEE Trans Cybern. 2013 Jun;43(3):982-94. doi: 10.1109/TSMCB.2012.2220543. Epub 2012 Oct 26.
6
Metric for measuring the effectiveness of clustering of DNA microarray expression.用于测量 DNA 微阵列表达聚类有效性的度量。
BMC Bioinformatics. 2006 Sep 6;7 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2105-7-S2-S5.
7
A methodology to assess the intrinsic discriminative ability of a distance function and its interplay with clustering algorithms for microarray data analysis.一种评估距离函数内在判别能力及其与聚类算法相互作用的方法,用于微阵列数据分析。
BMC Bioinformatics. 2013;14 Suppl 1(Suppl 1):S6. doi: 10.1186/1471-2105-14-S1-S6. Epub 2013 Jan 14.
8
A mutual neighbor-based clustering method and its medical applications.基于互邻居的聚类方法及其医学应用。
Comput Biol Med. 2022 Nov;150:106184. doi: 10.1016/j.compbiomed.2022.106184. Epub 2022 Oct 12.
9
clusterBMA: Bayesian model averaging for clustering.聚类 BMA:用于聚类的贝叶斯模型平均。
PLoS One. 2023 Aug 21;18(8):e0288000. doi: 10.1371/journal.pone.0288000. eCollection 2023.
10
Implementation and evaluation of a multivariate abstraction-based, interval-based dynamic time-warping method as a similarity measure for longitudinal medical records.基于多元抽象和区间的动态时间规整方法的实现和评估,作为一种用于纵向医疗记录的相似性度量方法。
J Biomed Inform. 2021 Nov;123:103919. doi: 10.1016/j.jbi.2021.103919. Epub 2021 Oct 8.

本文引用的文献

1
A Comparison Study on Similarity and Dissimilarity Measures in Clustering Continuous Data.连续数据聚类中相似性和相异性度量的比较研究
PLoS One. 2015 Dec 11;10(12):e0144059. doi: 10.1371/journal.pone.0144059. eCollection 2015.
2
Comparison of nonbinary similarity coefficients for similarity searching, clustering and compound selection.用于相似性搜索、聚类和化合物选择的非二元相似系数比较。
J Chem Inf Model. 2009 May;49(5):1193-201. doi: 10.1021/ci8004644.
3
Survey of clustering algorithms.聚类算法综述
IEEE Trans Neural Netw. 2005 May;16(3):645-78. doi: 10.1109/TNN.2005.845141.