• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于K均值聚类分析的新变量加权与选择程序。

A New Variable Weighting and Selection Procedure for K-means Cluster Analysis.

作者信息

Steinley Douglas, Brusco Michael J

机构信息

a University of Missouri-Columbia .

b Florida State University .

出版信息

Multivariate Behav Res. 2008 Jan-Mar;43(1):77-108. doi: 10.1080/00273170701836695.

DOI:10.1080/00273170701836695
PMID:26788973
Abstract

A variance-to-range ratio variable weighting procedure is proposed. We show how this weighting method is theoretically grounded in the inherent variability found in data exhibiting cluster structure. In addition, a variable selection procedure is proposed to operate in conjunction with the variable weighting technique. The performances of these procedures are demonstrated in a simulation study, showing favorable results when compared with existing standardization methods. A detailed demonstration of the weighting and selection procedure is provided for the well-known Fisher Iris data and several synthetic data sets.

摘要

提出了一种方差与范围比变量加权程序。我们展示了这种加权方法如何在理论上基于具有聚类结构的数据中固有的变异性。此外,还提出了一种与变量加权技术协同操作的变量选择程序。在模拟研究中展示了这些程序的性能,与现有标准化方法相比显示出良好的结果。针对著名的费舍尔鸢尾花数据集和几个合成数据集,提供了加权和选择程序的详细演示。

相似文献

1
A New Variable Weighting and Selection Procedure for K-means Cluster Analysis.一种用于K均值聚类分析的新变量加权与选择程序。
Multivariate Behav Res. 2008 Jan-Mar;43(1):77-108. doi: 10.1080/00273170701836695.
2
Part 1. Statistical Learning Methods for the Effects of Multiple Air Pollution Constituents.第1部分. 多种空气污染成分影响的统计学习方法
Res Rep Health Eff Inst. 2015 Jun(183 Pt 1-2):5-50.
3
Automated variable weighting in k-means type clustering.k均值类型聚类中的自动可变加权
IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):657-68. doi: 10.1109/TPAMI.2005.95.
4
Optimization algorithms and weighting factors for analysis of dynamic PET studies.用于动态PET研究分析的优化算法和加权因子。
Phys Med Biol. 2006 Sep 7;51(17):4217-32. doi: 10.1088/0031-9155/51/17/007. Epub 2006 Aug 8.
5
Clustering binary data in the presence of masking variables.在存在屏蔽变量的情况下对二元数据进行聚类。
Psychol Methods. 2004 Dec;9(4):510-23. doi: 10.1037/1082-989X.9.4.510.
6
Principal Cluster Axes: A Projection Pursuit Index for the Preservation of Cluster Structures in the Presence of Data Reduction.主聚类轴:一种在数据约简情况下用于保留聚类结构的投影寻踪指标。
Multivariate Behav Res. 2012 Jun 18;47(3):463-92. doi: 10.1080/00273171.2012.673952.
7
Analysis of clustered matched-pair data.聚类匹配对数据的分析。
Stat Med. 2003 Aug 15;22(15):2417-28. doi: 10.1002/sim.1438.
8
[Introduction to an individual-based standardization method -- propensity score weighting].[基于个体的标准化方法——倾向得分加权介绍]
Zhonghua Liu Xing Bing Xue Za Zhi. 2010 Feb;31(2):223-6.
9
Penalized and weighted K-means for clustering with scattered objects and prior information in high-throughput biological data.用于高通量生物数据中具有分散对象和先验信息的聚类的惩罚加权K均值算法
Bioinformatics. 2007 Sep 1;23(17):2247-55. doi: 10.1093/bioinformatics/btm320. Epub 2007 Jun 27.
10
Simultaneous Two-Way Clustering of Multiple Correspondence Analysis.多重对应分析的同步双向聚类
Multivariate Behav Res. 2010 Jan 29;45(1):186-208. doi: 10.1080/00273170903504893.

引用本文的文献

1
COVID-19 related conspiracy beliefs and their relationship with defense strategies, emotions, powerlessness, attitudes, and time perspective.与新冠病毒相关的阴谋论信念及其与防御策略、情绪、无力感、态度和时间观念的关系。
Front Psychol. 2022 Oct 11;13:939615. doi: 10.3389/fpsyg.2022.939615. eCollection 2022.
2
Simultaneous clustering and variable selection: A novel algorithm and model selection procedure.同时聚类和变量选择:一种新算法和模型选择过程。
Behav Res Methods. 2023 Aug;55(5):2157-2174. doi: 10.3758/s13428-022-01795-7. Epub 2022 Sep 9.
3
Determining the spatial distribution of environmental and socio-economic suitability for human leptospirosis in the face of limited epidemiological data.
在有限的流行病学数据下,确定人类钩端螺旋体病的环境和社会经济适宜性的空间分布。
Infect Dis Poverty. 2022 Aug 4;11(1):86. doi: 10.1186/s40249-022-01010-x.
4
Research on routing and scheduling algorithms for the simultaneous transmission of diverse data streaming services on the industrial internet.工业互联网上多种数据流服务同时传输的路由与调度算法研究
Sci Rep. 2021 Sep 15;11(1):18351. doi: 10.1038/s41598-021-97613-9.
5
Person-centered approaches in the study of personality disorders.人格障碍研究中的以患者为中心的方法。
Personal Disord. 2017 Oct;8(4):288-297. doi: 10.1037/per0000212.
6
A comparison of latent class, K-means, and K-median methods for clustering dichotomous data.比较潜在类别、K-均值和 K-中位数方法在二分类数据聚类中的应用。
Psychol Methods. 2017 Sep;22(3):563-580. doi: 10.1037/met0000095. Epub 2016 Sep 8.
7
Local Optima in Mixture Modeling.混合建模中的局部最优解。
Multivariate Behav Res. 2016 Jul-Aug;51(4):466-81. doi: 10.1080/00273171.2016.1160359.
8
Principal Cluster Axes: A Projection Pursuit Index for the Preservation of Cluster Structures in the Presence of Data Reduction.主聚类轴:一种在数据约简情况下用于保留聚类结构的投影寻踪指标。
Multivariate Behav Res. 2012 Jun 18;47(3):463-92. doi: 10.1080/00273171.2012.673952.
9
Sex differences in the latent class structure of alcohol use disorder: Does (dis)aggregation of indicators matter?酒精使用障碍潜在类别结构中的性别差异:指标的(去)聚合重要吗?
Exp Clin Psychopharmacol. 2015 Aug;23(4):291-301. doi: 10.1037/pha0000029.
10
Converging identities: dimensions of acculturation and personal identity status among immigrant college students.融合的身份认同:移民大学生的文化适应维度和个人身份认同状态。
Cultur Divers Ethnic Minor Psychol. 2013 Apr;19(2):155-65. doi: 10.1037/a0030753. Epub 2012 Nov 12.