Suppr超能文献

探索性数据结构比较:基于主成分分析的三种新可视化工具

Exploratory data structure comparisons: three new visual tools based on principal component analysis.

作者信息

Petersen Anne Helby, Markussen Bo, Christensen Karl Bang

机构信息

Department of Public Health, University of Copenhagen, Copenhagen, Denmark.

Department of Mathematical Sciences, University of Copenhagen, Copenhagen, Denmark.

出版信息

J Appl Stat. 2020 May 27;48(9):1675-1695. doi: 10.1080/02664763.2020.1773772. eCollection 2021.

Abstract

Datasets are sometimes divided into distinct subsets, e.g. due to multi-center sampling, or to variations in instruments, questionnaire item ordering or mode of administration, and the data analyst then needs to assess whether a joint analysis is meaningful. The Principal Component Analysis-based Data Structure Comparisons (PCADSC) tools are three new non-parametric, visual diagnostic tools for investigating differences in structure for two subsets of a dataset through covariance matrix comparisons by use of principal component analysis. The PCADCS tools are demonstrated in a data example using European Social Survey data on psychological well-being in three countries, Denmark, Sweden, and Bulgaria. The data structures are found to be different in Denmark and Bulgaria, and thus a comparison of for example mean psychological well-being scores is not meaningful. However, when comparing Denmark and Sweden, very similar data structures, and thus comparable concepts of well-being, are found. Therefore, inter-country comparisons are warranted for these countries.

摘要

数据集有时会被划分为不同的子集,例如由于多中心抽样,或者由于仪器、问卷项目顺序或施测方式的差异,然后数据分析师需要评估联合分析是否有意义。基于主成分分析的数据结构比较(PCADSC)工具是三种新的非参数可视化诊断工具,用于通过使用主成分分析进行协方差矩阵比较来研究数据集中两个子集的结构差异。PCADCS工具在一个数据示例中得到了展示,该示例使用了欧洲社会调查中关于丹麦、瑞典和保加利亚三个国家心理健康的数据。研究发现丹麦和保加利亚的数据结构不同,因此例如比较平均心理健康得分是没有意义的。然而,在比较丹麦和瑞典时,发现它们的数据结构非常相似,因此幸福概念具有可比性。所以,对这些国家进行国家间比较是有必要的。

相似文献

本文引用的文献

2
The Scree Test For The Number Of Factors.因子数量的碎石检验
Multivariate Behav Res. 1966 Apr 1;1(2):245-76. doi: 10.1207/s15327906mbr0102_10.
5
Model-checking techniques based on cumulative residuals.基于累积残差的模型检查技术。
Biometrics. 2002 Mar;58(1):1-12. doi: 10.1111/j.0006-341x.2002.00001.x.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验