• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估稀疏计数矩阵得到的聚类解决方案的稳健性。

Assessing the robustness of cluster solutions obtained from sparse count matrices.

机构信息

Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill.

Department of Mathematics, University of North Carolina at Chapel Hill.

出版信息

Psychol Methods. 2019 Dec;24(6):675-689. doi: 10.1037/met0000204. Epub 2019 Feb 11.

DOI:10.1037/met0000204
PMID:30742473
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6689462/
Abstract

Psychological researchers often seek to obtain cluster solutions from sparse count matrices (e.g., social networks; counts of symptoms that are in common for 2 given individuals; structural brain imaging). Increasingly, community detection methods are being used to subset the data in a data-driven manner. While many of these approaches perform well in simulation studies and thus offer some improvement upon traditional clustering approaches, there is no readily available approach for evaluating the robustness of these solutions in empirical data. Researchers have no way of knowing if their results are due to noise. We describe here 2 approaches novel to the field of psychology that enable evaluation of cluster solution robustness. This tutorial also explains the use of an associated R package, perturbR, which provides researchers with the ability to use the methods described herein. In the first approach, the cluster assignment from the original matrix is compared against cluster assignments obtained by randomly perturbing the edges in the matrix. Stable cluster solutions should not demonstrate large changes in the presence of small perturbations. For the second approach, Monte Carlo simulations of random matrices that have the same properties as the original matrix are generated. The distribution of quality scores ("modularity") obtained from the cluster solutions from these matrices are then compared with the score obtained from the original matrix results. From this, one can assess if the results are better than what would be expected by chance. perturbR automates these 2 methods, providing an easy-to-use resource for psychological researchers. We demonstrate the utility of this package using benchmark simulated data generated from a previous study and then apply the methods to publicly available empirical data obtained from social networks and structural neuroimaging. (PsycINFO Database Record (c) 2019 APA, all rights reserved).

摘要

心理研究人员通常试图从稀疏计数矩阵中获得聚类解决方案(例如,社交网络;两个特定个体共有的症状计数;结构脑成像)。越来越多的社区检测方法正被用于以数据驱动的方式对数据进行子集化。虽然这些方法中的许多在模拟研究中表现良好,从而在传统聚类方法上提供了一些改进,但在经验数据中没有现成的方法来评估这些解决方案的稳健性。研究人员无法知道他们的结果是否是由于噪声造成的。我们在这里描述了心理学领域的 2 种新颖方法,这些方法能够评估聚类解决方案的稳健性。本教程还解释了关联的 R 包 perturbR 的使用,该包为研究人员提供了使用本文所述方法的能力。在第一种方法中,将原始矩阵的聚类分配与通过随机扰乱矩阵中的边缘获得的聚类分配进行比较。在存在小扰动的情况下,稳定的聚类解决方案不应表现出大的变化。对于第二种方法,生成与原始矩阵具有相同属性的随机矩阵的蒙特卡罗模拟。然后将从这些矩阵的聚类解决方案获得的质量得分(“模块度”)的分布与从原始矩阵结果获得的得分进行比较。通过这种方式,可以评估结果是否优于随机预期的结果。perturbR 自动化了这 2 种方法,为心理研究人员提供了一个易于使用的资源。我们使用先前研究中生成的基准模拟数据演示了该软件包的实用性,然后将这些方法应用于从社交网络和结构神经影像学获得的公开可用的经验数据。(PsycINFO 数据库记录(c)2019 APA,保留所有权利)。

相似文献

1
Assessing the robustness of cluster solutions obtained from sparse count matrices.评估稀疏计数矩阵得到的聚类解决方案的稳健性。
Psychol Methods. 2019 Dec;24(6):675-689. doi: 10.1037/met0000204. Epub 2019 Feb 11.
2
Uncovering general, shared, and unique temporal patterns in ambulatory assessment data.揭示门诊评估数据中的一般、共享和独特的时间模式。
Psychol Methods. 2019 Feb;24(1):54-69. doi: 10.1037/met0000192. Epub 2018 Aug 20.
3
A tutorial on regularized partial correlation networks.正则化偏相关网络教程。
Psychol Methods. 2018 Dec;23(4):617-634. doi: 10.1037/met0000167. Epub 2018 Mar 29.
4
The big, the bad, and the ugly: Geographic estimation with flawed psychological data.大、坏、丑:有缺陷的心理数据的地理估计。
Psychol Methods. 2020 Aug;25(4):412-429. doi: 10.1037/met0000240. Epub 2019 Oct 24.
5
Deductive data mining.演绎式数据挖掘。
Psychol Methods. 2020 Dec;25(6):691-707. doi: 10.1037/met0000252. Epub 2020 Jan 9.
6
Investigating the performance of exploratory graph analysis and traditional techniques to identify the number of latent factors: A simulation and tutorial.探索性图分析和传统技术识别潜在因素数量的性能研究:模拟与教程。
Psychol Methods. 2020 Jun;25(3):292-320. doi: 10.1037/met0000255. Epub 2020 Mar 19.
7
Statistical power in two-level models: A tutorial based on Monte Carlo simulation.两层模型中的统计功效:基于蒙特卡罗模拟的教程。
Psychol Methods. 2019 Feb;24(1):1-19. doi: 10.1037/met0000195. Epub 2018 Sep 27.
8
A Monte Carlo Evaluation of Weighted Community Detection Algorithms.加权社区检测算法的蒙特卡罗评估
Front Neuroinform. 2016 Nov 10;10:45. doi: 10.3389/fninf.2016.00045. eCollection 2016.
9
Compositional data analysis tutorial.成分数据分析教程。
Psychol Methods. 2024 Apr;29(2):362-378. doi: 10.1037/met0000464. Epub 2022 Jan 31.
10
Spectral redemption in clustering sparse networks.聚类稀疏网络中的谱救赎。
Proc Natl Acad Sci U S A. 2013 Dec 24;110(52):20935-40. doi: 10.1073/pnas.1312486110. Epub 2013 Nov 25.

引用本文的文献

1
Exploring Predictors of Passive Versus Active Suicidal Ideation.探索被动与主动自杀意念的预测因素。
Crisis. 2025 May;46(3):142-148. doi: 10.1027/0227-5910/a000999. Epub 2025 Mar 18.
2
Heterogeneity in suicide risk: Evidence from personalized dynamic models.自杀风险的异质性:来自个性化动态模型的证据。
Behav Res Ther. 2024 Sep;180:104574. doi: 10.1016/j.brat.2024.104574. Epub 2024 May 23.
3
Data-driven connectivity profiles relate to smoking cessation outcomes.数据驱动的连接性概况与戒烟结果相关。
Neuropsychopharmacology. 2024 May;49(6):1007-1013. doi: 10.1038/s41386-024-01802-9. Epub 2024 Jan 27.
4
Characterizing heterogeneity in early adolescent reward networks and individualized associations with behavioral and clinical outcomes.表征青少年早期奖励网络中的异质性以及与行为和临床结果的个体化关联。
Netw Neurosci. 2023 Jun 30;7(2):787-810. doi: 10.1162/netn_a_00306. eCollection 2023.
5
Generalisable long COVID subtypes: findings from the NIH N3C and RECOVER programmes.可泛化的长新冠亚型:来自 NIH N3C 和 RECOVER 项目的发现。
EBioMedicine. 2023 Jan;87:104413. doi: 10.1016/j.ebiom.2022.104413. Epub 2022 Dec 21.
6
Clinical subtyping using community detection: Limited utility?临床亚型分类采用社区检测法:作用有限?
Int J Methods Psychiatr Res. 2023 Jun;32(2):e1951. doi: 10.1002/mpr.1951. Epub 2022 Nov 22.
7
Fifty years of structural equation modeling: A history of generalization, unification, and diffusion.结构方程建模五十年:泛化、统一与传播的历史
Soc Sci Res. 2022 Sep;107:102769. doi: 10.1016/j.ssresearch.2022.102769. Epub 2022 Jul 11.
8
Integrating a functional view on suicide risk into idiographic statistical models.将功能视角整合到个体化统计模型中的自杀风险。
Behav Res Ther. 2022 Mar;150:104012. doi: 10.1016/j.brat.2021.104012. Epub 2021 Nov 30.
9
A Guide for Choosing Community Detection Algorithms in Social Network Studies: The Question Alignment Approach.在社会网络研究中选择社区检测算法的指南:问题对齐方法。
Am J Prev Med. 2020 Oct;59(4):597-605. doi: 10.1016/j.amepre.2020.04.015.
10
Parsing Heterogeneity in Autism Spectrum Disorder and Attention-Deficit/Hyperactivity Disorder with Individual Connectome Mapping.利用个体连接组图谱解析自闭症谱系障碍和注意缺陷多动障碍的异质性。
Brain Connect. 2019 Nov;9(9):673-691. doi: 10.1089/brain.2019.0669.

本文引用的文献

1
Uncovering general, shared, and unique temporal patterns in ambulatory assessment data.揭示门诊评估数据中的一般、共享和独特的时间模式。
Psychol Methods. 2019 Feb;24(1):54-69. doi: 10.1037/met0000192. Epub 2018 Aug 20.
2
A network perspective on comorbid depression in adolescents with obsessive-compulsive disorder.从网络视角看青少年强迫症共病抑郁障碍。
J Anxiety Disord. 2018 Jan;53:1-8. doi: 10.1016/j.janxdis.2017.09.008. Epub 2017 Nov 2.
3
Post-Processing Partitions to Identify Domains of Modularity Optimization.后处理分区以识别模块化优化的领域。
Algorithms. 2017 Sep;10(3). doi: 10.3390/a10030093. Epub 2017 Aug 19.
4
Network Analysis on Attitudes: A Brief Tutorial.态度的网络分析:简要教程
Soc Psychol Personal Sci. 2017 Jul;8(5):528-537. doi: 10.1177/1948550617709827. Epub 2017 Jul 10.
5
Exploratory graph analysis: A new approach for estimating the number of dimensions in psychological research.探索性图形分析:一种估计心理学研究维度数量的新方法。
PLoS One. 2017 Jun 8;12(6):e0174035. doi: 10.1371/journal.pone.0174035. eCollection 2017.
6
Data-Driven Subgroups in Depression Derived from Directed Functional Connectivity Paths at Rest.基于静息态定向功能连接路径的抑郁症数据驱动亚组研究。
Neuropsychopharmacology. 2017 Dec;42(13):2623-2632. doi: 10.1038/npp.2017.97. Epub 2017 May 12.
7
Unsupervised Classification During Time-Series Model Building.时间序列模型构建过程中的无监督分类
Multivariate Behav Res. 2017 Mar-Apr;52(2):129-148. doi: 10.1080/00273171.2016.1256187. Epub 2016 Dec 7.
8
A Monte Carlo Evaluation of Weighted Community Detection Algorithms.加权社区检测算法的蒙特卡罗评估
Front Neuroinform. 2016 Nov 10;10:45. doi: 10.3389/fninf.2016.00045. eCollection 2016.
9
Validating Cluster Analysis: Consistent Replication and Symmetry.验证聚类分析:一致的复制与对称性
Multivariate Behav Res. 2000 Apr 1;35(2):261-85. doi: 10.1207/S15327906MBR3502_5.
10
A Comparison of Cluster Analysis Techniques Withing a Sequential Validation Framework.顺序验证框架内聚类分析技术的比较
Multivariate Behav Res. 1983 Jul 1;18(3):309-29. doi: 10.1207/s15327906mbr1803_4.