Department of Psychiatry and Behavioral Sciences, The Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA.
Psychol Med. 2012 Mar;42(3):657-67. doi: 10.1017/S0033291711001632. Epub 2011 Aug 24.
Studies have criticized the low level of agreement between the various methods of personality disorder (PD) assessment. This is an important issue for research and clinical purposes.
Seven hundred and forty-two participants in the Hopkins Epidemiology of Personality Disorders Study (HEPS) were assessed on two occasions using the Personality Disorder Schedule (PDS) and the International Personality Disorder Examination (IPDE). The concordance between the two diagnostic methods for all DSM-IV PDs was assessed using standard methods and also two item response analytic approaches designed to take account of measurement error: a latent trait-based approach and a generalized estimating equations (GEE)-based approach, with post-hoc adjustment.
Raw criteria counts, using the intraclass correlation coefficient (ICC), κ and odds ratio (OR), showed poor concordance. The more refined statistical methods showed a moderate to moderately high level of concordance between the methods for most PDs studied. Overall, the PDS produced lower prevalences of traits but higher precision of measurement than the IPDE. Specific criteria within each PD showed varying endorsement thresholds and precision for ascertaining the disorder.
Concordance in the raw measurement of the individual PD criteria between the two clinical methods is lacking. However, based on two statistical methods that adjust for differential endorsement thresholds and measurement error in the assessments, we deduce that the PD constructs themselves can be measured with a moderate degree of confidence regardless of the clinical approach used. This may suggest that the individual criteria for each PD are, in and of themselves, less specific for diagnosis, but as a group the criteria for each PD usefully identify specific PD constructs.
研究批评了各种人格障碍(PD)评估方法之间的一致性水平较低。这是研究和临床目的的一个重要问题。
霍普金斯人格障碍流行病学研究(HEPS)的 742 名参与者在两次使用人格障碍量表(PDS)和国际人格障碍检查(IPDE)时进行了评估。使用标准方法和两种旨在考虑测量误差的项目反应分析方法(基于潜在特质的方法和广义估计方程(GEE)方法)评估了两种诊断方法对所有 DSM-IV PD 的一致性,并进行了事后调整。
使用组内相关系数(ICC)、κ和优势比(OR)进行原始标准计数,显示一致性较差。更精细的统计方法显示,在研究的大多数 PD 中,两种方法之间具有中度至中度高度的一致性。总体而言,PDS 产生的特质患病率较低,但 IPDE 的测量精度较高。每个 PD 中的特定标准显示出不同的确认阈值和用于确定该疾病的精度。
两种临床方法之间个体 PD 标准的原始测量一致性不足。然而,基于两种调整评估中差异确认阈值和测量误差的统计方法,我们推断无论使用哪种临床方法,PD 结构本身都可以以中等程度的置信度进行测量。这可能表明,每个 PD 的个别标准本身对于诊断的特异性较低,但作为一个整体,每个 PD 的标准都可以有效地识别特定的 PD 结构。