Aggen Steven H, Neale Michael C, Kendler Kenneth S
Department of Psychiatry, Virginia Institute for Psychiatric and Behavioral Genetics, Medical College of Virginia and Virginia Commonwealth University, Richmond, VA 23298-0126, USA.
Psychol Med. 2005 Apr;35(4):475-87. doi: 10.1017/s0033291704003563.
Expert committees of clinicians have chosen diagnostic criteria for psychiatric disorders with little guidance from measurement theory or modern psychometric methods. The DSM-III-R criteria for major depression (MD) are examined to determine the degree to which latent trait item response models can extract additional useful information.
The dimensionality and measurement properties of the 9 DSM-III-R criteria plus duration are evaluated using dichotomous factor analysis and the Rasch and 2 parameter logistic item response models. Quantitative liability scales are compared with a binary DSM-III-R diagnostic algorithm variable to determine the ramifications of using each approach.
Factor and item response model results indicated the 10 MD criteria defined a reasonably coherent unidimensional scale of liability. However, person risk measurement was not optimal. Criteria thresholds were unevenly spaced leaving scale regions poorly measured. Criteria varied in discriminating levels of risk. Compared to a binary MD diagnosis, item response model (IRM) liability scales performed far better in (i) elucidating the relationship between MD symptoms and liability, (ii) predicting the personality trait of neuroticism and future depressive episodes and (iii) more precisely estimating heritability parameters.
Criteria for MD largely defined a single dimension of disease liability although the quality of person risk measurement was less clear. The quantitative item response scales were statistically superior in predicting relevant outcomes and estimating twin model parameters. Item response models that treat symptoms as ordered indicators of risk rather than as counts towards a diagnostic threshold more fully exploit the information available in symptom endorsement data patterns.
临床医生专家委员会在制定精神疾病诊断标准时,几乎没有测量理论或现代心理测量方法的指导。对《精神疾病诊断与统计手册》第三版修订版(DSM-III-R)中重度抑郁症(MD)的标准进行研究,以确定潜在特质项目反应模型能够提取额外有用信息的程度。
使用二分法因子分析、拉施模型和双参数逻辑斯蒂项目反应模型,评估9条DSM-III-R标准加上病程的维度和测量属性。将定量易感性量表与二元DSM-III-R诊断算法变量进行比较,以确定使用每种方法的影响。
因子分析和项目反应模型结果表明,10条MD标准定义了一个相当连贯的单维易感性量表。然而,个体风险测量并不理想。标准阈值间隔不均匀,导致量表区域测量不佳。标准在区分风险水平方面存在差异。与二元MD诊断相比,项目反应模型(IRM)易感性量表在以下方面表现得更好:(i)阐明MD症状与易感性之间的关系;(ii)预测神经质人格特质和未来抑郁发作;(iii)更精确地估计遗传力参数。
MD标准在很大程度上定义了疾病易感性的单一维度,尽管个体风险测量的质量尚不清楚。定量项目反应量表在预测相关结果和估计双生子模型参数方面在统计学上更具优势。将症状视为风险的有序指标而非诊断阈值计数的项目反应模型,能更充分地利用症状认可数据模式中可用的信息。