Bar Ilan University, Ramat Gan, Israel.
Dept. of Psychiatry, Weil Cornell Medical College, New York, NY 10065, USA.
J Affect Disord. 2024 Jan 15;345:24-31. doi: 10.1016/j.jad.2023.10.098. Epub 2023 Oct 20.
Mitigating rating inconsistency can improve measurement fidelity and detection of treatment response.
The International Society for CNS Clinical Trials and Methodology convened an expert Working Group that developed logical consistency (LC) checks for ratings of the Young Mania Rating Scale (YMRS), which is widely used in studies of mood and bipolar disorders. LC and statistical outlier-response pattern checks (SC) were applied to 63,228 YMRS administrations from 14 clinical trials evaluating treatments for bipolar disorder. Checks were also applied to Monte Carlo-simulated data as a proxy for their use under conditions of inconsistency.
42 LC flags were developed, and four SC flags were created from the data set (n = 14). Almost 20 % of the rating administrations had at least one LC flag, 6.7 % had two or more, 1.7 % had three or more; 17.3 % percent of the administrations had at least one SC flag and 4.6 % percent had two or more. Overall, 31 % of administrations had at least one flag of any type, 12.1 % had two or more and 5.3 % had three or more. In acute antimanic treatment trials (n = 10) there were more flags of any type compared to relapse prevention trials (n = 4).
Flagged ratings may represent less-common presentations assessed correctly.
Using established methods, we illustrate development and application of consistency flags for YMRS ratings. Applying flags and mitigation during trials may improve the value of YMRS data, help focus attention on rater training, and improve reliability and validity of trial data.
减轻评分不一致性可以提高测量的准确性并发现治疗反应。
国际中枢神经系统临床试验和方法学会召集了一个专家工作组,为广泛用于情绪和双相情感障碍研究的 Young 躁狂评定量表 (YMRS) 的评分制定了逻辑一致性 (LC) 检查。LC 和统计离群反应模式检查 (SC) 应用于 14 项评估双相情感障碍治疗的临床试验中的 63228 次 YMRS 管理。还将检查应用于蒙特卡罗模拟数据,作为在不一致情况下使用的代理。
开发了 42 个 LC 标志,从数据集 (n = 14) 中创建了四个 SC 标志。近 20%的评分管理至少有一个 LC 标志,6.7%有两个或更多,1.7%有三个或更多;17.3%的管理至少有一个 SC 标志,4.6%有两个或更多。总体而言,31%的管理至少有一个标志,12.1%有两个或更多,5.3%有三个或更多。在急性抗躁狂治疗试验(n = 10)中,与复发预防试验(n = 4)相比,任何类型的标志都更多。
标记的评分可能代表评估正确的较少见表现。
使用既定方法,我们说明了 YMRS 评分一致性标志的开发和应用。在试验中应用标志和缓解措施可能会提高 YMRS 数据的价值,有助于关注评分者培训,并提高试验数据的可靠性和有效性。