Interrater reliability issues in multicenter trials, Part I: Theoretical concepts and operational procedures used in Department of Veterans Affairs Cooperative Study #394.

Author information

Tracy K, Adler L A, Rotrosen J, Edson R, Lavori P

Affiliation

Psychiatry Service, VA Medical Center/NYU School of Medicine, NY 10010, USA.

Publication information

Psychopharmacol Bull. 1997;33(1):53-7.

PMID:9133751

Abstract

This article describes a standardized method for establishing and maintaining desired levels of interrater reliability (IRR) in multicenter trials. The procedure involves six steps: distribution of procedural guides, distribution of an introduction tape, initial distribution of patient interviews to rate, training at the study kickoff meeting, ongoing IRR monitoring, and group training throughout the study. This method is being used in a national Veterans Affairs Cooperative Study (CS #394), involving nine sites, to examine the treatment effects of vitamin E on tardive dyskinesia. The six-step standardized process allowed for early detection of areas of concern in assessment administration. When intraclass correlation coefficients (ICCs) were compared at different points in the initial training, reliability improved from 0.68 to 0.74 for the Barnes Akathisia Scale and from 0.54 to 0.87 for the Anchored Brief Psychiatric Rating Scale. After the ratings collected prior to the start of CS #394 had been analyzed, data were collected to conduct the first check on Abnormal Involuntary Movement Scale (AIMS) IRR during enrollment; the estimated ICC for the AIMS had decreased from 0.87 to 0.60. Raters were instructed to re-assess the subjects from the first videotape on the AIMS and received additional training. The re-rating indicated very good reliability (0.84). IRR was measured once for the Global Assessment of Functioning Scale, resulting in an ICC of 0.90. The companion article (Part II: Edson et al. 1997, page 59 of this issue) describes the statistical procedures used to measure IRR.
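The abstract reports ICCs but does not say which ICC form was used; the companion article (Part II) covers the statistical procedures. As a rough illustration only, the sketch below computes one common form, the two-way random-effects, absolute-agreement, single-rater ICC, often written ICC(2,1). The choice of this form, the function name `icc_2_1`, and the small ratings table are all assumptions made for illustration and are not data or methods from CS #394.

```python
from typing import Sequence


def icc_2_1(ratings: Sequence[Sequence[float]]) -> float:
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    `ratings` has one row per rated subject and one column per rater.
    This form is an assumption; the abstract does not name the ICC variant.
    """
    n = len(ratings)       # number of subjects (rows)
    k = len(ratings[0])    # number of raters (columns)
    if n < 2 or k < 2 or any(len(row) != k for row in ratings):
        raise ValueError("need a complete table with >= 2 subjects and >= 2 raters")

    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares (one observation per cell).
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)   # between subjects
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)   # between raters
    ss_error = ss_total - ss_rows - ss_cols                  # residual

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    denom = ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    if denom == 0:
        raise ValueError("ICC is undefined when the ratings have no variance")
    return (ms_rows - ms_error) / denom


# Illustrative ratings only: 6 subjects (rows) scored by 4 raters (columns).
example_ratings = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]

if __name__ == "__main__":
    print(f"ICC(2,1) = {icc_2_1(example_ratings):.2f}")
```

Because this form penalizes systematic differences between raters as well as random disagreement, a rater who scores consistently higher or lower than colleagues lowers the coefficient. That is the kind of drift ongoing IRR monitoring is meant to catch.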

Similar articles

Interrater reliability issues in multicenter trials, Part II: Statistical procedures used in Department of Veterans Affairs Cooperative Study #394.

Psychopharmacol Bull. 1997;33(1):59-67.

An interrater reliability study of the Braden scale in two nursing homes.

Int J Nurs Stud. 2008 Oct;45(10):1501-11. doi: 10.1016/j.ijnurstu.2008.02.007.

[Assessment of clinical assessment reliability among researchers of a multicenter clinical trial].

Actas Luso Esp Neurol Psiquiatr Cienc Afines. 1998 Nov-Dec;26(6):358-62.

Assessment of the reliability of data collected for the Department of Veterans Affairs national surgical quality improvement program.

J Am Coll Surg. 2007 Apr;204(4):550-60. doi: 10.1016/j.jamcollsurg.2007.01.012. Epub 2007 Mar 2.

A comparison of face-to-face and remote assessment of inter-rater reliability on the Hamilton Depression Rating Scale via videoconferencing.

Psychiatry Res. 2008 Feb 28;158(1):99-103. doi: 10.1016/j.psychres.2007.06.025. Epub 2007 Oct 24.

Discrete state analysis for interpretation of data from clinical trials.

Med Care. 2004 Feb;42(2):183-96. doi: 10.1097/01.mlr.0000108748.13206.ba.

Assuring interrater reliability for the UPDRS motor section: utility of the UPDRS teaching tape.

Mov Disord. 2004 Dec;19(12):1453-6. doi: 10.1002/mds.20220.

Lessons learned from independent central review.

Eur J Cancer. 2009 Jan;45(2):268-74. doi: 10.1016/j.ejca.2008.10.031.

Sources of unreliability in depression ratings.

J Clin Psychopharmacol. 2009 Feb;29(1):82-5. doi: 10.1097/JCP.0b013e318192e4d7.

Cited by

Vitamin E for antipsychotic-induced tardive dyskinesia.

Cochrane Database Syst Rev. 2018 Jan 17;1(1):CD000209. doi: 10.1002/14651858.CD000209.pub3.

Neurocognitive impairment in the deficit subtype of schizophrenia.

Eur Arch Psychiatry Clin Neurosci. 2016 Aug;266(5):397-407. doi: 10.1007/s00406-015-0629-6. Epub 2015 Aug 11.

Antipsychotics and amotivation.

Neuropsychopharmacology. 2015 May;40(6):1539-48. doi: 10.1038/npp.2015.3. Epub 2015 Jan 8.

Clinical validity of a dimensional assessment of self- and interpersonal functioning in adolescent inpatients.

J Pers Assess. 2015;97(1):3-12. doi: 10.1080/00223891.2014.930744. Epub 2014 Jul 10.

Accuracy of self-report, biological tests, collateral reports and clinician ratings in identifying substance use disorders among adults with schizophrenia.

Psychol Addict Behav. 2013 Sep;27(3):774-87. doi: 10.1037/a0031256. Epub 2012 Dec 31.

The life situation of people with persistent mental illness visiting day centers: a comparative study.

Community Ment Health J. 2012 Oct;48(5):592-7. doi: 10.1007/s10597-011-9410-0. Epub 2011 May 10.

Antipsychotic-induced vacuous chewing movements and extrapyramidal side effects are highly heritable in mice.

Pharmacogenomics J. 2012 Apr;12(2):147-55. doi: 10.1038/tpj.2010.82. Epub 2010 Nov 16.

Use of nationwide outcomes monitoring data to compare clinical outcomes in specialized mental health programs and general psychiatric clinics in the Veterans Health Administration.

Psychiatr Q. 2006 Summer;77(2):151-72. doi: 10.1007/s11126-006-9004-0.

The European Schizophrenia Cohort (EuroSC): a naturalistic prognostic and economic study.

Soc Psychiatry Psychiatr Epidemiol. 2005 Sep;40(9):707-17. doi: 10.1007/s00127-005-0955-5.
