临床试验中用于药物效应检测的总分模型的效能与一类错误的比较。

Comparison of the power and type 1 error of total score models for drug effect detection in clinical trials.

作者信息

Haem Elham, Karlsson Mats O, Ueckert Sebastian

机构信息

Department of Biostatistics, School of Medicine, Shiraz University of Medical Sciences, Shiraz, Iran.

Pharmacometrics Research Group, Department of Pharmacy, Uppsala University, Uppsala, Sweden.

出版信息

J Pharmacokinet Pharmacodyn. 2024 Dec 10;52(1):4. doi: 10.1007/s10928-024-09949-0.

DOI:10.1007/s10928-024-09949-0

PMID:39656313

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11632077/

Abstract

Composite scale data consists of numerous categorical questions/items that are often summed as a total score and are commonly utilized as primary endpoints in clinical trials. These endpoints are conceptually discrete and constrained by nature. Item response theory (IRT) is a powerful approach for detecting drug effects in composite scale data from clinical trials, but estimating all parameters requires a large sample size and all item information, which may not be available. Therefore, total score models are often utilized. The most popular total score models are continuous variable (CV) models, but this strategy establishes assumptions that go against the integer nature, and typically also the bounded nature, of data. Bounded integer (BI) and Coarsened grid (CG) models respect the nature of the data. However, their power to detect drug effects has not been as thoroughly studied in clinical trials. When an IRT model is accessible, IRT-informed models (I-BI and I-CV) are promising methods in which the mean and variability of the total score at any position are extracted from the existing IRT model. In this study, total score data were simulated from the MDS-UPDRS motor subscale. Then, the power, type 1 error, and treatment effect bias of six total score models for detecting drug effects in clinical trials were explored. Further, it was investigated how the power, type 1 of error, and treatment effect bias for the I-BI and I-CV models were affected by mis-specified item information from the IRT model. The I-BI model demonstrated the highest statistical power, maintained an acceptable Type I error rate, and exhibited minimal bias, approaching zero. Following that, the I-CV, BI, and CG with Czado transformation (CG_Czado) models provided the maximum power. However, the CG_Czado model had inflated type 1 error under low sample size scenarios in each arm of clinical trials. The CG model among total score models displayed the lowest power and the most inflated type 1 error. Therefore, the results favor the I-BI model when an IRT model is available; otherwise, the BI model.

摘要

综合量表数据由众多分类问题/条目组成，这些问题/条目通常被汇总为一个总分，并在临床试验中普遍用作主要终点。这些终点在概念上是离散的，并且受其性质的限制。项目反应理论（IRT）是一种用于检测来自临床试验的综合量表数据中药物效应的强大方法，但估计所有参数需要大样本量和所有项目信息，而这些信息可能无法获得。因此，总分模型经常被使用。最流行的总分模型是连续变量（CV）模型，但这种策略建立的假设与数据的整数性质以及通常的有界性质相悖。有界整数（BI）模型和粗化网格（CG）模型尊重数据的性质。然而，它们在临床试验中检测药物效应的能力尚未得到充分研究。当IRT模型可用时，基于IRT的模型（I-BI和I-CV）是很有前景的方法，其中总分在任何位置的均值和变异性是从现有的IRT模型中提取的。在本研究中，总分数据是从MDS-UPDRS运动子量表模拟而来的。然后，探讨了六种总分模型在临床试验中检测药物效应的效能（power）、一类错误和治疗效果偏差。此外，还研究了IRT模型中错误指定的项目信息如何影响I-BI和I-CV模型的效能、一类错误和治疗效果偏差。I-BI模型显示出最高的统计效能，保持了可接受的一类错误率，并且偏差最小，接近零。其次，I-CV、BI和采用Czado变换的CG（CG_Czado）模型提供了最大效能。然而，在临床试验各臂的低样本量情况下，CG_Czado模型的一类错误有所膨胀。总分模型中的CG模型显示出最低的效能和最膨胀的一类错误。因此，当IRT模型可用时，结果支持I-BI模型；否则，支持BI模型。

相似文献

Comparison of the power and type 1 error of total score models for drug effect detection in clinical trials.临床试验中用于药物效应检测的总分模型的效能与一类错误的比较。

J Pharmacokinet Pharmacodyn. 2024 Dec 10;52(1):4. doi: 10.1007/s10928-024-09949-0.

Comparison of Precision and Accuracy of Five Methods to Analyse Total Score Data.五种分析总分数据方法的精密度和准确度比较。

AAPS J. 2020 Dec 17;23(1):9. doi: 10.1208/s12248-020-00546-w.

An Item Response Theory-Informed Strategy to Model Total Score Data from Composite Scales.项目反应理论指导的复合量表总分数据建模策略。

AAPS J. 2021 Mar 16;23(3):45. doi: 10.1208/s12248-021-00555-3.

Bayesian item response theory to estimate power in clinical trials with patient-reported outcomes as endpoints.采用贝叶斯项目反应理论估计以患者报告结局为终点的临床试验效能。

Qual Life Res. 2025 Apr;34(4):1113-1124. doi: 10.1007/s11136-024-03874-y. Epub 2025 Jan 8.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Bounded Integer Modeling of Symptom Scales Specific to Lower Urinary Tract Symptoms Secondary to Benign Prostatic Hyperplasia.良性前列腺增生症继发下尿路症状症状量表的有界整数建模。

AAPS J. 2021 Feb 25;23(2):33. doi: 10.1208/s12248-021-00568-y.

The impact of allocation bias on test decisions in clinical trials with multiple endpoints using multiple testing strategies.在使用多种检验策略的多终点临床试验中，分配偏倚对检验决策的影响。

BMC Med Res Methodol. 2024 Sep 30;24(1):223. doi: 10.1186/s12874-024-02335-x.

Estimating power for clinical trials with Patient Reported Outcomes - using Item Response Theory.使用项目反应理论估计患者报告结局临床试验的效能。

J Clin Epidemiol. 2022 Jan;141:141-148. doi: 10.1016/j.jclinepi.2021.10.002. Epub 2021 Oct 11.

Performance of mixed effects models and generalized estimating equations for continuous outcomes in partially clustered trials including both independent and paired data.在部分聚类试验中，包括独立数据和配对数据，混合效应模型和广义估计方程对连续结果的表现。

Stat Med. 2024 Nov 10;43(25):4819-4835. doi: 10.1002/sim.10201. Epub 2024 Sep 4.

Methodological issues regarding power of classical test theory (CTT) and item response theory (IRT)-based approaches for the comparison of patient-reported outcomes in two groups of patients--a simulation study.关于经典测量理论（CTT）和项目反应理论（IRT）方法在两组患者间比较患者报告结局的功效的方法学问题——一项模拟研究。

BMC Med Res Methodol. 2010 Mar 25;10:24. doi: 10.1186/1471-2288-10-24.

本文引用的文献

An Item Response Theory-Informed Strategy to Model Total Score Data from Composite Scales.项目反应理论指导的复合量表总分数据建模策略。

AAPS J. 2021 Mar 16;23(3):45. doi: 10.1208/s12248-021-00555-3.

AAPS J. 2021 Feb 25;23(2):33. doi: 10.1208/s12248-021-00568-y.

Comparison of Precision and Accuracy of Five Methods to Analyse Total Score Data.五种分析总分数据方法的精密度和准确度比较。

AAPS J. 2020 Dec 17;23(1):9. doi: 10.1208/s12248-020-00546-w.

Improved numerical stability for the bounded integer model.提高有界整数模型的数值稳定性。

J Pharmacokinet Pharmacodyn. 2021 Apr;48(2):241-251. doi: 10.1007/s10928-020-09727-8. Epub 2020 Nov 26.

A longitudinal item response model for Aberrant Behavior Checklist (ABC) data from children with autism.针对自闭症儿童异常行为检查表（ABC）数据的纵向项目反应模型。

J Pharmacokinet Pharmacodyn. 2020 Jun;47(3):241-253. doi: 10.1007/s10928-020-09686-0. Epub 2020 Apr 13.

On the Comparison of Methods in Analyzing Bounded Outcome Score Data.《有界结局评分数据分析方法比较》

AAPS J. 2019 Aug 26;21(6):102. doi: 10.1208/s12248-019-0370-6.

A Bounded Integer Model for Rating and Composite Scale Data.用于评分和复合量表数据的有界整数模型。

AAPS J. 2019 Jun 6;21(4):74. doi: 10.1208/s12248-019-0343-9.

A Pharmacometric Analysis of Patient-Reported Outcomes in Breast Cancer Patients Through Item Response Theory.通过项目反应理论分析乳腺癌患者报告结局的药物代谢动力学分析。

Pharm Res. 2018 Apr 19;35(6):122. doi: 10.1007/s11095-018-2403-8.

Item Response Theory as an Efficient Tool to Describe a Heterogeneous Clinical Rating Scale in De Novo Idiopathic Parkinson's Disease Patients.项目反应理论作为一种有效的工具，可用于描述初发性特发性帕金森病患者中具有异质性的临床评分量表。

Pharm Res. 2017 Oct;34(10):2109-2118. doi: 10.1007/s11095-017-2216-1. Epub 2017 Jul 10.

Item Response Theory to Quantify Longitudinal Placebo and Paliperidone Effects on PANSS Scores in Schizophrenia.运用项目反应理论量化纵向安慰剂和帕利哌酮对精神分裂症患者阳性和阴性症状量表（PANSS）评分的影响。

CPT Pharmacometrics Syst Pharmacol. 2017 Aug;6(8):543-551. doi: 10.1002/psp4.12207. Epub 2017 Jul 13.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

临床试验中用于药物效应检测的总分模型的效能与一类错误的比较。

Comparison of the power and type 1 error of total score models for drug effect detection in clinical trials.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献