Unit of Biostatistics, Epidemiology and Public Health, Department of Cardiac, Thoracic, Vascular Sciences, and Public Health, University of Padova, Via Loredan, 18, 35121 Padova, Italy.
Department of Clinical and Biological Sciences, University of Torino, Via Verdi 8, 10124 Torino, Italy.
Comput Math Methods Med. 2022 Sep 10;2022:4306413. doi: 10.1155/2022/4306413. eCollection 2022.
A critical early step in a clinical trial is defining the study sample that appropriately represents the target population from which the sample will be drawn. Envisaging a "run-in" process in study design may accomplish this task; however, the traditional run-in requires additional patients, increasing times, and costs. The possible use of the available a-priori data could skip the run-in period. In this regard, ML (machine learning) techniques, which have recently shown considerable promising usage in clinical research, can be used to construct individual predictions of therapy response probability conditional on patient characteristics. An ensemble model of ML techniques was trained and validated on twin randomized clinical trials to mimic a run-in process within this framework. An ensemble ML model composed of 26 algorithms was trained on the twin clinical trials. SuperLearner (SL) performance for the Verum (Treatment) arm is above 70% sensitivity. The Positive Predictive Value (PPP) achieves a value of 80%. Results show good performance in the direction of being useful in the simulation of the run-in period; the trials conducted in similar settings can train an optimal patient selection algorithm minimizing the run-in time and costs of conduction.
临床试验的一个关键早期步骤是定义研究样本,使其适当地代表将从中抽取样本的目标人群。在研究设计中设想“预试验”过程可能会完成这项任务;然而,传统的预试验需要额外的患者,增加时间和成本。可用的先验数据的可能使用可以跳过预试验阶段。在这方面,机器学习 (ML) 技术最近在临床研究中显示出了相当大的应用潜力,可以用于根据患者特征构建治疗反应概率的个体预测。在这个框架内,对双胞胎随机临床试验进行了 ML 技术的集成模型训练和验证,以模拟预试验过程。在双胞胎临床试验上训练了一个由 26 种算法组成的集成 ML 模型。SuperLearner (SL) 在 Verum (治疗) 臂上的性能超过了 70%的灵敏度。阳性预测值 (PPP) 达到了 80%。结果表明,在模拟预试验期间有用的方向上表现良好;在类似环境下进行的试验可以训练出最佳的患者选择算法,从而最大限度地减少试验的进行时间和成本。