Hein Nicholas, Rantou Elena, Schuette Paul
Department of Biostatistics, University of Nebraska Medical Center , Omaha , NE , USA.
Office of Biostatistics/Office of Translational Sciences/Center for Drug Evaluation and Research, U.S. Food and Drug Administration , Silver Spring , MD , USA.
J Biopharm Stat. 2019;29(5):860-873. doi: 10.1080/10543406.2019.1657134. Epub 2019 Aug 28.
During the past two decades, the number and complexity of clinical trials have risen dramatically increasing the difficulty of choosing sites for inspection. FDA's resources are limited and so sites should be chosen with care. To determine if data mining techniques and/or unsupervised statistical monitoring can assist with the process of identifying potential clinical sites for inspection. Five summary-level clinical site datasets from four new drug applications (NDA) and one biologics license application (BLA), where the FDA had performed or had planned site inspections, were used. The number of sites inspected and the results of the inspections were blinded to the researchers. Five supervised learning models from the previous two years (2016-2017) of an on-going research project were used to predict site inspections results, i.e., No Action Indicated (NAI), Voluntary Action Indicated (VAI), or Official Action Indicated (OAI). Statistical Monitoring Applied to Research Trials (SMART) software for unsupervised statistical monitoring software developed by CluePoints (Mont-Saint-Guibert, Belgium) was utilized to identify atypical centers (via a -value approach) within a study.Finally, Clinical Investigator Site Selection Tool (CISST), developed by the Center for Drug Evaluation and Research (CDER), was used to calculate the total risk of each site thereby providing a framework for site selection. The agreement between the predictions of these methods was compared. The overall accuracy and sensitivity of the methods were graphically compared. Spearman's rank order correlation was used to examine the agreement between the SMART analysis (CluePoints' software) and the CISST analysis. The average aggregated correlation between the -values (SMART) and total risk scores (CISST) for all five studies was 0.21, and range from -0.41 to 0.50. The Random Forest models for 2016 and 2017 showed the highest aggregated mean agreement (65.1%) amongst outcomes (NAI, VAI, OAI) for the three available studies. While there does not appear to be a single most accurate approach, the performance of methods under certain circumstances is discussed later in this paper. Classifier models based on data mining techniques require historical data (i.e., training data) to develop the model. There is a possibility that sites in the five-summary level datasets were included in the training datasets for the models from the previous year's research which could result in spurious confirmation of predictive ability. Additionally, the CISST was utilized in three of the five site selection processes, possibly biasing the data. The agreement between methods was lower than expected and no single method emerged as the most accurate.
在过去二十年中,临床试验的数量和复杂性急剧增加,这使得选择检查地点的难度增大。美国食品药品监督管理局(FDA)的资源有限,因此选择检查地点时应谨慎。目的是确定数据挖掘技术和/或无监督统计监测是否有助于识别潜在的临床检查地点。研究使用了来自四项新药申请(NDA)和一项生物制品许可申请(BLA)的五个汇总级临床地点数据集,这些申请中FDA已进行或计划进行现场检查。检查的地点数量和检查结果对研究人员保密。使用了一个正在进行的研究项目前两年(2016 - 2017年)的五个监督学习模型来预测现场检查结果,即无行动指征(NAI)、自愿行动指征(VAI)或官方行动指征(OAI)。利用由CluePoints(比利时蒙圣吉贝尔)开发的用于无监督统计监测软件的“应用于研究试验的统计监测(SMART)”软件,通过P值方法识别研究中的非典型中心。最后,使用药物评价和研究中心(CDER)开发的临床研究者地点选择工具(CISST)来计算每个地点的总风险,从而提供一个地点选择框架。比较了这些方法预测结果之间的一致性。以图形方式比较了这些方法的总体准确性和敏感性。使用斯皮尔曼等级相关来检验SMART分析(CluePoints软件)和CISST分析之间的一致性。所有五项研究中P值(SMART)与总风险评分(CISST)之间的平均汇总相关性为0.21,范围从 - 0.41到0.50。2016年和2017年的随机森林模型在三项可用研究的结果(NAI、VAI、OAI)中显示出最高的汇总平均一致性(65.1%)。虽然似乎没有一种最准确的方法,但本文后面将讨论某些情况下这些方法 的表现。基于数据挖掘技术的分类器模型需要历史数据(即训练数据)来开发模型。五个汇总级数据集中的地点有可能被纳入上一年研究模型的训练数据集中,这可能导致预测能力的虚假确认。此外,在五个地点选择过程中有三个使用了CISST,可能会使数据产生偏差。方法之间的一致性低于预期,没有一种方法脱颖而出成为最准确的方法。