Beinse Guillaume, Tellier Virgile, Charvet Valentin, Deutsch Eric, Borget Isabelle, Massard Christophe, Hollebecque Antoine, Verlingue Loic
Gustave Roussy Cancer Campus, Villejuif, France.
Université Paris-Saclay, Le Kremlin-Bicêtre, France.
JCO Clin Cancer Inform. 2019 Sep;3:1-10. doi: 10.1200/CCI.19.00023.
Drug development in oncology currently is facing a conjunction of an increasing number of antineoplastic agents (ANAs) candidate for phase I clinical trials (P1CTs) and an important attrition rate for final approval. We aimed to develop a machine learning algorithm (RESOLVED2) to predict drug development outcome, which could support early go/no-go decisions after P1CTs by better selection of drugs suitable for further development.
PubMed abstracts of P1CTs reporting on ANAs were used together with pharmacologic data from the DrugBank5.0 database to model time to US Food and Drug Administration (FDA) approval (FDA approval-free survival) since the first P1CT publication. The RESOLVED2 model was trained with machine learning methods. Its performance was evaluated on an independent test set with weighted concordance index (IPCW).
We identified 462 ANAs from PubMed that matched with DrugBank5.0 (P1CT publication dates 1972 to 2017). Among 1,411 variables, 28 were used by RESOLVED2 to model the FDA approval-free survival, with an IPCW of 0.89 on the independent test set. RESOLVED2 outperformed a model that was based on efficacy/toxicity (IPCW, 0.69). In the test set at 6 years of follow-up, 73% (95% CI, 49% to 86%) of drugs predicted to be approved were approved, whereas 92% (95% CI, 87% to 98%) of drugs predicted to be nonapproved were still not approved (log-rank < .001). A predicted approved drug was 16 times more likely to be approved than a predicted nonapproved drug (hazard ratio, 16.4; 95% CI, 8.40 to 32.2).
As soon as P1CT completion, RESOLVED2 can predict accurately the time to FDA approval. We provide the proof of concept that drug development outcome can be predicted by machine learning strategies.
肿瘤学药物研发目前面临着越来越多进入I期临床试验(P1CTs)的抗肿瘤药物(ANAs)候选药物,以及最终获批时较高的淘汰率。我们旨在开发一种机器学习算法(RESOLVED2)来预测药物研发结果,通过更好地选择适合进一步开发的药物,为P1CTs后的早期“继续/终止”决策提供支持。
报告ANAs的P1CTs的PubMed摘要与DrugBank5.0数据库中的药理学数据一起用于模拟自首次P1CTs发表以来至美国食品药品监督管理局(FDA)批准的时间(无FDA批准生存期)。RESOLVED2模型采用机器学习方法进行训练。其性能在独立测试集上通过加权一致性指数(IPCW)进行评估。
我们从PubMed中识别出462种与DrugBank5.0匹配的ANAs(P1CTs发表日期为1972年至2017年)。在1411个变量中,RESOLVED2使用了28个变量来模拟无FDA批准生存期,在独立测试集上的IPCW为0.89。RESOLVED2的表现优于基于疗效/毒性的模型(IPCW,0.69)。在随访6年的测试集中,预测会获批的药物中有73%(95%CI,49%至86%)获批,而预测不会获批的药物中有92%(95%CI,87%至98%)仍未获批(对数秩检验<0.001)。预测会获批的药物获批的可能性是预测不会获批药物的16倍(风险比,16.4;95%CI,8.40至32.2)。
一旦P1CTs完成,RESOLVED2就能准确预测至FDA批准的时间。我们提供了概念验证,即药物研发结果可以通过机器学习策略进行预测。