• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用机器学习技术进行临床试验设计和加速患者入组流程的新型模型。

A Novel Model Using ML Techniques for Clinical Trial Design and Expedited Patient Onboarding Process.

作者信息

Iyer Abhirvey, Narayanaswami Sundaravalli

机构信息

Department of Pharmaceutical Engineering & Technology, Indian Institute of Technology (BHU) Varanasi, Varanasi, Uttar Pradesh, India.

Public Systems Group, Indian Institute of Management Ahmedabad, Ahmedabad, Gujarat, India.

出版信息

Clinicoecon Outcomes Res. 2025 Jan 16;17:1-18. doi: 10.2147/CEOR.S479603. eCollection 2025.

DOI:10.2147/CEOR.S479603
PMID:39839913
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11745069/
Abstract

INTRODUCTION

Clinical trials are critical for drug development and patient care; however, they often need more efficient trial design and patient enrolment processes. This research explores integrating machine learning (ML) techniques to address these challenges. Specifically, the study investigates ML models for two critical aspects: (1) streamlining clinical trial design parameters (like the site of drug action, type of Interventional/Observational model, etc) and (2) optimizing patient/volunteer enrolment for trials through efficient classification techniques.

METHODS

The study utilized two datasets: the first, with 55,000 samples (from ClinicalTrials.gov), was divided into five subsets (10,000-15,000 rows each) for model evaluation, focusing on trial parameter optimization. The second dataset targeted patient eligibility classification (from the UCI ML Repository). Five ML models-XGBoost, Random Forest, Support Vector Classifier (SVC), Logistic Regression, and Decision Tree-were applied to both datasets, alongside Artificial Neural Networks (ANN) for the second dataset. Model performance was evaluated using precision, recall, balanced accuracy, ROC-AUC, and weighted F1-score, with results averaged across k-fold cross-validation.

RESULTS

In the first phase, XGBoost and Random Forest emerged as the best-performing models across all five subsets, achieving an average balanced accuracy of 0.71 and an average ROC-AUC of 0.7. The second dataset analysis revealed that while SVC and ANN performed well, ANN was preferred for its scalability to larger datasets. ANN achieved a test accuracy of 0.73714, demonstrating its potential for real-world implementation in patient streamlining.

DISCUSSION

The study highlights the effectiveness of ML models in improving clinical trial workflows. XGBoost and Random Forest demonstrated robust performance for large clinical datasets in optimizing trial parameters, while ANN proved advantageous for patient eligibility classification due to its scalability. These findings underscore the potential of ML to enhance decision-making, reduce delays, and improve the accuracy of clinical trial outcomes. As ML technology continues to evolve, its integration into clinical research could drive innovation and improve patient care.

摘要

引言

临床试验对于药物研发和患者护理至关重要;然而,它们往往需要更高效的试验设计和患者招募流程。本研究探索整合机器学习(ML)技术来应对这些挑战。具体而言,该研究针对两个关键方面调查ML模型:(1)简化临床试验设计参数(如药物作用部位、介入/观察模型类型等),以及(2)通过高效分类技术优化试验的患者/志愿者招募。

方法

该研究使用了两个数据集:第一个数据集有55000个样本(来自ClinicalTrials.gov),被分为五个子集(每个子集10000 - 15000行)用于模型评估,重点是试验参数优化。第二个数据集用于患者资格分类(来自UCI机器学习库)。五个ML模型——XGBoost、随机森林、支持向量分类器(SVC)、逻辑回归和决策树——被应用于这两个数据集,第二个数据集还使用了人工神经网络(ANN)。使用精确率、召回率、平衡准确率、ROC-AUC和加权F1分数评估模型性能,结果是k折交叉验证的平均值。

结果

在第一阶段,XGBoost和随机森林在所有五个子集中表现最佳,平均平衡准确率达到0.71,平均ROC-AUC为0.7。对第二个数据集的分析表明,虽然SVC和ANN表现良好,但由于ANN对更大数据集的可扩展性,它更受青睐。ANN的测试准确率达到0.73714,证明了其在患者精简方面在实际应用中的潜力。

讨论

该研究突出了ML模型在改善临床试验工作流程方面的有效性。XGBoost和随机森林在优化试验参数方面对大型临床数据集表现出强大性能,而ANN因其可扩展性在患者资格分类方面被证明具有优势。这些发现强调了ML在增强决策、减少延迟和提高临床试验结果准确性方面的潜力。随着ML技术不断发展,将其整合到临床研究中可以推动创新并改善患者护理。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/d2f9ada916e3/CEOR-17-1-g0016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/b8dedfb21707/CEOR-17-1-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/e05506fdf00c/CEOR-17-1-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/3ed395b19023/CEOR-17-1-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/71f9a151398d/CEOR-17-1-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/cc1acf9837ad/CEOR-17-1-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/33c8fce1380a/CEOR-17-1-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/373619945366/CEOR-17-1-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/7822716930e4/CEOR-17-1-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/53cca2461abd/CEOR-17-1-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/b737c9c09e89/CEOR-17-1-g0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/1e4da81c4ea8/CEOR-17-1-g0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/1e08dd4a5cdc/CEOR-17-1-g0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/26c7a75f8c31/CEOR-17-1-g0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/8ade0d106611/CEOR-17-1-g0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/1ef4257fb5d1/CEOR-17-1-g0015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/d2f9ada916e3/CEOR-17-1-g0016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/b8dedfb21707/CEOR-17-1-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/e05506fdf00c/CEOR-17-1-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/3ed395b19023/CEOR-17-1-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/71f9a151398d/CEOR-17-1-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/cc1acf9837ad/CEOR-17-1-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/33c8fce1380a/CEOR-17-1-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/373619945366/CEOR-17-1-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/7822716930e4/CEOR-17-1-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/53cca2461abd/CEOR-17-1-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/b737c9c09e89/CEOR-17-1-g0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/1e4da81c4ea8/CEOR-17-1-g0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/1e08dd4a5cdc/CEOR-17-1-g0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/26c7a75f8c31/CEOR-17-1-g0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/8ade0d106611/CEOR-17-1-g0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/1ef4257fb5d1/CEOR-17-1-g0015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b1b/11745069/d2f9ada916e3/CEOR-17-1-g0016.jpg

相似文献

1
A Novel Model Using ML Techniques for Clinical Trial Design and Expedited Patient Onboarding Process.一种使用机器学习技术进行临床试验设计和加速患者入组流程的新型模型。
Clinicoecon Outcomes Res. 2025 Jan 16;17:1-18. doi: 10.2147/CEOR.S479603. eCollection 2025.
2
Revolutionizing cardiovascular disease classification through machine learning and statistical methods.通过机器学习和统计方法革新心血管疾病分类。
J Biopharm Stat. 2024 Nov 24:1-23. doi: 10.1080/10543406.2024.2429524.
3
Enhancing diagnostic accuracy in symptom-based health checkers: a comprehensive machine learning approach with clinical vignettes and benchmarking.提高基于症状的健康检查中的诊断准确性:一种结合临床病例和基准测试的综合机器学习方法。
Front Artif Intell. 2024 Oct 1;7:1397388. doi: 10.3389/frai.2024.1397388. eCollection 2024.
4
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.机器学习算法在(放化疗)治疗结果预测中的应用:分类器的实证比较。
Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.
5
Machine learning discrimination of Gleason scores below GG3 and above GG4 for HSPC patients diagnosis.机器学习对 HSPC 患者诊断中 Gleason 评分低于 GG3 且高于 GG4 的区分。
Sci Rep. 2024 Oct 27;14(1):25641. doi: 10.1038/s41598-024-77033-1.
6
Performance of machine learning models in predicting difficult laryngoscopy in the emergency department: a single-centre retrospective study comparing with conventional regression method.机器学习模型在急诊科预测困难喉镜检查中的性能:一项与传统回归方法比较的单中心回顾性研究
BMC Emerg Med. 2025 Feb 21;25(1):28. doi: 10.1186/s12873-025-01185-0.
7
Machine learning models predict triage levels, massive transfusion protocol activation, and mortality in trauma utilizing patients hemodynamics on admission.机器学习模型利用创伤患者入院时的血流动力学来预测分诊级别、大量输血方案的激活和死亡率。
Comput Biol Med. 2024 Sep;179:108880. doi: 10.1016/j.compbiomed.2024.108880. Epub 2024 Jul 16.
8
A machine learning approach in a monocentric cohort for predicting primary refractory disease in Diffuse Large B-cell lymphoma patients.一项在单中心队列中进行的机器学习方法,用于预测弥漫性大 B 细胞淋巴瘤患者的原发性难治性疾病。
PLoS One. 2024 Oct 1;19(10):e0311261. doi: 10.1371/journal.pone.0311261. eCollection 2024.
9
Enhanced osteoporotic fracture prediction in postmenopausal women using Bayesian optimization of machine learning models with genetic risk score.使用具有遗传风险评分的机器学习模型的贝叶斯优化增强绝经后女性骨质疏松性骨折预测
J Bone Miner Res. 2024 May 2;39(4):462-472. doi: 10.1093/jbmr/zjae025.
10
Artificial intelligence-driven translational medicine: a machine learning framework for predicting disease outcomes and optimizing patient-centric care.人工智能驱动的转化医学:一种用于预测疾病预后和优化以患者为中心的医疗护理的机器学习框架。
J Transl Med. 2025 Mar 10;23(1):302. doi: 10.1186/s12967-025-06308-6.

本文引用的文献

1
Improving clinical trial design using interpretable machine learning based prediction of early trial termination.利用基于可解释机器学习的早期试验终止预测来改进临床试验设计。
Sci Rep. 2023 Jan 4;13(1):121. doi: 10.1038/s41598-023-27416-7.
2
Predictive modeling of clinical trial terminations using feature engineering and embedding learning.使用特征工程和嵌入学习对临床试验终止进行预测建模。
Sci Rep. 2021 Feb 10;11(1):3446. doi: 10.1038/s41598-021-82840-x.
3
Recruitment and retention of participants in clinical studies: Critical issues and challenges.
临床研究中参与者的招募与留存:关键问题与挑战
Perspect Clin Res. 2020 Apr-Jun;11(2):51-53. doi: 10.4103/picr.PICR_6_20. Epub 2020 May 6.
4
Importance of Drug Pharmacokinetics at the Site of Action.药物在作用部位的药代动力学的重要性。
Clin Transl Sci. 2017 May;10(3):133-142. doi: 10.1111/cts.12448. Epub 2017 Feb 3.
5
Key cost drivers of pharmaceutical clinical trials in the United States.美国药物临床试验的关键成本驱动因素。
Clin Trials. 2016 Apr;13(2):117-26. doi: 10.1177/1740774515625964. Epub 2016 Feb 8.
6
Unsuccessful trial accrual and human subjects protections: an empirical analysis of recently closed trials.试验招募失败与人体受试者保护:对近期结束试验的实证分析
Clin Trials. 2015 Feb;12(1):77-83. doi: 10.1177/1740774514558307. Epub 2014 Dec 4.
7
Fifteen common mistakes encountered in clinical research.临床研究中常见的 15 个错误。
J Prosthodont Res. 2011 Jan;55(1):1-6. doi: 10.1016/j.jpor.2010.09.002. Epub 2010 Nov 20.
8
Psychological, behavioral, and immune changes after a psychological intervention: a clinical trial.一项心理干预后的心理、行为及免疫变化:一项临床试验。
J Clin Oncol. 2004 Sep 1;22(17):3570-80. doi: 10.1200/JCO.2004.06.030.
9
An overview of clinical research: the lay of the land.临床研究概述:现状
Lancet. 2002 Jan 5;359(9300):57-61. doi: 10.1016/S0140-6736(02)07283-5.
10
Predicting hospital mortality for patients in the intensive care unit: a comparison of artificial neural networks with logistic regression models.预测重症监护病房患者的医院死亡率:人工神经网络与逻辑回归模型的比较
Crit Care Med. 2001 Feb;29(2):291-6. doi: 10.1097/00003246-200102000-00012.