

Exploration of Using an Open-Source Large Language Model for Analyzing Trial Information: A Case Study of Clinical Trials With Decentralized Elements.

Authors

Huh Ki Young, Song Ildae, Kim Yoonjin, Park Jiyeon, Ryu Hyunwook, Koh JaeEun, Yu Kyung-Sang, Kim Kyung Hwan, Lee SeungHwan

Affiliations

Department of Clinical Pharmacology and Therapeutics, Seoul National University College of Medicine and Hospital, Seoul, Republic of Korea.

Department of Pharmaceutical Science and Technology, Kyungsung University, Busan, Republic of Korea.

Publication information

Clin Transl Sci. 2025 Mar;18(3):e70183. doi: 10.1111/cts.70183.

DOI: 10.1111/cts.70183
PMID: 40025837
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11873368/
Abstract

Despite interest in clinical trials with decentralized elements (DCTs), analysis of their trends in trial registries is lacking because designs are heterogeneous and terms are not standardized. We explored Llama 3, an open-source large language model, as a way to evaluate these trends efficiently. Trial data were sourced from the Aggregate Analysis of ClinicalTrials.gov (AACT) database, focusing on drug trials conducted between 2018 and 2023. We used three Llama 3 models: 8b (model 1), 8b fine-tuned with curated data (model 2), and 70b (model 3). Prompt engineering enabled sophisticated tasks such as classifying DCTs with explanations and extracting decentralized elements. Model performance, evaluated on a 3-month exploratory test dataset, showed that fine-tuning improved sensitivity from 0.0357 to 0.5385. The low positive predictive value of the fine-tuned model 2 improved from 0.5385 to 0.9167 when the evaluation focused on trials with DCT-associated expressions. However, only model 3, which has a larger number of parameters, extracted decentralized elements properly. Based on these results, we screened the entire 6-year dataset after applying DCT-associated expressions. Subsequent application of models 2 and 3 identified 692 DCTs. A total of 213 trials were classified as phase 2, followed by 162 phase 4 trials, 112 phase 3 trials, and 92 phase 1 trials. In conclusion, our study demonstrated the potential of large language models for analyzing clinical trial information that is not structured in a machine-readable format. Managing potential biases during model application is crucial.
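The two metrics reported in the abstract, and the effect of restricting evaluation to trials that contain DCT-associated expressions, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the expression list, the `Trial` record, the toy trials, and their labels are hypothetical and are not the authors' data, expression list, or code.

```python
from dataclasses import dataclass

# Hypothetical DCT-associated expressions. The abstract does not list
# the expressions the authors actually used.
DCT_EXPRESSIONS = ("telemedicine", "home visit", "remote", "wearable", "econsent")


@dataclass
class Trial:
    nct_id: str          # placeholder identifier, not a real registry ID
    description: str     # free-text trial description
    is_dct: bool         # reference label, e.g. from manual review
    predicted_dct: bool  # label assigned by a classifier such as an LLM


def mentions_dct_expression(text: str) -> bool:
    """Return True if the text contains any of the assumed expressions."""
    lowered = text.lower()
    return any(expr in lowered for expr in DCT_EXPRESSIONS)


def sensitivity_and_ppv(trials):
    """Compute sensitivity = TP / (TP + FN) and PPV = TP / (TP + FP)."""
    tp = sum(t.is_dct and t.predicted_dct for t in trials)
    fn = sum(t.is_dct and not t.predicted_dct for t in trials)
    fp = sum((not t.is_dct) and t.predicted_dct for t in trials)
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    ppv = tp / (tp + fp) if (tp + fp) else float("nan")
    return sensitivity, ppv


# Toy records with made-up labels, for illustration only.
TRIALS = [
    Trial("T1", "Study drug shipped to participants with telemedicine follow-up", True, True),
    Trial("T2", "Conventional site-based dosing study", False, True),
    Trial("T3", "Remote monitoring with a wearable sensor", True, True),
    Trial("T4", "Home visit by a nurse for blood sampling", True, False),
    Trial("T5", "In-clinic pharmacokinetic study", False, False),
]

# Metrics over all trials, then over the expression-filtered subset.
sens_all, ppv_all = sensitivity_and_ppv(TRIALS)
filtered = [t for t in TRIALS if mentions_dct_expression(t.description)]
sens_filtered, ppv_filtered = sensitivity_and_ppv(filtered)

print(f"all trials:      sensitivity={sens_all:.4f}, PPV={ppv_all:.4f}")
print(f"filtered trials: sensitivity={sens_filtered:.4f}, PPV={ppv_filtered:.4f}")
```

In this toy set the filter removes the one false positive, so PPV rises while sensitivity is unchanged. A real keyword filter can also drop true DCTs that use unlisted wording, which is one of the biases such a pipeline would need to manage.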


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6f7/11873368/3b240a8887d0/CTS-18-e70183-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6f7/11873368/3a6dc5ad12a8/CTS-18-e70183-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6f7/11873368/2dac3521d931/CTS-18-e70183-g004.jpg

