Cawley Michelle, Carlson Rebecca, Vest Tyler A, Eckel Stephen F
University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
University of Vermont Health Network, Burlington, VT, and University of North Carolina Eshelman School of Pharmacy, Chapel Hill, NC, USA.
Am J Health Syst Pharm. 2025 May 21;82(10):551-558. doi: 10.1093/ajhp/zxae357.
This article summarizes a novel methodology of applying machine learning (ML) algorithms trained with external training data to assist with article screening for 2 annual review series related to the medication-use process (MUP) generally and the MUP in ambulatory care settings (ACMUP) specifically. As the literature review for these 2 series grew over time, it became essential for the authors to develop methods to be efficient while still capturing most of the relevant literature. The ML model can be used to predict whether search results are likely to be relevant or not relevant. Results least likely to be relevant can then be excluded without manual screening, allowing research teams to save time that would otherwise be spent reviewing a portion of the search results for inclusion. ML models require a large training dataset typically derived from the unclassified corpus. In this study, the authors demonstrate the efficacy of training the ML model using external training data, which is possible in scenarios such as a systematic review update or ongoing review series such as those for the MUP and ACMUP.
The authors ran 3 simulations using screening decisions from previous publications and in-process manuscripts for the MUP and ACMUP review series to test the efficacy of the approach. The simulations were compared to actual manual screening decisions made by the research teams to include or exclude articles using title and abstract text. For each simulation, the authors developed a training dataset using a sample of screening decisions from previous years to predict article relevance in an "unclassified" corpus. In this case, the screening decisions for the unclassified corpus were actually known, allowing us to calculate recall (percent of relevant articles captured) and time saved using the number of articles that would be excluded without manual review. Combined, the ML approach correctly labeled 187 of 192 relevant studies. The 3 simulations included 17,227 unique studies, and using ML the authors demonstrated that 13,201 studies could have been excluded without manual screening while still maintaining recall of relevant articles of 95% or greater.
This novel approach is applicable to systematic reviews and ongoing review series, including those for the MUP and ACMUP. Pharmacists have a duty to review and incorporate best practices into their organizations to improve the efficiency and cost of care, optimally utilize technology, and reduce the potential for medication errors. This methodology will allow evidence syntheses for the MUP and other disciplines in pharmacy practice to be published more expeditiously by saving significant time during the article screening step.
本文总结了一种新颖的方法,即应用使用外部训练数据训练的机器学习(ML)算法,以协助对两个年度综述系列进行文章筛选,这两个系列总体上与用药过程(MUP)相关,具体而言与门诊护理环境中的用药过程(ACMUP)相关。随着这两个系列的文献综述随着时间的推移不断增加,作者开发高效方法同时仍能涵盖大多数相关文献变得至关重要。ML模型可用于预测搜索结果可能相关或不相关。然后可以在不进行人工筛选的情况下排除最不可能相关的结果,使研究团队能够节省原本用于审查一部分搜索结果以确定是否纳入的时间。ML模型通常需要从未分类语料库中获取的大型训练数据集。在本研究中,作者展示了使用外部训练数据训练ML模型的有效性,这在诸如系统评价更新或正在进行的综述系列(如MUP和ACMUP的综述系列)等场景中是可行的。
作者使用来自先前出版物和MUP及ACMUP综述系列正在撰写的手稿中的筛选决策进行了3次模拟,以测试该方法的有效性。将模拟结果与研究团队使用标题和摘要文本对文章进行纳入或排除的实际人工筛选决策进行比较。对于每次模拟,作者使用前几年的筛选决策样本开发了一个训练数据集,以预测“未分类”语料库中文章的相关性。在这种情况下,未分类语料库的筛选决策实际上是已知的,这使我们能够计算召回率(捕获的相关文章百分比),并使用无需人工审查即可排除的文章数量来计算节省的时间。综合来看,ML方法正确标记了192项相关研究中的187项。这3次模拟包括17227项独特的研究,作者使用ML证明,在不进行人工筛选的情况下可以排除13201项研究,同时仍能保持95%或更高的相关文章召回率。
这种新颖的方法适用于系统评价和正在进行的综述系列,包括MUP和ACMUP的综述系列。药剂师有责任审查并将最佳实践纳入其组织,以提高护理效率和成本、优化技术利用并减少用药错误的可能性。这种方法将通过在文章筛选步骤中节省大量时间,使MUP和药学实践中其他学科的证据综合能够更快地发表。