Children's Hospital of Eastern Ontario, Ottawa, Ontario, Canada.
Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada.
Int J Technol Assess Health Care. 2020 Dec 18;37:e7. doi: 10.1017/S0266462320002159.
Solutions like crowd screening and machine learning can assist systematic reviewers with heavy screening burdens but require training sets containing a mix of eligible and ineligible studies. This study explores using PubMed's Best Match algorithm to create small training sets containing at least five relevant studies.
Six systematic reviews were examined retrospectively. MEDLINE searches were converted and run in PubMed. The ranking of included studies was studied under both Best Match and Most Recent sort conditions.
Retrieval sizes for the systematic reviews ranged from 151 to 5,406 records and the numbers of relevant records ranged from 8 to 763. The median ranking of relevant records was higher in Best Match for all six reviews, when compared with Most Recent sort. Best Match placed a total of thirty relevant records in the first fifty, at least one for each systematic review. Most Recent sorting placed only ten relevant records in the first fifty. Best Match sorting outperformed Most Recent in all cases and placed five or more relevant records in the first fifty in three of six cases.
Using a predetermined set size such as fifty may not provide enough true positives for an effective systematic review training set. However, screening PubMed records ranked by Best Match and continuing until the desired number of true positives are identified is efficient and effective.
The Best Match sort in PubMed improves the ranking and increases the proportion of relevant records in the first fifty records relative to sorting by recency.
人群筛选和机器学习等解决方案可以帮助系统审查员减轻繁重的筛选负担,但需要包含合格和不合格研究的混合训练集。本研究探讨了使用 PubMed 的最佳匹配算法创建至少包含五个相关研究的小训练集。
回顾性研究了 6 项系统评价。将 MEDLINE 检索转换并在 PubMed 中运行。在最佳匹配和最新排序条件下研究了纳入研究的排名。
系统评价的检索规模从 151 条到 5406 条不等,相关记录的数量从 8 条到 763 条不等。与最新排序相比,在所有 6 项综述中,最佳匹配的相关记录中位数排名更高。最佳匹配总共将 30 条相关记录排在前 50 位,每个系统综述至少有一条。最新排序在前 50 名中仅放置了 10 条相关记录。最佳匹配在所有情况下均优于最新排序,并在前 50 名中放置了 5 条或更多相关记录,在 6 个案例中有 3 个案例中。
使用预定的大小(如 50)可能无法为有效的系统评价训练集提供足够的真阳性。但是,按最佳匹配对 PubMed 记录进行筛选,并继续筛选,直到确定所需数量的真阳性,这种方法是高效且有效的。
PubMed 中的最佳匹配排序提高了排名,并增加了前 50 条记录中相关记录的比例,相对于按最新排序。