INSERM, Centre de Recherche des Cordeliers, UMRS 1138, Université de Paris, Université Sorbonne Paris Cité, Paris, France.
Department of Medical Informatics, Hôpital Européen Georges Pompidou, Assistance publique-Hôpitaux de Paris, Paris, France.
J Am Med Inform Assoc. 2021 Mar 1;28(3):504-515. doi: 10.1093/jamia/ocaa261.
The increasing complexity of data streams and computational processes in modern clinical health information systems makes reproducibility challenging. Clinical natural language processing (NLP) pipelines are routinely leveraged for the secondary use of data. Workflow management systems (WMS) have been widely used in bioinformatics to handle the reproducibility bottleneck.
To evaluate if WMS and other bioinformatics practices could impact the reproducibility of clinical NLP frameworks.
Based on the literature across multiple researcho fields (NLP, bioinformatics and clinical informatics) we selected articles which (1) review reproducibility practices and (2) highlight a set of rules or guidelines to ensure tool or pipeline reproducibility. We aggregate insight from the literature to define reproducibility recommendations. Finally, we assess the compliance of 7 NLP frameworks to the recommendations.
We identified 40 reproducibility features from 8 selected articles. Frameworks based on WMS match more than 50% of features (26 features for LAPPS Grid, 22 features for OpenMinted) compared to 18 features for current clinical NLP framework (cTakes, CLAMP) and 17 features for GATE, ScispaCy, and Textflows.
34 recommendations are endorsed by at least 2 articles from our selection. Overall, 15 features were adopted by every NLP Framework. Nevertheless, frameworks based on WMS had a better compliance with the features.
NLP frameworks could benefit from lessons learned from the bioinformatics field (eg, public repositories of curated tools and workflows or use of containers for shareability) to enhance the reproducibility in a clinical setting.
现代临床健康信息系统中数据流和计算过程日益复杂,使得可重复性成为挑战。临床自然语言处理 (NLP) 管道通常被用于数据的二次利用。工作流管理系统 (WMS) 已广泛应用于生物信息学,以解决可重复性瓶颈问题。
评估 WMS 和其他生物信息学实践是否会影响临床 NLP 框架的可重复性。
根据跨多个研究领域(NLP、生物信息学和临床信息学)的文献,我们选择了以下文章:(1) 综述可重复性实践,(2) 强调了一组规则或指南,以确保工具或管道的可重复性。我们从文献中收集见解,以定义可重复性建议。最后,我们评估了 7 个 NLP 框架对这些建议的遵从情况。
我们从 8 篇选定的文章中确定了 40 个可重复性特征。基于 WMS 的框架符合 50%以上的特征(LAPPS Grid 有 26 个特征,OpenMinted 有 22 个特征),而当前临床 NLP 框架(cTakes、CLAMP)符合 18 个特征,GATE、ScispaCy 和 Textflows 符合 17 个特征。
我们的选择中有至少 2 篇文章支持 34 条建议。总体而言,每个 NLP 框架都采用了 15 个特征。然而,基于 WMS 的框架与这些特征的一致性更好。
NLP 框架可以从生物信息学领域吸取经验教训(例如,经过审核的工具和工作流程的公共存储库,或使用容器进行共享),以提高临床环境中的可重复性。