Wallace Byron C
Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA.
IJCAI (U S). 2019 Aug;2019:6416-6420. doi: 10.24963/ijcai.2019/899.
Ideally decisions regarding medical treatments would be informed by the totality of the available evidence. The best evidence we currently have is in published natural language articles describing the conduct and results of clinical trials. Because these are unstructured, it is difficult for domain experts (e.g., physicians) to sort through and appraise the evidence pertaining to a given clinical question. Natural language technologies have the potential to improve access to the evidence via semi-automated processing of the biomedical literature. In this brief paper I highlight work on developing tasks, corpora, and models to support semi-automated evidence retrieval and extraction. The aim is to design models that can consume articles describing clinical trials and automatically extract from these key clinical variables and findings, and estimate their reliability. Completely automating 'machine reading' of evidence remains a distant aim given current technologies; the more immediate hope is to use such technologies to help domain experts access and make sense of unstructured biomedical evidence more efficiently, with the ultimate aim of improving patient care. Aside from their practical importance, these tasks pose core NLP challenges that directly motivate methodological innovation.
理想情况下,医疗治疗决策应以所有现有证据为依据。我们目前拥有的最佳证据来自已发表的自然语言文章,这些文章描述了临床试验的开展情况和结果。由于这些文章是非结构化的,领域专家(如医生)很难梳理和评估与特定临床问题相关的证据。自然语言技术有潜力通过对生物医学文献的半自动处理来改善对证据的获取。在这篇简短的论文中,我重点介绍了在开发任务、语料库和模型以支持半自动证据检索和提取方面所做的工作。目的是设计出能够读取描述临床试验的文章,并自动从中提取关键临床变量和研究结果,并评估其可靠性的模型。鉴于当前技术水平,完全自动化的证据“机器阅读”仍是一个遥远的目标;更直接的希望是利用此类技术帮助领域专家更高效地获取和理解非结构化生物医学证据,最终目标是改善患者护理。除了其实际重要性外,这些任务还带来了核心自然语言处理挑战,直接推动了方法创新。