Goodwin Travis R, Harabagiu Sanda M
Human Language Technology Research Institute, Department of Computer Science, University of Texas at Dallas, 800 W. Campbell Rd., Richardson, Texas 75080.
Proc ACM Int Conf Inf Knowl Manag. 2016 Oct;2016:297-306. doi: 10.1145/2983323.2983819.
The goal of modern Clinical Decision Support (CDS) systems is to provide physicians with information relevant to their management of patient care. When faced with a medical case, a physician asks questions about the diagnosis, the tests, or treatments that should be administered. Recently, the TREC-CDS track has addressed this challenge by evaluating results of retrieving relevant scientific articles where the answers of medical questions in support of CDS can be found. Although retrieving relevant medical articles instead of identifying the answers was believed to be an easier task, state-of-the-art results are not yet sufficiently promising. In this paper, we present a novel framework for answering medical questions in the spirit of TREC-CDS by first discovering the answer and then selecting and ranking scientific articles that contain the answer. Answer discovery is the result of probabilistic inference which operates on a probabilistic knowledge graph, automatically generated by processing the medical language of large collections of electronic medical records (EMRs). The probabilistic inference of answers combines knowledge from medical practice (EMRs) with knowledge from medical research (scientific articles). It also takes into account the medical knowledge automatically discerned from the medical case description. We show that this novel form of medical question answering (Q/A) produces very promising results in (a) identifying accurately the answers and (b) it improves medical article ranking by 40%.
现代临床决策支持(CDS)系统的目标是为医生提供与患者护理管理相关的信息。面对一个医疗案例时,医生会询问关于诊断、检查或应给予的治疗方面的问题。最近,TREC-CDS赛道通过评估检索相关科学文章的结果来应对这一挑战,在这些文章中可以找到支持CDS的医学问题的答案。尽管检索相关医学文章而非识别答案被认为是一项更简单的任务,但目前最先进的结果仍不尽如人意。在本文中,我们提出了一个新颖的框架,用于按照TREC-CDS的思路回答医学问题,即首先发现答案,然后选择并对包含该答案的科学文章进行排名。答案发现是概率推理的结果,该推理在一个概率知识图谱上运行,该图谱是通过处理大量电子病历(EMR)的医学语言自动生成的。答案的概率推理将来自医学实践(EMR)的知识与来自医学研究(科学文章)的知识相结合。它还考虑了从医疗案例描述中自动识别出的医学知识。我们表明,这种新颖的医学问答(Q/A)形式在(a)准确识别答案以及(b)将医学文章排名提高40%方面产生了非常有前景的结果。