College of Information Science and Technology, Drexel University, Philadelphia, PA 19104, USA.
Comput Methods Programs Biomed. 2010 Jul;99(1):1-24. doi: 10.1016/j.cmpb.2009.10.003. Epub 2009 Nov 13.
In this survey, we reviewed the current state of the art in biomedical QA (Question Answering), within a broader framework of semantic knowledge-based QA approaches, and projected directions for the future research development in this critical area of intersection between Artificial Intelligence, Information Retrieval, and Biomedical Informatics.
We devised a conceptual framework within which to categorize current QA approaches. In particular, we used "semantic knowledge-based QA" as a category under which to subsume QA techniques and approaches, both corpus-based and knowledge base (KB)-based, that utilize semantic knowledge-informed techniques in the QA process, and we further classified those approaches into three subcategories: (1) semantics-based, (2) inference-based, and (3) logic-based. Based on the framework, we first conducted a survey of open-domain or non-biomedical-domain QA approaches that belong to each of the three subcategories. We then conducted an in-depth review of biomedical QA, by first noting the characteristics of, and resources available for, biomedical QA and then reviewing medical QA approaches and biological QA approaches, in turn. The research articles reviewed in this paper were found and selected through online searches.
Our review suggested the following tasks ahead for the future research development in this area: (1) Construction of domain-specific typology and taxonomy of questions (biological QA), (2) Development of more sophisticated techniques for natural language (NL) question analysis and classification, (3) Development of effective methods for answer generation from potentially conflicting evidences, (4) More extensive and integrated utilization of semantic knowledge throughout the QA process, and (5) Incorporation of logic and reasoning mechanisms for answer inference.
Corresponding to the growth of biomedical information, there is a growing need for QA systems that can help users better utilize the ever-accumulating information. Continued research toward development of more sophisticated techniques for processing NL text, for utilizing semantic knowledge, and for incorporating logic and reasoning mechanisms, will lead to more useful QA systems.
在更广泛的语义知识为基础的问答方法框架内,回顾生物医学 QA(问答)的当前技术状态,并预测人工智能、信息检索和生物医学信息学交叉领域未来研究的发展方向。
我们设计了一个概念框架,用以对当前的问答方法进行分类。特别是,我们将“语义知识为基础的 QA”作为一个类别,包含利用 QA 过程中的语义知识的技术和方法,这些方法既包括基于语料库的方法,也包括基于知识库(KB)的方法,并进一步将这些方法分为三个子类:(1)基于语义的、(2)基于推理的和(3)基于逻辑的。基于该框架,我们首先对属于这三个子类的开放领域或非生物医学领域的问答方法进行了调查。然后,我们首先注意到生物医学 QA 的特点和资源,然后依次回顾医学 QA 方法和生物 QA 方法,对生物医学 QA 进行了深入的回顾。本文综述的研究文章是通过在线搜索找到和选择的。
我们的综述为该领域未来的研究发展提出了以下任务:(1)构建生物医学 QA 的特定领域类型学和分类学,(2)开发更复杂的自然语言(NL)问题分析和分类技术,(3)开发从潜在冲突证据中生成有效答案的方法,(4)在整个 QA 过程中更广泛和综合地利用语义知识,(5)纳入逻辑和推理机制进行答案推理。
随着生物医学信息的增长,需要开发能够帮助用户更好地利用不断积累的信息的 QA 系统。继续研究更复杂的 NL 文本处理技术、利用语义知识和纳入逻辑和推理机制,将导致更有用的 QA 系统。