Luo Yuan, Uzuner Özlem, Szolovits Peter
Brief Bioinform. 2017 Jan;18(1):160-178. doi: 10.1093/bib/bbw001. Epub 2016 Feb 5.
Research on extracting biomedical relations has received growing attention recently, with numerous biological and clinical applications including those in pharmacogenomics, clinical trial screening and adverse drug reaction detection. The ability to accurately capture both semantic and syntactic structures in text expressing these relations becomes increasingly critical to enable deep understanding of scientific papers and clinical narratives. Shared task challenges have been organized by both bioinformatics and clinical informatics communities to assess and advance the state-of-the-art research. Significant progress has been made in algorithm development and resource construction. In particular, graph-based approaches bridge semantics and syntax, often achieving the best performance in shared tasks. However, a number of problems at the frontiers of biomedical relation extraction continue to pose interesting challenges and present opportunities for great improvement and fruitful research. In this article, we place biomedical relation extraction against the backdrop of its versatile applications, present a gentle introduction to its general pipeline and shared resources, review the current state-of-the-art in methodology advancement, discuss limitations and point out several promising future directions.