Zhou Deyu, He Yulan
Informatics Research Centre, The University of Reading, Reading, RG6 6BX, UK.
J Biomed Inform. 2008 Apr;41(2):393-407. doi: 10.1016/j.jbi.2007.11.008. Epub 2007 Dec 15.
During the last decade, biomedicine has witnessed tremendous development. Large amounts of experimental and computational biomedical data have been generated along with new discoveries, accompanied by an exponential increase in the number of biomedical publications describing these discoveries. In the meantime, there has been great interest within scientific communities in text mining tools that find knowledge, such as protein-protein interactions, that is most relevant and useful for specific analysis tasks. This paper provides an outline of the various information extraction methods in the biomedical domain, especially for the discovery of protein-protein interactions. It surveys methodologies involved in analyzing and processing plain text, categorizes current work in biomedical information extraction, and provides examples of these methods. Challenges in the field are also presented and possible solutions are discussed.