Zimmermann Marc, Fluck Juliane, Thi Le Thuy Bui, Kolárik Corinna, Kumpf Kai, Hofmann Martin
Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI), Schloss Birlinghoven, D-53754 St. Augustin, Germany.
Curr Top Med Chem. 2005;5(8):785-96. doi: 10.2174/1568026054637692.
Information extraction approaches have been successfully applied to mine the scientific literature in biology and medicine. So far, the main focus of research and development in this domain was on the recognition and extraction of gene and protein names in the context of molecular biology and genome research and on disease names and other medical terms in the context of clinical research. Similar to biology and medical sciences, medicinal chemistry, pharmacology and toxicology are descriptive sciences. However, information extraction approaches in these disciplines encounter a number of problems that are specific to the fact that these scientific areas are essentially centred at chemical compounds and their structures. In this review, we will give a short overview on general information extraction strategies in the life sciences and we will introduce new approaches to apply information extraction to the domain of pharmacology, medicinal chemistry and toxicology. Finally, we will emphasize on how information extraction approaches will support public and commercial research in medicinal chemistry, pharmacology and toxicology by linking information on chemical structures to biological information.
信息提取方法已成功应用于挖掘生物学和医学领域的科学文献。到目前为止,该领域研发的主要重点是在分子生物学和基因组研究背景下识别和提取基因和蛋白质名称,以及在临床研究背景下识别和提取疾病名称及其他医学术语。与生物学和医学科学类似,药物化学、药理学和毒理学是描述性科学。然而,这些学科中的信息提取方法遇到了一些特定问题,这些问题源于这些科学领域本质上以化合物及其结构为核心这一事实。在本综述中,我们将简要概述生命科学中的一般信息提取策略,并介绍将信息提取应用于药理学、药物化学和毒理学领域的新方法。最后,我们将强调信息提取方法如何通过将化学结构信息与生物信息相联系,来支持药物化学、药理学和毒理学领域的公共研究和商业研究。