Towards the automatic generation of biomedical sources schema.
Author information
Mougin Fleur, Burgun Anita, Loréal Olivier, Le Beux Pierre
Affiliation
Laboratoire d'Informatique Médicale, Faculté de Médecine, Université de Rennes 1, France.
Publication information
Stud Health Technol Inform. 2004;107(Pt 2):783-7.
Biologists and physicians need access to biological and medical data for their experiments and research. This information is available on the Internet but is scattered across many heterogeneous data sources, so collecting it is tedious and time-consuming, and the process needs to be improved. To address this difficulty, our overall objective is to build a mediator-based system that integrates heterogeneous biomedical data sources. This first requires the automatic generation of source schemas, which is the goal of this work. To that end, we describe an algorithm based on information extraction: it extracts meta-information from each source in order to infer that source's schema. Our system enables users to access relevant, specific, and up-to-date data. To resolve the semantic heterogeneity of the data sources, we are considering the creation of an ontology. Finally, the management of source evolution is discussed.