Lacroix Zoé
Arizona State University, Tempe 85287-6106, USA.
IEEE Trans Inf Technol Biomed. 2002 Jun;6(2):123-8. doi: 10.1109/titb.2002.1006299.
Nowadays scientific data is inevitably digital and stored in a wide variety of formats in heterogeneous systems. Scientists need to access an integrated view of remote or local heterogeneous data sources with advanced data accessing, analyzing, and visualization tools. Building a digital library for scientific data requires accessing and manipulating data extracted from flat files or databases, documents retrieved from the Web as well as data generated by software. We present an approach to wrapping web data sources, databases, flat files, or data generated by tools through a database view mechanism. Generally, a wrapper has two tasks: it first sends a query to the source to retrieve data and, second builds the expected output with respect to the virtual structure. Our wrappers are composed of a retrieval component based on an intermediate object view mechanism called search views mapping the source capabilities to attributes, and an eXtensible Markup Language (XML) engine, respectively, to perform these two tasks. The originality of the approach consists of: 1) a generic view mechanism to access seamlessly data sources with limited capabilities and 2) the ability to wrap data sources as well as the useful specific tools they may provide. Our approach has been developed and demonstrated as part of the multidatabase system supporting queries via uniform object protocol model (OPM) interfaces.
如今,科学数据不可避免地以数字形式存在,并以各种格式存储在异构系统中。科学家们需要借助先进的数据访问、分析和可视化工具,来获取远程或本地异构数据源的集成视图。构建科学数据数字图书馆需要访问和处理从平面文件或数据库中提取的数据、从网络检索到的文档以及由软件生成的数据。我们提出了一种通过数据库视图机制来包装网络数据源、数据库、平面文件或工具生成的数据的方法。一般来说,包装器有两项任务:首先,它向数据源发送查询以检索数据;其次,根据虚拟结构构建预期输出。我们的包装器分别由一个基于中间对象视图机制(称为搜索视图,将源功能映射到属性)的检索组件和一个可扩展标记语言(XML)引擎组成,以执行这两项任务。该方法的独特之处在于:1)一种通用视图机制,可无缝访问功能有限的数据源;2)能够包装数据源以及它们可能提供的有用特定工具。我们的方法是作为支持通过统一对象协议模型(OPM)接口进行查询的多数据库系统的一部分而开发和演示的。