Cheung Kei-Hoi, Yip Kevin Y, Townsend Jeffrey P, Scotch Matthew
Center for Medical Informatics, Yale University, 300 George Street, Suite 501, New Haven, CT 06511, USA.
J Biomed Inform. 2008 Oct;41(5):694-705. doi: 10.1016/j.jbi.2008.04.001. Epub 2008 Apr 11.
We describe the potential of current Web 2.0 technologies to achieve data mashup in the health care and life sciences (HCLS) domains, and compare that potential to the nascent trend of performing semantic mashup. After providing an overview of Web 2.0, we demonstrate two scenarios of data mashup, facilitated by the following Web 2.0 tools and sites: Yahoo! Pipes, Dapper, Google Maps and GeoCommons. In the first scenario, we exploited Dapper and Yahoo! Pipes to implement a challenging data integration task in the context of DNA microarray research. In the second scenario, we exploited Yahoo! Pipes, Google Maps, and GeoCommons to create a geographic information system (GIS) interface that allows visualization and integration of diverse categories of public health data, including cancer incidence and pollution prevalence data. Based on these two scenarios, we discuss the strengths and weaknesses of these Web 2.0 mashup technologies. We then describe Semantic Web, the mainstream Web 3.0 technology that enables more powerful data integration over the Web. We discuss the areas of intersection of Web 2.0 and Semantic Web, and describe the potential benefits that can be brought to HCLS research by combining these two sets of technologies.
我们描述了当前Web 2.0技术在医疗保健和生命科学(HCLS)领域实现数据混搭的潜力,并将该潜力与进行语义混搭的新兴趋势进行比较。在对Web 2.0进行概述之后,我们展示了由以下Web 2.0工具和网站促成的两种数据混搭场景:雅虎管道(Yahoo! Pipes)、达珀(Dapper)、谷歌地图(Google Maps)和地理共享(GeoCommons)。在第一个场景中,我们利用达珀和雅虎管道在DNA微阵列研究的背景下完成了一项具有挑战性的数据集成任务。在第二个场景中,我们利用雅虎管道、谷歌地图和地理共享创建了一个地理信息系统(GIS)界面,该界面允许对包括癌症发病率和污染流行率数据在内的各类公共卫生数据进行可视化和集成。基于这两个场景,我们讨论了这些Web 2.0混搭技术的优缺点。然后,我们描述语义网,这是主流的Web 3.0技术,能够在网络上实现更强大的数据集成。我们讨论了Web 2.0和语义网的交叉领域,并描述了将这两组技术结合起来可以为HCLS研究带来的潜在好处。