Boulos Maged N, Roudsari Abdul V, Carson Ewart R
Centre for Measurement and Information in Medicine, School of Informatics, City University, London, UK.
Med Sci Monit. 2002 Jul;8(7):MT124-36.
BACKGROUND: HealthCyberMap (http://healthcybermap.semanticweb.org/) aims to map Internet health information resources in novel ways for enhanced retrieval and navigation. This is achieved by collecting appropriate resource metadata in an unambiguous form that preserves semantics.
MATERIAL/METHODS: We modelled a qualified Dublin Core (DC) metadata set ontology, with extra elements for resource quality and geographical provenance, in Protégé-2000. A metadata collection form helps acquire resource instance data within Protégé. The DC subject field is populated with UMLS terms imported directly from the UMLS Knowledge Source Server using the UMLS tab, a Protégé-2000 plug-in. The project is saved in RDFS/RDF.
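The kind of record described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the HealthCyberMap ontology or Protégé-2000's actual output: the `hcm` namespace, the `quality` and `country` element names, and the concept identifiers are hypothetical placeholders, and only the Dublin Core and RDF namespaces are real.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field
from typing import List

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC_NS = "http://purl.org/dc/elements/1.1/"
HCM_NS = "http://example.org/hcm#"  # hypothetical namespace for the extra elements


@dataclass
class ResourceRecord:
    """One metadata record for a Web resource (field names are assumptions)."""
    url: str
    title: str
    subjects: List[str] = field(default_factory=list)  # concept identifiers
    quality: str = ""   # hypothetical resource-quality element
    country: str = ""   # hypothetical geographical-provenance element


def to_rdf_xml(record: ResourceRecord) -> str:
    """Serialise a record as a single rdf:Description in RDF/XML."""
    ET.register_namespace("rdf", RDF_NS)
    ET.register_namespace("dc", DC_NS)
    ET.register_namespace("hcm", HCM_NS)

    root = ET.Element(f"{{{RDF_NS}}}RDF")
    desc = ET.SubElement(
        root, f"{{{RDF_NS}}}Description", {f"{{{RDF_NS}}}about": record.url}
    )
    ET.SubElement(desc, f"{{{DC_NS}}}title").text = record.title
    # One dc:subject element per concept describing the resource.
    for concept in record.subjects:
        ET.SubElement(desc, f"{{{DC_NS}}}subject").text = concept
    if record.quality:
        ET.SubElement(desc, f"{{{HCM_NS}}}quality").text = record.quality
    if record.country:
        ET.SubElement(desc, f"{{{HCM_NS}}}country").text = record.country
    return ET.tostring(root, encoding="unicode")


example = ResourceRecord(
    url="http://example.org/asthma-guide",
    title="Example asthma guide",
    subjects=["EXAMPLE-CUI-1", "EXAMPLE-CUI-2"],  # placeholders, not real UMLS codes
    quality="peer-reviewed",
    country="UK",
)
print(to_rdf_xml(example))
```

Storing each subject as a coded concept rather than free text is what keeps the record unambiguous for later machine processing.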
RESULTS: The ontology and associated form serve as a free tool for building and maintaining an RDF medical resource metadata base. The UMLS tab enables browsing and searching for concepts that best describe a resource, and importing them to DC subject fields. The resultant metadata base can be used with a search and inference engine, and have textual and/or visual navigation interface(s) applied to it, to ultimately build a medical Semantic Web portal. Different ways of exploiting Protégé-2000 RDF output are discussed.
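As a rough illustration of how such a metadata base might be searched, the sketch below looks up resources by subject concept in a small RDF/XML document. It assumes a simplified serialisation (plain `rdf:Description` nodes with literal `dc:subject` values) that may differ from what Protégé-2000 actually emits; the sample data and the `find_by_subject` helper are hypothetical. A production system would more likely use a dedicated RDF store and query engine.

```python
import xml.etree.ElementTree as ET
from typing import List

NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical sample metadata base; the concept identifiers are placeholders.
SAMPLE_RDF = """<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:dc="http://purl.org/dc/elements/1.1/">
  <rdf:Description rdf:about="http://example.org/a">
    <dc:title>Resource A</dc:title>
    <dc:subject>EXAMPLE-CUI-1</dc:subject>
  </rdf:Description>
  <rdf:Description rdf:about="http://example.org/b">
    <dc:title>Resource B</dc:title>
    <dc:subject>EXAMPLE-CUI-1</dc:subject>
    <dc:subject>EXAMPLE-CUI-2</dc:subject>
  </rdf:Description>
</rdf:RDF>"""


def find_by_subject(rdf_xml: str, concept: str) -> List[str]:
    """Return the URLs of all resources whose dc:subject includes `concept`."""
    root = ET.fromstring(rdf_xml)
    hits = []
    for desc in root.findall("rdf:Description", NS):
        subjects = {s.text for s in desc.findall("dc:subject", NS)}
        if concept in subjects:
            hits.append(desc.get(f"{{{NS['rdf']}}}about"))
    return hits


print(find_by_subject(SAMPLE_RDF, "EXAMPLE-CUI-2"))
```

Because the match is on a concept identifier rather than on words in the page, two resources that use different wording for the same topic are retrieved together.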
CONCLUSIONS: By making the context and semantics of resources, not merely their raw text and formatting, amenable to computer 'understanding,' we can build a Semantic Web that is more useful to humans than the current Web. This requires proper use of metadata and ontologies. Clinical codes can reliably describe the subjects of medical resources, establish the semantic relationships (as defined by the underlying coding scheme) between related resources, and automate their topical categorisation.
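The role of a coding scheme in automated categorisation can be sketched with a toy hierarchy. Everything here is an assumption made for illustration: the codes, the parent links, and the resource URLs are invented and do not come from UMLS or any real clinical terminology.

```python
from collections import defaultdict
from typing import Dict, List, Set

# Toy "is-a" hierarchy: child code -> parent code (hypothetical, not a real scheme).
PARENT: Dict[str, str] = {
    "asthma": "respiratory-disease",
    "pneumonia": "respiratory-disease",
    "respiratory-disease": "disease",
    "diabetes": "endocrine-disease",
    "endocrine-disease": "disease",
}


def ancestors(code: str) -> List[str]:
    """Walk up the hierarchy and return every broader code, nearest first."""
    chain = []
    while code in PARENT:
        code = PARENT[code]
        chain.append(code)
    return chain


def categorise(resources: Dict[str, List[str]]) -> Dict[str, Set[str]]:
    """File each resource under its own codes and all of their ancestors."""
    categories: Dict[str, Set[str]] = defaultdict(set)
    for url, codes in resources.items():
        for code in codes:
            for category in [code, *ancestors(code)]:
                categories[category].add(url)
    return categories


# Hypothetical resources, each described by one or more codes.
resources = {
    "http://example.org/asthma-guide": ["asthma"],
    "http://example.org/pneumonia-faq": ["pneumonia"],
    "http://example.org/diabetes-diet": ["diabetes"],
}

index = categorise(resources)
print(sorted(index["respiratory-disease"]))
```

The asthma and pneumonia resources share no subject code, yet both appear under the broader respiratory category because the relationship is defined by the coding scheme, not by the text of the pages.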