Nohle David G, Hackman Barbara A, Ayers Leona W
The Mid-Region AIDS and Cancer Specimen Resource (ACSR), Department of Pathology, The Ohio State University, Columbus, OH, USA.
BMC Med Inform Decis Mak. 2005 Aug 8;5:25. doi: 10.1186/1472-6947-5-25.
The AIDS and Cancer Specimen Resource (ACSR) is an HIV/AIDS tissue bank consortium sponsored by the National Cancer Institute (NCI) Division of Cancer Treatment and Diagnosis (DCTD). The ACSR offers approved researchers HIV-infected biologic samples and uninfected control tissues, including tissue cores in tissue microarrays (TMA), accompanied by de-identified clinical data. Researchers interested in the type and quality of TMA tissue cores and the associated clinical data need an efficient method for viewing available TMA materials. Because each tissue sample within a TMA has its own data, including a digital image of the tissue core and clinical data, an organized, standard approach to producing, navigating and publishing such data is necessary. The Association for Pathology Informatics (API) Extensible Markup Language (XML) TMA data exchange specification (TMA DES), proposed in April 2003, provides a common format for TMA data. Exporting TMA data into the proposed format offers an opportunity to implement the API TMA DES.
Using our public BrowseTMA tool, we created a web site that organizes and cross-references TMA lists, digital "virtual slide" images, TMA DES export data, linked legends and clinical details for researchers. Microsoft Excel and Microsoft Word are used to convert tabular clinical data and produce an XML file in the TMA DES format. The BrowseTMA tool contains Extensible Stylesheet Language Transformations (XSLT) scripts that convert XML data into HyperText Markup Language (HTML) web pages, with hyperlinks added automatically to allow rapid navigation.
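The conversion of tabular clinical data into an XML export can be illustrated with a minimal Python sketch. This is not the Excel and Word workflow described above. It assumes a hypothetical CSV layout with one row per tissue core, and the element names (`tma`, `block`, `core`) and column names are placeholders chosen for illustration rather than tags taken from the TMA DES. The sample rows are invented.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical tabular export: one row per tissue core.
# Column names and values are illustrative, not TMA DES data elements.
SAMPLE_CSV = """block_id,core_id,row,col,diagnosis
B01,B01-001,1,1,Example diagnosis A
B01,B01-002,1,2,Example diagnosis B
"""


def rows_to_tma_xml(csv_text: str) -> ET.Element:
    """Group core rows by block and build a nested XML tree."""
    root = ET.Element("tma")  # placeholder root element name
    blocks = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        block_id = row.pop("block_id")
        block = blocks.get(block_id)
        if block is None:
            # First core seen for this block: create the block element.
            block = ET.SubElement(root, "block", {"id": block_id})
            blocks[block_id] = block
        core = ET.SubElement(block, "core", {"id": row.pop("core_id")})
        # Every remaining column becomes a child element of the core.
        for name, value in row.items():
            ET.SubElement(core, name).text = value
    return root


if __name__ == "__main__":
    print(ET.tostring(rows_to_tma_xml(SAMPLE_CSV), encoding="unicode"))
```

A real export would use the common data elements defined in the specification in place of these placeholder names.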
Block lists, virtual slide images, legends, clinical details and exports have been placed on the ACSR web site for 14 blocks with 1623 cores in 2.0, 1.0 and 0.6 mm sizes. Our virtual microscope can be used to view and annotate these TMA images. Researchers can readily navigate from TMA block lists to TMA legends and on to the clinical details for a selected tissue core. Exports for 11 blocks with 3812 cores from three other institutions were processed with the BrowseTMA tool. Fifty common data elements (CDE) from the TMA DES were used, and 42 more were created for site-specific data. Researchers can download TMA clinical data in the TMA DES format.
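The navigation path from block list to legend to core details can be sketched in Python as follows. BrowseTMA itself uses XSLT scripts for this step. The sketch substitutes plain Python with `xml.etree.ElementTree` so that it runs with the standard library alone. The input structure, the file naming scheme and the function name `render_pages` are assumptions made for illustration.

```python
import html
import xml.etree.ElementTree as ET
from typing import Dict

# Hypothetical export; element and attribute names are placeholders,
# not TMA DES tags.
SAMPLE_XML = """<tma>
  <block id="B01">
    <core id="B01-001"><diagnosis>Example diagnosis A</diagnosis></core>
    <core id="B01-002"><diagnosis>Example diagnosis B</diagnosis></core>
  </block>
</tma>"""


def render_pages(xml_text: str) -> Dict[str, str]:
    """Return {file name: HTML} for a block list, block legends and core details."""
    root = ET.fromstring(xml_text)
    pages: Dict[str, str] = {}
    block_items = []
    for block in root.iter("block"):
        block_id = block.get("id", "")
        legend_file = f"legend_{block_id}.html"
        block_items.append(
            f'<li><a href="{html.escape(legend_file)}">{html.escape(block_id)}</a></li>'
        )
        core_items = []
        for core in block.iter("core"):
            core_id = core.get("id", "")
            detail_file = f"core_{core_id}.html"
            core_items.append(
                f'<li><a href="{html.escape(detail_file)}">{html.escape(core_id)}</a></li>'
            )
            # One table row per clinical data element of the core.
            rows = "".join(
                f"<tr><th>{html.escape(field.tag)}</th>"
                f"<td>{html.escape(field.text or '')}</td></tr>"
                for field in core
            )
            pages[detail_file] = (
                f"<h1>Core {html.escape(core_id)}</h1><table>{rows}</table>"
                f'<p><a href="{html.escape(legend_file)}">Back to legend</a></p>'
            )
        pages[legend_file] = (
            f"<h1>Legend for block {html.escape(block_id)}</h1>"
            f"<ul>{''.join(core_items)}</ul>"
            '<p><a href="index.html">Back to block list</a></p>'
        )
    pages["index.html"] = f"<h1>TMA blocks</h1><ul>{''.join(block_items)}</ul>"
    return pages


if __name__ == "__main__":
    for name in sorted(render_pages(SAMPLE_XML)):
        print(name)
```

Generating the links from the identifiers in the data, rather than writing them by hand, is what keeps the block list, legends and detail pages consistent with one another.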
Virtual TMAs with clinical data can be viewed on the Internet by interested researchers using the BrowseTMA tool. We have organized our approach to producing, sorting, navigating and publishing TMA information to facilitate such review. We converted Excel TMA data into TMA DES XML and imported it, together with TMA DES XML from another institution, into BrowseTMA to produce web pages that allow us to browse the merged data. As a result of this experience we proposed enhancements to the TMA DES, and we implemented improvements to the API TMA DES after working with exported data from several institutions. We wrote a document type definition (DTD) for the API TMA DES that optionally includes the proposed enhancements. Independent validators can be used to check exports against the DTD, with or without the proposed enhancements. Linking tissue core images to readily navigable clinical data greatly improves the value of the TMA.
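Merging exports from more than one institution and checking the result can be sketched in Python. The parsers in Python's standard library do not validate against a DTD, so the structural check below is only a stand-in for the DTD validation described above, not an implementation of it. The element names, the required field list and both function names are hypothetical.

```python
import xml.etree.ElementTree as ET
from typing import Iterable, List

# Hypothetical rule: every core must carry these child elements.
REQUIRED_CORE_FIELDS = ("diagnosis",)


def merge_exports(xml_texts: Iterable[str]) -> ET.Element:
    """Collect the <block> elements of several exports under one root."""
    merged = ET.Element("tma")  # placeholder root element name
    for text in xml_texts:
        for block in ET.fromstring(text).iter("block"):
            merged.append(block)
    return merged


def check_structure(root: ET.Element) -> List[str]:
    """Return a list of problems; an empty list means the checks passed."""
    problems: List[str] = []
    seen_core_ids = set()
    for block in root.iter("block"):
        block_id = block.get("id")
        if not block_id:
            problems.append("block without id")
        for core in block.iter("core"):
            core_id = core.get("id")
            if not core_id:
                problems.append(f"core without id in block {block_id}")
                continue
            # Identifiers must stay unique once institutions are merged.
            if core_id in seen_core_ids:
                problems.append(f"duplicate core id {core_id}")
            seen_core_ids.add(core_id)
            for field in REQUIRED_CORE_FIELDS:
                if core.find(field) is None:
                    problems.append(f"core {core_id} missing <{field}>")
    return problems
```

A check of this kind catches identifier collisions between institutions, which a DTD alone would only detect if the identifiers were declared as XML `ID` attributes.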