Chen Wenjin, Schmidt Cristina, Parashar Manish, Reiss Michael, Foran David J
Center for Biomedical Imaging & Informatics, UMDNJ-Robert Wood Johnson Medical School.
Cancer Inform. 2007 Jun 6;2:373-88.
Tissue microarray technology (TMA) is a relatively new approach for efficiently and economically assessing protein and gene expression across large ensembles of tissue specimens. Tissue microarray technology holds great potential for reducing the time and cost associated with conducting research in tissue banking, proteomics, and outcome studies. However, the sheer volume of images and other data generated from even limited studies involving tissue microarrays quickly approach the processing capacity and resources of a division or department. This challenge is compounded by the fact that large-scale projects in several areas of modern research rely upon multi-institutional efforts in which investigators and resources are spread out over multiple campuses, cities, and states. To address some of the data management issues several leading institutions have begun to develop their own "in-house" systems, independently, but such data will be only minimally useful if it isn't accessible to others in the scientific community. Investigators at different institutions studying the same or related disorders might benefit from the synergy of sharing results. To facilitate sharing of TMA data across different database implementations, the Technical Standards Committee of the Association for Pathology Informatics organized workshops in efforts to establish a standardized TMA data exchange specification. The focus of our research does not relate to the establishment of standards for exchange, but rather builds on these efforts and concentrates on the design, development and deployment of a decentralized collaboratory for the unsupervised characterization, and seamless and secure discovery and sharing of TMA data. Specifically, we present a self-organizing, peer-to-peer indexing and discovery infrastructure for quantitatively assessing digitized TMA's. The system utilizes a novel, optimized decentralized search engine that supports flexible querying, while guaranteeing that once information has been stored in the system, it will be found with bounded costs.
组织微阵列技术(TMA)是一种相对较新的方法,可高效且经济地评估大量组织标本中的蛋白质和基因表达。组织微阵列技术在减少与组织库、蛋白质组学和结果研究相关的研究时间和成本方面具有巨大潜力。然而,即使是涉及组织微阵列的有限研究产生的大量图像和其他数据,也很快接近一个部门或科室的处理能力和资源。现代研究几个领域的大规模项目依赖多机构合作,研究人员和资源分布在多个校区、城市和州,这一事实使这一挑战更加复杂。为了解决一些数据管理问题,几个领先机构已开始独立开发自己的“内部”系统,但如果科学界的其他人无法访问这些数据,那么这些数据的用处将微乎其微。研究相同或相关疾病的不同机构的研究人员可能会从共享结果的协同效应中受益。为了促进跨不同数据库实现的TMA数据共享,病理学信息学协会技术标准委员会组织了研讨会,努力建立标准化的TMA数据交换规范。我们的研究重点并非建立交换标准,而是基于这些努力,专注于设计、开发和部署一个分散式协作平台,用于对TMA数据进行无监督特征描述、无缝安全地发现和共享。具体而言,我们提出了一种用于定量评估数字化TMA的自组织、对等索引和发现基础设施。该系统利用一种新颖、优化的分散式搜索引擎,支持灵活查询,同时保证一旦信息存储在系统中,就能以有限成本找到。