Müller Thomas H
Institute for Medical Informatics, Biometry and Epidemiology, University of Munich, Munich, Germany.
Stud Health Technol Inform. 2012;180:524-8.
Separating patient identification data from clinical data and/or information about biomaterial samples is an effective data protection measure, especially in clinical research employing "on-line", i.e., web-based, data capture. In this paper, we show that this specialised technique can be generalised into a network architecture of interconnected on-line databases potentially serving a variety of purposes. The basic idea of this approach consists of maintaining logical links, i.e., common record keys, between corresponding data structures in pairs of databases while keeping the actual key values hidden from clients. For client systems, simultaneous access to corresponding records is mediated by temporary access tokens. At the relational level, these links are represented by arbitrary unique record keys common to both databases. This architecture allows for integration of related data in different databases without replicating or permanently sharing this data in one place. Each participating on-line database can determine the degree of integration by specifying linkage keys only for those data structures that may be logically connected to other data. Logical links can de designed for specific use cases. In addition, each database controls user access by enforcing its own authorisation scheme. Another advantage is that individual database owners retain considerable leeway in adapting to changing local requirements without compromising the integration into the network. Beyond protecting individual subject identification data, this architecture permits splitting a cooperatively used data pool to achieve many kinds of objectives. Application examples could be clinical registries needing subject contact information for follow-up, biomaterial banks with or without genetic information, and automatic or assisted integration of data from electronic medical records into research data.
将患者识别数据与临床数据和/或生物材料样本信息分开是一种有效的数据保护措施,尤其在采用“在线”(即基于网络)数据采集的临床研究中。在本文中,我们表明这种专门技术可以推广到一个相互连接的在线数据库网络架构中,该架构可能服务于多种目的。这种方法的基本思想是在数据库对中的相应数据结构之间保持逻辑链接,即通用记录键,同时对客户端隐藏实际键值。对于客户端系统,通过临时访问令牌来介导对相应记录的同时访问。在关系层面,这些链接由两个数据库共有的任意唯一记录键表示。这种架构允许在不同数据库中集成相关数据,而无需在一处复制或永久共享这些数据。每个参与的在线数据库可以通过仅为那些可能与其他数据逻辑连接的数据结构指定链接键来确定集成程度。逻辑链接可以针对特定用例进行设计。此外,每个数据库通过执行自己的授权方案来控制用户访问。另一个优点是,各个数据库所有者在适应不断变化的本地需求方面有很大的回旋余地,而不会影响融入网络。除了保护个体受试者识别数据外,这种架构还允许分割合作使用的数据集以实现多种目标。应用示例可以是需要受试者联系信息进行随访的临床登记处、有或没有基因信息的生物样本库,以及将电子病历数据自动或辅助集成到研究数据中。