University Information Technology Services, Indiana University, 535 W. Michigan St., Indianapolis, IN 46202, USA.
Alcohol. 2010 Nov-Dec;44(7-8):643-7. doi: 10.1016/j.alcohol.2009.08.007. Epub 2009 Dec 29.
Many previous attempts by fetal alcohol spectrum disorders researchers to compare data across multiple prospective and retrospective human studies have failed because of both structural differences in the collected data and difficulty in coming to agreement on the precise meaning of the terminology used to describe the collected data. Although some groups of researchers have an established track record of successfully integrating data, attempts to integrate data more broadly among different groups of researchers have generally faltered. Lack of tools to help researchers share and integrate data has also hampered data analysis. This situation has delayed improving diagnosis, intervention, and treatment before and after birth. We worked with various researchers and research programs in the Collaborative Initiative on Fetal Alcohol Spectrum Disorders (CI-FASD) to develop a set of common data dictionaries to describe the data to be collected, including definitions of terms and specification of allowable values. The resulting data dictionaries were the basis for creating a central data repository (CI-FASD Central Repository) and software tools to input and query data. Data entry restrictions ensure that only data that conform to the data dictionaries reach the CI-FASD Central Repository. The result is an effective system for centralized and unified management of the data collected and analyzed by the initiative, including a secure, long-term data repository. CI-FASD researchers are able to integrate and analyze data of different types, using multiple methods, and collected from multiple populations, and data are retained for future reuse in a secure, robust repository.
许多胎儿酒精谱系障碍研究人员之前试图比较多个前瞻性和回顾性人类研究的数据,但都因所收集数据的结构差异以及难以就用于描述所收集数据的术语的确切含义达成一致而失败。尽管一些研究人员小组在成功整合数据方面有可靠的记录,但在不同研究人员小组之间更广泛地整合数据的尝试通常都失败了。缺乏帮助研究人员共享和整合数据的工具也阻碍了数据分析。这种情况延误了在出生前后改善诊断、干预和治疗。我们与胎儿酒精谱系障碍协作倡议(CI-FASD)中的各个研究人员和研究计划合作,制定了一套通用数据字典来描述要收集的数据,包括术语的定义和允许值的规范。由此产生的数据字典是创建中央数据存储库(CI-FASD 中央存储库)和用于输入和查询数据的软件工具的基础。数据输入限制确保只有符合数据字典的数据才能进入 CI-FASD 中央存储库。其结果是建立了一个有效的系统,用于集中和统一管理该倡议收集和分析的数据,包括一个安全的长期数据存储库。CI-FASD 研究人员能够使用多种方法整合和分析来自不同人群的不同类型的数据,并将数据保留在安全、强大的存储库中以备将来重复使用。