Miller Kate M, Flack Felicity S, Smith Merran B, Bennett Vicki, Marshall Carina Ecremen
Population Health Research Network, University of Western Australia, Crawley, Western Australia, Australia.
Australian Institute of Health and Welfare, Canberra, Australian Capital Territory, Australia.
Int J Popul Data Sci. 2025 Apr 30;10(1):2461. doi: 10.23889/ijpds.v10i1.2461. eCollection 2025.
Metadata plays a crucial role in the health research infrastructure ecosystem. Despite the abundance of metadata for data collections in Australia, the vast and diverse data custodian landscape poses challenges for linked data researchers to find relevant information for multiple data collections, often making it an arduous and time-intensive task.
The project comprised three phases: an initial scoping exercise to understand the current state of metadata and related best practice; a national consultation involving researchers, data linkage staff and data custodians to develop a high-fidelity prototype of a metadata platform; and a final build and implementation phase. The platform underwent several prototyping and testing cycles to refine the digital experience.
Expert interviews confirmed that there is a wealth of metadata available, but it is difficult for researchers to access and evaluate. Consultations with researchers identified opportunities to standardise metadata across collections and provide a centralised platform to enhance the discoverability of data collections for research using linked data. High value platform features included searching, browsing and filtering capabilities, data item list metadata, standardised formats, sample data, and frequently asked questions. The final design and functionality reflected user consultations and data custodian input on feasibility.
The Population Health Research Network developed a metadata platform to enable researchers to evaluate the suitability of Australian data collections for linked data projects more effectively. The platform has standardised the way in which metadata is presented for data collections nationally. Improved metadata quality, readability and accessibility will save time and enhance the quality of applications for linked data.
元数据在健康研究基础设施生态系统中发挥着关键作用。尽管澳大利亚的数据收集有丰富的元数据,但庞大且多样的数据保管格局给关联数据研究人员查找多个数据收集的相关信息带来了挑战,这往往使其成为一项艰巨且耗时的任务。
该项目包括三个阶段:初步范围界定活动,以了解元数据的当前状态和相关最佳实践;全国性咨询,涉及研究人员、数据关联人员和数据保管人,以开发元数据平台的高保真原型;以及最后的构建和实施阶段。该平台经历了多个原型设计和测试周期,以优化数字体验。
专家访谈证实有大量可用的元数据,但研究人员难以获取和评估。与研究人员的咨询确定了跨数据集标准化元数据并提供一个集中平台以提高使用关联数据进行研究的数据收集的可发现性的机会。高价值的平台功能包括搜索、浏览和筛选功能、数据项列表元数据、标准化格式、示例数据和常见问题解答。最终的设计和功能反映了用户咨询和数据保管人对可行性的意见。
人口健康研究网络开发了一个元数据平台,以使研究人员能够更有效地评估澳大利亚数据收集对关联数据项目的适用性。该平台已在全国范围内对数据收集的元数据呈现方式进行了标准化。改进后的元数据质量、可读性和可访问性将节省时间并提高关联数据应用程序的质量。