Department of Information and Library Science, Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, Indiana, US.
Sci Data. 2022 Jun 17;9(1):345. doi: 10.1038/s41597-022-01428-w.
Data sharing can accelerate scientific discovery while increasing return on investment beyond the researcher or group that produced them. Data repositories enable data sharing and preservation over the long term, but little is known about scientists' perceptions of them and their perspectives on data management and sharing practices. Using focus groups with scientists from five disciplines (atmospheric and earth science, computer science, chemistry, ecology, and neuroscience), we asked questions about data management to lead into a discussion of what features they think are necessary to include in data repository systems and services to help them implement the data sharing and preservation parts of their data management plans. Participants identified metadata quality control and training as problem areas in data management. Additionally, participants discussed several desired repository features, including: metadata control, data traceability, security, stable infrastructure, and data use restrictions. We present their desired repository features as a rubric for the research community to encourage repository utilization. Future directions for research are discussed.
数据共享可以加速科学发现,同时提高产生数据的研究人员或团体的投资回报。数据存储库能够长期实现数据共享和保存,但人们对科学家对它们的看法以及他们对数据管理和共享实践的看法知之甚少。我们使用来自五个学科(大气和地球科学、计算机科学、化学、生态学和神经科学)的科学家的焦点小组,询问了有关数据管理的问题,以讨论他们认为数据存储库系统和服务中必须包含哪些功能,以帮助他们实施数据共享和保存部分数据管理计划。参与者确定元数据质量控制和培训是数据管理中的问题领域。此外,参与者还讨论了一些所需的存储库功能,包括:元数据控制、数据可追溯性、安全性、稳定的基础设施和数据使用限制。我们将他们期望的存储库功能作为研究界的一个标准,以鼓励存储库的使用。还讨论了未来的研究方向。