DuBois James M, Mozersky Jessica, Parsons Meredith, Walsh Heidi A, Friedrich Annie, Pienta Amy
Bioethics Research Center, Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110.
ICPSR, Institute for Social Research, University of Michigan, Ann Arbor, MI 48106.
Proc Natl Acad Sci U S A. 2023 Oct 24;120(43):e2206981120. doi: 10.1073/pnas.2206981120. Epub 2023 Oct 13.
In January 2023, a new NIH policy on data sharing went into effect. The policy applies to both quantitative and qualitative research (QR) data such as data from interviews or focus groups. QR data are often sensitive and difficult to deidentify, and thus have rarely been shared in the United States. Over the past 5 y, our research team has engaged stakeholders on QR data sharing, developed software to support data deidentification, produced guidance, and collaborated with the ICPSR data repository to pilot the deposit of 30 QR datasets. In this perspective article, we share important lessons learned by addressing eight clusters of questions on issues such as where, when, and what to share; how to deidentify data and support high-quality secondary use; budgeting for data sharing; and the permissions needed to share data. We also offer a brief assessment of the state of preparedness of data repositories, QR journals, and QR textbooks to support data sharing. While QR data sharing could yield important benefits to the research community, we quickly need to develop enforceable standards, expertise, and resources to support responsible QR data sharing. Absent these resources, we risk violating participant confidentiality and wasting a significant amount of time and funding on data that are not useful for either secondary use or data transparency and verification.
2023年1月,美国国立卫生研究院(NIH)一项关于数据共享的新政策生效。该政策适用于定量和定性研究(QR)数据,如访谈或焦点小组的数据。QR数据通常很敏感,难以去识别化,因此在美国很少被共享。在过去5年里,我们的研究团队与利益相关者就QR数据共享进行了接触,开发了支持数据去识别化的软件,制定了指南,并与校际政治与社会研究联盟(ICPSR)数据存储库合作,试点存入了30个QR数据集。在这篇观点文章中,我们分享了通过解决八个问题组所学到的重要经验教训,这些问题组涉及诸如在哪里、何时以及分享什么;如何对数据进行去识别化并支持高质量的二次使用;数据共享的预算;以及共享数据所需的权限等问题。我们还简要评估了数据存储库、QR期刊和QR教科书在支持数据共享方面的准备情况。虽然QR数据共享可能会给研究界带来重要益处,但我们迫切需要制定可执行的标准、专业知识和资源,以支持负责任的QR数据共享。如果缺乏这些资源,我们就有可能侵犯参与者的隐私,并在对二次使用或数据透明度及验证均无用处的数据上浪费大量时间和资金。