临床试验中的数据共享——试验数据集匿名化实用指南。

Data sharing in clinical trials - practical guidance on anonymising trial datasets.

作者信息

Keerie Catriona, Tuck Christopher, Milne Garry, Eldridge Sandra, Wright Neil, Lewis Steff C

机构信息

Edinburgh Clinical Trials Unit, Usher Institute of Population Health Sciences and Informatics, University of Edinburgh, Nine Bioquarter, 9 Little France Road, Edinburgh, EH16 4UX, UK.

Queen Mary University of London, London, UK.

出版信息

Trials. 2018 Jan 10;19(1):25. doi: 10.1186/s13063-017-2382-9.

DOI:10.1186/s13063-017-2382-9

PMID:29321053

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5763739/

Abstract

BACKGROUND

There is an increasing demand by non-commercial funders that trialists should provide access to trial data once the primary analysis is completed. This has to take into account concerns about identifying individual trial participants, and the legal and regulatory requirements.

METHODS

Using the good practice guideline laid out by the work funded by the Medical Research Council Hubs for Trials Methodology Research (MRC HTMR), we anonymised a dataset from a recently completed trial. Using this example, we present practical guidance on how to anonymise a dataset, and describe rules that could be used on other trial datasets. We describe how these might differ if the trial was to be made freely available to all, or if the data could only be accessed with specific permission and data usage agreements in place.

RESULTS

Following the good practice guidelines, we successfully created a controlled access model for trial data sharing. The data were assessed on a case-by-case basis classifying variables as direct, indirect and superfluous identifiers with differing methods of anonymisation assigned depending on the type of identifier. A final dataset was created and checks of the anonymised dataset were applied. Lastly, a procedure for release of the data was implemented to complete the process.

CONCLUSIONS

We have implemented a practical solution to the data anonymisation process resulting in a bespoke anonymised dataset for a recently completed trial. We have gained useful learnings in terms of efficiency of the process going forward, the need to balance anonymity with data utilisation and future work that should be undertaken.

摘要

背景

非商业资助者对试验者的需求日益增加，要求他们在完成主要分析后提供试验数据的访问权限。这必须考虑到对识别个体试验参与者的担忧以及法律和监管要求。

方法

利用医学研究理事会试验方法研究中心（MRC HTMR）资助的工作所制定的良好实践指南，我们对最近完成的一项试验的数据集进行了匿名化处理。以这个例子为基础，我们提供了关于如何对数据集进行匿名化处理的实用指南，并描述了可用于其他试验数据集的规则。我们描述了如果试验要向所有人免费提供，或者数据只能在有特定许可和数据使用协议的情况下访问，这些规则可能会有何不同。

结果

遵循良好实践指南，我们成功创建了一个试验数据共享的受控访问模型。根据具体情况对数据进行评估，将变量分类为直接标识符、间接标识符和多余标识符，并根据标识符的类型采用不同的匿名化方法。创建了最终数据集，并对匿名化数据集进行了检查。最后，实施了数据发布程序以完成整个过程。

结论

我们为数据匿名化过程实施了一个切实可行的解决方案，为最近完成的一项试验生成了一个定制的匿名数据集。我们在该过程的效率、平衡匿名性与数据利用的必要性以及未来应开展的工作方面获得了有益的经验教训。

相似文献

Data sharing in clinical trials - practical guidance on anonymising trial datasets.临床试验中的数据共享——试验数据集匿名化实用指南。

Trials. 2018 Jan 10;19(1):25. doi: 10.1186/s13063-017-2382-9.

Current recommendations/practices for anonymising data from clinical trials in order to make it available for sharing: A scoping review.当前为了使临床试验数据可供共享而对其进行匿名化的建议/实践：范围综述。

Clin Trials. 2022 Aug;19(4):452-463. doi: 10.1177/17407745221087469. Epub 2022 Jun 22.

Resource implications of preparing individual participant data from a clinical trial to share with external researchers.为与外部研究人员共享而准备来自临床试验的个体参与者数据所涉及的资源问题。

Trials. 2017 Jul 17;18(1):319. doi: 10.1186/s13063-017-2067-4.

How should individual participant data (IPD) from publicly funded clinical trials be shared?来自公共资助临床试验的个体参与者数据（IPD）应如何共享？

BMC Med. 2015 Dec 17;13:298. doi: 10.1186/s12916-015-0532-z.

Protecting patient privacy when sharing patient-level data from clinical trials.在共享临床试验中患者层面的数据时保护患者隐私。

BMC Med Res Methodol. 2016 Jul 8;16 Suppl 1(Suppl 1):77. doi: 10.1186/s12874-016-0169-4.

The project data sphere initiative: accelerating cancer research by sharing data.项目数据领域计划：通过数据共享加速癌症研究

Oncologist. 2015 May;20(5):464-e20. doi: 10.1634/theoncologist.2014-0431. Epub 2015 Apr 15.

Preventing Unintended Disclosure of Personally Identifiable Data Following Anonymisation.防止匿名化后个人身份信息的意外泄露。

Stud Health Technol Inform. 2017;235:313-317.

Preparing individual patient data from clinical trials for sharing: the GlaxoSmithKline approach.准备来自临床试验的个体患者数据以供共享：葛兰素史克公司的方法。

Pharm Stat. 2014 May-Jun;13(3):179-83. doi: 10.1002/pst.1615. Epub 2014 Mar 25.

Optimizing the synthesis of clinical trial data using sequential trees.使用序贯树优化临床试验数据的合成

J Am Med Inform Assoc. 2021 Jan 15;28(1):3-13. doi: 10.1093/jamia/ocaa249.

A Global, Neutral Platform for Sharing Trial Data.一个用于共享试验数据的全球中立平台。

N Engl J Med. 2016 Jun 23;374(25):2411-3. doi: 10.1056/NEJMp1605348. Epub 2016 May 11.

引用本文的文献

Efficacy of peroneal nerve functional electrical stimulation (FES) for the reduction of bradykinesia in Parkinson's disease: an assessor-blinded randomised controlled trial (STEPS II)-study protocol.腓总神经功能性电刺激（FES）对减轻帕金森病运动迟缓的疗效：一项评估者盲法随机对照试验（STEPS II）——研究方案

BMJ Open. 2025 Sep 5;15(9):e097010. doi: 10.1136/bmjopen-2024-097010.

A survey on UK researchers' views regarding their experiences with the de-identification, anonymisation, release methods and re-identification risk estimation for clinical trial datasets.一项关于英国研究人员对临床试验数据集的去识别化、匿名化、发布方法及重新识别风险评估经验的看法的调查。

Clin Trials. 2025 Feb;22(1):11-23. doi: 10.1177/17407745241259086. Epub 2024 Jun 19.

Sharing sensitive data in life sciences: an overview of centralized and federated approaches.生命科学领域中敏感数据的共享：集中式和联邦式方法概述。

Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae262.

The Costs of Anonymization: Case Study Using Clinical Data.匿名化的成本：使用临床数据的案例研究

J Med Internet Res. 2024 Apr 24;26:e49445. doi: 10.2196/49445.

The effect of a cognitive training therapy based on stimulation of brain oscillations in patients with mild cognitive impairment in a Chilean sample: study protocol for a phase IIb, 2 × 3 mixed factorial, double-blind randomised controlled trial.基于脑振荡刺激的认知训练疗法对智利轻度认知障碍患者的影响：一项 2×3 混合因子、双盲随机对照二期临床试验研究方案。

Trials. 2024 Feb 23;25(1):144. doi: 10.1186/s13063-024-07972-7.

Real-world evidence to advance knowledge in pulmonary hypertension: Status, challenges, and opportunities. A consensus statement from the Pulmonary Vascular Research Institute's Innovative Drug Development Initiative's Real-world Evidence Working Group.推进肺动脉高压知识的真实世界证据：现状、挑战与机遇。肺血管研究所创新药物开发倡议真实世界证据工作组的共识声明。

Pulm Circ. 2023 Dec 21;13(4):e12317. doi: 10.1002/pul2.12317. eCollection 2023 Oct.

A 10-year update to the principles for clinical trial data sharing by pharmaceutical companies: perspectives based on a decade of literature and policies.制药公司临床试验数据共享原则的 10 年更新：基于十年文献和政策的观点。

BMC Med. 2023 Oct 23;21(1):400. doi: 10.1186/s12916-023-03113-0.

Qualitative data sharing practices in clinical trials in the UK and Ireland: towards the production of good practice guidance.英国和爱尔兰临床试验中的定性数据共享实践：迈向良好实践指南的制定

HRB Open Res. 2023 Feb 6;6:10. doi: 10.12688/hrbopenres.13667.1. eCollection 2023.

Ten simple rules for designing and conducting undergraduate replication projects.设计和开展本科生复制项目的 10 个简单规则。

PLoS Comput Biol. 2023 Mar 16;19(3):e1010957. doi: 10.1371/journal.pcbi.1010957. eCollection 2023 Mar.

A randomised controlled trial to investigate the clinical effectiveness and cost effectiveness of Mindfulness-Based Cognitive Therapy (MBCT) for depressed non-responders to Increasing Access to Psychological Therapies (IAPT) high-intensity therapies: study protocol.一项随机对照试验，旨在调查正念认知疗法（MBCT）对接受增加心理治疗机会（IAPT）高强度治疗后仍未缓解的抑郁症患者的临床效果和成本效果：研究方案。

Trials. 2023 Jan 19;24(1):43. doi: 10.1186/s13063-022-06882-w.

本文引用的文献

Trials. 2017 Jul 17;18(1):319. doi: 10.1186/s13063-017-2067-4.

Mercaptopurine versus placebo to prevent recurrence of Crohn's disease after surgical resection (TOPPIC): a multicentre, double-blind, randomised controlled trial.巯嘌呤与安慰剂预防克罗恩病手术后复发的疗效比较（TOPPIC）：一项多中心、双盲、随机对照试验。

Lancet Gastroenterol Hepatol. 2016 Dec;1(4):273-282. doi: 10.1016/S2468-1253(16)30078-4. Epub 2016 Aug 30.

Public responses to the sharing and linkage of health data for research purposes: a systematic review and thematic synthesis of qualitative studies.公众对用于研究目的的健康数据共享和关联的反应：定性研究的系统评价与主题综合

BMC Med Ethics. 2016 Nov 10;17(1):73. doi: 10.1186/s12910-016-0153-x.

How should individual participant data (IPD) from publicly funded clinical trials be shared?来自公共资助临床试验的个体参与者数据（IPD）应如何共享？

BMC Med. 2015 Dec 17;13:298. doi: 10.1186/s12916-015-0532-z.

Bronchiolitis of Infancy Discharge Study (BIDS): a multicentre, parallel-group, double-blind, randomised controlled, equivalence trial with economic evaluation.婴儿细支气管炎出院研究（BIDS）：一项多中心、平行组、双盲、随机对照、等效性试验及经济学评估。

Health Technol Assess. 2015 Sep;19(71):i-xxiii, 1-172. doi: 10.3310/hta19710.

Sharing data from clinical trials: the rationale for a controlled access approach.分享临床试验数据：采用受控访问方法的基本原理。

Trials. 2015 Mar 23;16:104. doi: 10.1186/s13063-015-0604-6.

GaPP: a pilot randomised controlled trial of the efficacy of action of gabapentin for the management of chronic pelvic pain in women: study protocol.加巴喷丁治疗女性慢性盆腔疼痛疗效的初步随机对照试验：研究方案

BMJ Open. 2012 Jun 8;2(3). doi: 10.1136/bmjopen-2012-001297. Print 2012.

Preparing raw clinical data for publication: guidance for journal editors, authors, and peer reviewers.准备原始临床数据用于发表：期刊编辑、作者和同行评审者指南。

BMJ. 2010 Jan 28;340:c181. doi: 10.1136/bmj.c181.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验