Improving data sharing in research with context-free encoded missing data.

Author information

Hoevenaar-Blom Marieke P, Guillemont Juliette, Ngandu Tiia, Beishuizen Cathrien R L, Coley Nicola, Moll van Charante Eric P, Andrieu Sandrine, Kivipelto Miia, Soininen Hilkka, Brayne Carol, Meiller Yannick, Richard Edo

Affiliations

Department of Neurology, Academic Medical Center, University of Amsterdam, Amsterdam, The Netherlands.

INSERM, University of Toulouse, Toulouse, France.

Publication information

PLoS One. 2017 Sep 12;12(9):e0182362. doi: 10.1371/journal.pone.0182362. eCollection 2017.

DOI:10.1371/journal.pone.0182362

PMID:28898245

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC5595279/

Abstract

Lack of attention to missing data in research may result in biased results, loss of power and reduced generalizability. Registering reasons for missing values at the time of data collection, or (in the case of sharing existing data) before making data available to other teams, can save time and effort, improve scientific value and help to prevent erroneous assumptions and biased results. To ensure that the encoding of missing data is sufficient to understand why data are missing, it should ideally be context-free. Therefore, 11 context-free codes for missing data were designed on the basis of three completed randomized controlled clinical trials and tested in a new randomized controlled clinical trial. The international team that developed them consisted of clinical researchers and epidemiologists with extensive experience in designing and conducting trials, together with an Information System expert. These codes can be divided into missing due to participant and/or participation characteristics (n = 6), missing by design (n = 4), and missing due to a procedural error (n = 1). Broad implementation of context-free missing data encoding may enhance the possibilities for data sharing and pooling, thus allowing more powerful analyses using existing data.
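
To make the idea of a context-free code concrete, the following minimal Python sketch shows how a shared dataset might carry the reason for missingness alongside the data. Only the three categories and their code counts (6, 4 and 1) come from the abstract. The numeric code values, their labels, and the names `EXAMPLE_CODES` and `summarize_missing` are hypothetical illustrations and are not the 11 codes defined in the paper. The sketch also assumes that the chosen sentinel values can never occur as real measurements.

```python
from collections import Counter
from enum import Enum


class MissingCategory(Enum):
    """The three groups of missing-data codes named in the abstract."""
    PARTICIPANT = "participant and/or participation characteristics"
    BY_DESIGN = "missing by design"
    PROCEDURAL_ERROR = "procedural error"


# Number of codes per category as reported in the abstract (6 + 4 + 1 = 11).
CODES_PER_CATEGORY = {
    MissingCategory.PARTICIPANT: 6,
    MissingCategory.BY_DESIGN: 4,
    MissingCategory.PROCEDURAL_ERROR: 1,
}

# HYPOTHETICAL code values and labels, for illustration only.
# The actual 11 codes are defined in the paper, not in the abstract.
EXAMPLE_CODES = {
    -101: ("participant declined to answer", MissingCategory.PARTICIPANT),
    -201: ("item not scheduled at this visit", MissingCategory.BY_DESIGN),
    -301: ("data entry skipped by mistake", MissingCategory.PROCEDURAL_ERROR),
}


def summarize_missing(values):
    """Count missing values per category; any other value counts as observed."""
    counts = Counter()
    for value in values:
        if value in EXAMPLE_CODES:
            _, category = EXAMPLE_CODES[value]
            counts[category] += 1
    return counts


# A made-up column of systolic blood pressure readings with encoded gaps.
systolic_bp = [128, -101, 135, -201, -201, 142, -301]
summary = summarize_missing(systolic_bp)
for category, n in summary.items():
    print(f"{category.value}: {n}")
```

Because each code carries its own meaning, a team receiving the pooled data could tell a refusal from a skipped visit without consulting the original study documentation, which is the property the abstract calls context-free.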

Cited by

Pooling individual participant data from randomized controlled trials: Exploring potential loss of information.

PLoS One. 2020 May 12;15(5):e0232970. doi: 10.1371/journal.pone.0232970. eCollection 2020.
