质量、数量和协调：DataSHaPER 方法在生物临床研究中整合数据。

Quality, quantity and harmony: the DataSHaPER approach to integrating data across bioclinical studies.

机构信息

Public Population Project in Genomics (P³G), Montreal, QC, Canada.

出版信息

Int J Epidemiol. 2010 Oct;39(5):1383-93. doi: 10.1093/ije/dyq139. Epub 2010 Sep 2.

DOI:10.1093/ije/dyq139

PMID:20813861

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2972444/

Abstract

BACKGROUND

Vast sample sizes are often essential in the quest to disentangle the complex interplay of the genetic, lifestyle, environmental and social factors that determine the aetiology and progression of chronic diseases. The pooling of information between studies is therefore of central importance to contemporary bioscience. However, there are many technical, ethico-legal and scientific challenges to be overcome if an effective, valid, pooled analysis is to be achieved. Perhaps most critically, any data that are to be analysed in this way must be adequately 'harmonized'. This implies that the collection and recording of information and data must be done in a manner that is sufficiently similar in the different studies to allow valid synthesis to take place.

METHODS

This conceptual article describes the origins, purpose and scientific foundations of the DataSHaPER (DataSchema and Harmonization Platform for Epidemiological Research; http://www.datashaper.org), which has been created by a multidisciplinary consortium of experts that was pulled together and coordinated by three international organizations: P³G (Public Population Project in Genomics), PHOEBE (Promoting Harmonization of Epidemiological Biobanks in Europe) and CPT (Canadian Partnership for Tomorrow Project).

RESULTS

The DataSHaPER provides a flexible, structured approach to the harmonization and pooling of information between studies. Its two primary components, the 'DataSchema' and 'Harmonization Platforms', together support the preparation of effective data-collection protocols and provide a central reference to facilitate harmonization. The DataSHaPER supports both 'prospective' and 'retrospective' harmonization.

CONCLUSION

It is hoped that this article will encourage readers to investigate the project further: the more the research groups and studies are actively involved, the more effective the DataSHaPER programme will ultimately be.

摘要

背景

在探索决定慢性病病因和进展的遗传、生活方式、环境和社会因素的复杂相互作用时，通常需要大量样本。因此，研究之间的信息汇集对于当代生物科学至关重要。然而，如果要实现有效的、有效的、汇集的分析，还需要克服许多技术、伦理法律和科学挑战。也许最重要的是，任何要以这种方式分析的数据都必须进行充分的“协调”。这意味着信息和数据的收集和记录必须以在不同研究中足够相似的方式进行，以允许进行有效的综合。

方法

本文通过多学科专家组成的联盟创建了 DataSHaPER（流行病学研究的数据模式和协调平台；http://www.datashaper.org），描述了 DataSHaPER 的起源、目的和科学基础。该联盟由三个国际组织：P³G（公共人口基因组计划）、PHOEBE（促进欧洲流行病学生物库协调）和 CPT（加拿大明天项目伙伴关系）共同召集和协调。

结果

DataSHaPER 为研究之间的信息协调和汇集提供了一种灵活、结构化的方法。它的两个主要组成部分，“DataSchema”和“Harmonization Platforms”，共同支持有效的数据收集协议的制定，并提供中央参考以促进协调。DataSHaPER 既支持“前瞻性”又支持“回溯性”协调。

结论

希望本文能鼓励读者进一步研究该项目：研究小组和研究参与得越多，DataSHaPER 计划最终就会越有效。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/2972444/5ef5c6a213b5/dyq139f1.jpg

相似文献

Quality, quantity and harmony: the DataSHaPER approach to integrating data across bioclinical studies.

Int J Epidemiol. 2010 Oct;39(5):1383-93. doi: 10.1093/ije/dyq139. Epub 2010 Sep 2.

Is rigorous retrospective harmonization possible? Application of the DataSHaPER approach across 53 large studies.

Int J Epidemiol. 2011 Oct;40(5):1314-28. doi: 10.1093/ije/dyr106. Epub 2011 Jul 30.

DataSHIELD: resolving a conflict in contemporary bioscience--performing a pooled analysis of individual-level data without sharing the data.

Int J Epidemiol. 2010 Oct;39(5):1372-82. doi: 10.1093/ije/dyq111. Epub 2010 Jul 14.

Harmonization of the Health and Risk Factor Questionnaire data of the Canadian Partnership for Tomorrow Project: a descriptive analysis.

CMAJ Open. 2019 Apr 23;7(2):E272-E282. doi: 10.9778/cmajo.20180062. Print 2019 Apr-Jun.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

The future of Cochrane Neonatal.

Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

Development and use of a flexible data harmonization platform to facilitate the harmonization of individual patient data for meta-analyses.

BMC Res Notes. 2019 Mar 22;12(1):164. doi: 10.1186/s13104-019-4210-7.

Maelstrom Research guidelines for rigorous retrospective data harmonization.

Int J Epidemiol. 2017 Feb 1;46(1):103-105. doi: 10.1093/ije/dyw075.

Toward Rigorous Data Harmonization in Cancer Epidemiology Research: One Approach.

Am J Epidemiol. 2015 Dec 15;182(12):1033-8. doi: 10.1093/aje/kwv133. Epub 2015 Nov 20.

The NCI All Ireland Cancer Conference.

Oncologist. 1999;4(4):275-277.

引用本文的文献

DataSHIELD: mitigating disclosure risk in a multi-site federated analysis platform.

Bioinform Adv. 2025 Mar 10;5(1):vbaf046. doi: 10.1093/bioadv/vbaf046. eCollection 2025.

Harmonization of SDQ and ASEBA Phenotypes: Measurement Variance Across Cohorts.

J Psychopathol Behav Assess. 2025;47(1):27. doi: 10.1007/s10862-025-10204-0. Epub 2025 Mar 7.

Prospective harmonisation of four international randomised controlled trials in Canada, China, India and South Africa: the Healthy Life Trajectories Initiative.

BMJ Open. 2025 Mar 3;15(3):e086233. doi: 10.1136/bmjopen-2024-086233.

Is maternal diabetes during pregnancy associated with neurodevelopmental, cognitive and behavioural outcomes in children? Insights from individual participant data meta-analysis in ten birth cohorts.

BMC Pediatr. 2025 Jan 30;25(1):76. doi: 10.1186/s12887-024-05365-y.

Gender differences in the association between adherence to healthy diet principles and adherence to cardiopreventive medication among adults from Québec (Canada).

Br J Nutr. 2025 Jan 16;133(3):1-11. doi: 10.1017/S0007114525000030.

Investigating a Domain Adaptation Approach for Integrating Different Measurement Instruments in a Longitudinal Clinical Registry.

Biom J. 2025 Feb;67(1):e70023. doi: 10.1002/bimj.70023.

Harmonizing the CBCL and SDQ ADHD scores by using linear equating, kernel equating, item response theory and machine learning methods.

Front Psychol. 2024 Jul 10;15:1345406. doi: 10.3389/fpsyg.2024.1345406. eCollection 2024.

Folate intake and colorectal cancer risk according to genetic subtypes defined by targeted tumor sequencing.

Am J Clin Nutr. 2024 Sep;120(3):664-673. doi: 10.1016/j.ajcnut.2024.07.012. Epub 2024 Jul 16.

Pioneering a multi-phase framework to harmonize self-reported sleep data across cohorts.

Sleep. 2024 Sep 9;47(9). doi: 10.1093/sleep/zsae115.

Diet patterns associated with cognitive decline: methods to harmonize data from European and US cohort studies.

Front Nutr. 2024 Mar 21;11:1379531. doi: 10.3389/fnut.2024.1379531. eCollection 2024.

本文引用的文献

GENOTYPE-ENVIRONMENT INTERACTION AND THE EVOLUTION OF PHENOTYPIC PLASTICITY.

Evolution. 1985 May;39(3):505-522. doi: 10.1111/j.1558-5646.1985.tb00391.x.

DataSHIELD: resolving a conflict in contemporary bioscience--performing a pooled analysis of individual-level data without sharing the data.

Int J Epidemiol. 2010 Oct;39(5):1372-82. doi: 10.1093/ije/dyq111. Epub 2010 Jul 14.

The Canadian Partnership for Tomorrow Project: building a pan-Canadian research platform for disease prevention.

CMAJ. 2010 Aug 10;182(11):1197-201. doi: 10.1503/cmaj.091540. Epub 2010 Apr 26.

Case-control study of overweight, obesity, and colorectal cancer risk, overall and by tumor microsatellite instability status.

J Natl Cancer Inst. 2010 Mar 17;102(6):391-400. doi: 10.1093/jnci/djq011. Epub 2010 Mar 5.

PhenX: a toolkit for interdisciplinary genetics research.

Curr Opin Lipidol. 2010 Apr;21(2):136-40. doi: 10.1097/MOL.0b013e3283377395.

The Gene, Environment Association Studies consortium (GENEVA): maximizing the knowledge obtained from GWAS by collaboration across studies of multiple conditions.

Genet Epidemiol. 2010 May;34(4):364-72. doi: 10.1002/gepi.20492.

Genome-wide association study identifies five loci associated with lung function.

Nat Genet. 2010 Jan;42(1):36-44. doi: 10.1038/ng.501. Epub 2009 Dec 13.

Thinking big: large-scale collaborative research in observational epidemiology.

Eur J Epidemiol. 2009;24(12):727-31. doi: 10.1007/s10654-009-9412-1. Epub 2009 Dec 5.

The Canadian longitudinal study on aging (CLSA).

Can J Aging. 2009 Sep;28(3):221-9. doi: 10.1017/S0714980809990055.

Evolution of the Global Tobacco Surveillance System (GTSS) 1998-2008.

Glob Health Promot. 2009 Sep;16(2 Suppl):4-37. doi: 10.1177/1757975909342181.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

质量、数量和协调：DataSHaPER 方法在生物临床研究中整合数据。

Quality, quantity and harmony: the DataSHaPER approach to integrating data across bioclinical studies.

机构信息

Public Population Project in Genomics (P³G), Montreal, QC, Canada.

出版信息

Int J Epidemiol. 2010 Oct;39(5):1383-93. doi: 10.1093/ije/dyq139. Epub 2010 Sep 2.

DOI:10.1093/ije/dyq139

PMID:20813861

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2972444/

Abstract

BACKGROUND

METHODS

RESULTS

CONCLUSION

摘要

背景

方法

结果

结论

希望本文能鼓励读者进一步研究该项目：研究小组和研究参与得越多，DataSHaPER 计划最终就会越有效。

质量、数量和协调：DataSHaPER 方法在生物临床研究中整合数据。

Quality, quantity and harmony: the DataSHaPER approach to integrating data across bioclinical studies.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

质量、数量和协调：DataSHaPER 方法在生物临床研究中整合数据。

Quality, quantity and harmony: the DataSHaPER approach to integrating data across bioclinical studies.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献