• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

严格的回顾性协调是否可行?DataSHaPER 方法在 53 项大型研究中的应用。

Is rigorous retrospective harmonization possible? Application of the DataSHaPER approach across 53 large studies.

机构信息

Research Institute - McGill University Health Centre, Montreal, Quebec, Canada.

出版信息

Int J Epidemiol. 2011 Oct;40(5):1314-28. doi: 10.1093/ije/dyr106. Epub 2011 Jul 30.

DOI:10.1093/ije/dyr106
PMID:21804097
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3204208/
Abstract

BACKGROUND

Proper understanding of the roles of, and interactions between genetic, lifestyle, environmental and psycho-social factors in determining the risk of development and/or progression of chronic diseases requires access to very large high-quality databases. Because of the financial, technical and time burdens related to developing and maintaining very large studies, the scientific community is increasingly synthesizing data from multiple studies to construct large databases. However, the data items collected by individual studies must be inferentially equivalent to be meaningfully synthesized. The DataSchema and Harmonization Platform for Epidemiological Research (DataSHaPER; http://www.datashaper.org) was developed to enable the rigorous assessment of the inferential equivalence, i.e. the potential for harmonization, of selected information from individual studies.

METHODS

This article examines the value of using the DataSHaPER for retrospective harmonization of established studies. Using the DataSHaPER approach, the potential to generate 148 harmonized variables from the questionnaires and physical measures collected in 53 large population-based studies (6.9 million participants) was assessed. Variable and study characteristics that might influence the potential for data synthesis were also explored.

RESULTS

Out of all assessment items evaluated (148 variables for each of the 53 studies), 38% could be harmonized. Certain characteristics of variables (i.e. relative importance, individual targeted, reference period) and of studies (i.e. observational units, data collection start date and mode of questionnaire administration) were associated with the potential for harmonization. For example, for variables deemed to be essential, 62% of assessment items paired could be harmonized.

CONCLUSION

The current article shows that the DataSHaPER provides an effective and flexible approach for the retrospective harmonization of information across studies. To implement data synthesis, some additional scientific, ethico-legal and technical considerations must be addressed. The success of the DataSHaPER as a harmonization approach will depend on its continuing development and on the rigour and extent of its use. The DataSHaPER has the potential to take us closer to a truly collaborative epidemiology and offers the promise of enhanced research potential generated through synthesized databases.

摘要

背景

要正确理解遗传、生活方式、环境和心理社会因素在决定慢性病的发生和/或发展风险方面的作用和相互关系,需要访问非常大的高质量数据库。由于开发和维护大型研究的财务、技术和时间负担,科学界越来越多地综合来自多个研究的数据,以构建大型数据库。然而,各个研究收集的数据项必须具有可推断的等效性,才能进行有意义的综合。流行病学研究的数据模式和协调平台(DataSchema and Harmonization Platform for Epidemiological Research,DataSHaPER;http://www.datashaper.org)旨在对个体研究中选定信息的推断等效性(即协调潜力)进行严格评估。

方法

本文研究了使用 DataSHaPER 对已建立的研究进行回顾性协调的价值。使用 DataSHaPER 方法,评估了从 53 项大型基于人群的研究(690 万参与者)的问卷和身体测量中生成 148 个协调变量的潜力。还探讨了可能影响数据综合潜力的变量和研究特征。

结果

在所评估的所有评估项目中(53 项研究中的每一项都有 148 个变量),有 38%可以协调。变量和研究的某些特征(即相对重要性、个体针对性、参考期)与研究(即观测单位、数据收集开始日期和问卷管理模式)与协调潜力相关。例如,对于被认为是必不可少的变量,62%的配对评估项可以协调。

结论

本文表明,DataSHaPER 为跨研究信息的回顾性协调提供了一种有效且灵活的方法。要实施数据综合,必须考虑一些额外的科学、伦理法律和技术问题。DataSHaPER 作为一种协调方法的成功将取决于其持续发展以及其使用的严谨性和程度。DataSHaPER 有可能使我们更接近真正的协作流行病学,并通过综合数据库提供增强的研究潜力。

相似文献

1
Is rigorous retrospective harmonization possible? Application of the DataSHaPER approach across 53 large studies.严格的回顾性协调是否可行?DataSHaPER 方法在 53 项大型研究中的应用。
Int J Epidemiol. 2011 Oct;40(5):1314-28. doi: 10.1093/ije/dyr106. Epub 2011 Jul 30.
2
Quality, quantity and harmony: the DataSHaPER approach to integrating data across bioclinical studies.质量、数量和协调:DataSHaPER 方法在生物临床研究中整合数据。
Int J Epidemiol. 2010 Oct;39(5):1383-93. doi: 10.1093/ije/dyq139. Epub 2010 Sep 2.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Maelstrom Research guidelines for rigorous retrospective data harmonization.大漩涡研究严格回顾性数据协调指南。
Int J Epidemiol. 2017 Feb 1;46(1):103-105. doi: 10.1093/ije/dyw075.
5
Evaluating the harmonization potential of oral health-related questionnaires in national longitudinal birth and child cohort surveys.评估国家纵向出生和儿童队列调查中口腔健康相关问卷的协调性潜力。
J Public Health Dent. 2024 Sep;84(3):307-320. doi: 10.1111/jphd.12632. Epub 2024 Jul 2.
6
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
7
Harmonization of the Health and Risk Factor Questionnaire data of the Canadian Partnership for Tomorrow Project: a descriptive analysis.加拿大明日项目伙伴关系健康与风险因素调查问卷数据的协调:描述性分析
CMAJ Open. 2019 Apr 23;7(2):E272-E282. doi: 10.9778/cmajo.20180062. Print 2019 Apr-Jun.
8
Toward Rigorous Data Harmonization in Cancer Epidemiology Research: One Approach.迈向癌症流行病学研究中的严格数据协调:一种方法。
Am J Epidemiol. 2015 Dec 15;182(12):1033-8. doi: 10.1093/aje/kwv133. Epub 2015 Nov 20.
9
Data harmonization and federated analysis of population-based studies: the BioSHaRE project.基于人群研究的数据协调与联合分析:BioSHaRE项目。
Emerg Themes Epidemiol. 2013 Nov 21;10(1):12. doi: 10.1186/1742-7622-10-12.
10
[Data harmonization and sharing in study cohorts of respiratory diseases].[呼吸系统疾病研究队列中的数据协调与共享]
Zhonghua Liu Xing Bing Xue Za Zhi. 2018 Feb 10;39(2):233-239. doi: 10.3760/cma.j.issn.0254-6450.2018.02.019.

引用本文的文献

1
Harmonization of SDQ and ASEBA Phenotypes: Measurement Variance Across Cohorts.优势与困难问卷(SDQ)和青少年自我报告表(ASEBA)表型的协调:不同队列间的测量差异
J Psychopathol Behav Assess. 2025;47(1):27. doi: 10.1007/s10862-025-10204-0. Epub 2025 Mar 7.
2
The DREAM BIG project as a model for harmonizing early measures of parental care and parent-child interactions across epidemiological cohorts.“胸怀大志”项目作为协调不同流行病学队列中父母养育早期测量指标及亲子互动的典范。
Front Child Adolesc Psychiatry. 2023 Oct 20;2:1206922. doi: 10.3389/frcha.2023.1206922. eCollection 2023.
3
Data Resource Profile: Harmonized health survey data for 240 cities across 11 countries in Latin America: the SALURBAL project.数据资源简介:拉丁美洲11个国家240个城市的统一健康调查数据:SALURBAL项目
Int J Epidemiol. 2024 Dec 16;54(1). doi: 10.1093/ije/dyae171.
4
Evaluating the current methodological practices and issues in existing literature in pooling complex surveys: a systematic review.评估现有文献中复杂调查汇总的当前方法学实践和问题:系统评价。
BMC Med Res Methodol. 2024 Nov 13;24(1):279. doi: 10.1186/s12874-024-02400-5.
5
Harmonizing the CBCL and SDQ ADHD scores by using linear equating, kernel equating, item response theory and machine learning methods.通过使用线性等值、核等值、项目反应理论和机器学习方法使儿童行为检查表(CBCL)和优势与困难问卷(SDQ)中多动症得分相协调。
Front Psychol. 2024 Jul 10;15:1345406. doi: 10.3389/fpsyg.2024.1345406. eCollection 2024.
6
Pioneering a multi-phase framework to harmonize self-reported sleep data across cohorts.开创多阶段框架,以协调队列之间的自我报告睡眠数据。
Sleep. 2024 Sep 9;47(9). doi: 10.1093/sleep/zsae115.
7
Statistical assessment of reliability of anthropometric measurements in the multi-site South African National Dietary Intake Survey 2022.2022 年南非多地点国家膳食摄入量调查中人体测量学测量可靠性的统计评估。
Eur J Clin Nutr. 2024 Nov;78(11):1005-1013. doi: 10.1038/s41430-024-01449-1. Epub 2024 May 14.
8
Longitudinal natural history studies based on real-world data in rare diseases: Opportunity and a novel approach.基于真实世界数据的罕见病纵向自然史研究:机遇与新方法。
Mol Genet Metab. 2024 May;142(1):108453. doi: 10.1016/j.ymgme.2024.108453. Epub 2024 Mar 18.
9
A General Primer for Data Harmonization.数据协调通用指南
Sci Data. 2024 Jan 31;11(1):152. doi: 10.1038/s41597-024-02956-3.
10
Facilitating Harmonization of Variables in Framingham, MESA, ARIC, and REGARDS Studies Through a Metadata Repository.通过元数据存储库促进弗雷明汉、MESA、ARIC 和 REGARDS 研究中变量的协调。
Circ Cardiovasc Qual Outcomes. 2023 Nov;16(11):e009938. doi: 10.1161/CIRCOUTCOMES.123.009938. Epub 2023 Oct 18.

本文引用的文献

1
SAIL--a software system for sample and phenotype availability across biobanks and cohorts.SAIL——一个用于生物库和队列中样本和表型可用性的软件系统。
Bioinformatics. 2011 Feb 15;27(4):589-91. doi: 10.1093/bioinformatics/btq693. Epub 2010 Dec 17.
2
LifeGene--a large prospective population-based study of global relevance.生命基因——一项具有全球重要意义的大型前瞻性基于人群的研究。
Eur J Epidemiol. 2011 Jan;26(1):67-77. doi: 10.1007/s10654-010-9521-x. Epub 2010 Nov 21.
3
Quality, quantity and harmony: the DataSHaPER approach to integrating data across bioclinical studies.质量、数量和协调:DataSHaPER 方法在生物临床研究中整合数据。
Int J Epidemiol. 2010 Oct;39(5):1383-93. doi: 10.1093/ije/dyq139. Epub 2010 Sep 2.
4
DataSHIELD: resolving a conflict in contemporary bioscience--performing a pooled analysis of individual-level data without sharing the data.DataSHIELD:解决当代生物科学中的冲突——在不共享数据的情况下对个体水平数据进行汇总分析。
Int J Epidemiol. 2010 Oct;39(5):1372-82. doi: 10.1093/ije/dyq111. Epub 2010 Jul 14.
5
The Canadian Partnership for Tomorrow Project: building a pan-Canadian research platform for disease prevention.加拿大明日伙伴计划:构建一个全加拿大疾病预防研究平台。
CMAJ. 2010 Aug 10;182(11):1197-201. doi: 10.1503/cmaj.091540. Epub 2010 Apr 26.
6
Thinking big: large-scale collaborative research in observational epidemiology.高瞻远瞩:观察性流行病学中的大规模协作研究。
Eur J Epidemiol. 2009;24(12):727-31. doi: 10.1007/s10654-009-9412-1. Epub 2009 Dec 5.
7
The Canadian longitudinal study on aging (CLSA).加拿大老龄化纵向研究(CLSA)。
Can J Aging. 2009 Sep;28(3):221-9. doi: 10.1017/S0714980809990055.
8
Designing genome-wide association studies: sample size, power, imputation, and the choice of genotyping chip.设计全基因组关联研究:样本量、效能、填补以及基因分型芯片的选择
PLoS Genet. 2009 May;5(5):e1000477. doi: 10.1371/journal.pgen.1000477. Epub 2009 May 15.
9
Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.全基因组关联位点对人类疾病和性状的潜在病因学及功能影响。
Proc Natl Acad Sci U S A. 2009 Jun 9;106(23):9362-7. doi: 10.1073/pnas.0903103106. Epub 2009 May 27.
10
Size matters: just how big is BIG?: Quantifying realistic sample size requirements for human genome epidemiology.规模很重要:究竟多大才算大?:量化人类基因组流行病学中实际样本量的要求
Int J Epidemiol. 2009 Feb;38(1):263-73. doi: 10.1093/ije/dyn147. Epub 2008 Aug 1.