研究就绪数据：C-Surv 数据模型。

Research-ready data: the C-Surv data model.

机构信息

Department of Psychiatry, University of Oxford, Oxford, United Kingdom.

Swansea University Medical School, Swansea University, Swansea, United Kingdom.

出版信息

Eur J Epidemiol. 2023 Feb;38(2):179-187. doi: 10.1007/s10654-022-00916-y. Epub 2023 Jan 7.

DOI:10.1007/s10654-022-00916-y

PMID:36609896

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9825071/

Abstract

Research-ready data (data curated to a defined standard) increase scientific opportunity and rigour by integrating the data environment. The development of research platforms has highlighted the value of research-ready data, particularly for multi-cohort analyses. Following stakeholder consultation, a standard data model (C-Surv) optimised for data discovery, was developed using data from 5 population and clinical cohort studies. The model uses a four-tier nested structure based on 18 data themes selected according to user behaviour or technology. Standard variable naming conventions are applied to uniquely identify variables within the context of longitudinal studies. The data model was used to develop a harmonised dataset for 11 cohorts. This dataset populated the Cohort Explorer data discovery tool for assessing the feasibility of an analysis prior to making a data access request. Data preparation times were compared between cohort specific data models and C-Surv.It was concluded that adopting a common data model as a data standard for the discovery and analysis of research cohort data offers multiple benefits.

摘要

研究就绪数据（经过定义的标准进行整理的数据）通过整合数据环境，增加了科学机会和严谨性。研究平台的发展凸显了研究就绪数据的价值，特别是对于多队列分析。在利益相关者协商后，使用来自 5 个人群和临床队列研究的数据，开发了一个针对数据发现优化的标准数据模型 (C-Surv)。该模型使用基于根据用户行为或技术选择的 18 个数据主题的四层嵌套结构。标准变量命名约定应用于在纵向研究的上下文中唯一标识变量。该数据模型用于为 11 个队列开发一个协调数据集。该数据集填充了 Cohort Explorer 数据发现工具，用于在提出数据访问请求之前评估分析的可行性。比较了特定于队列的数据模型和 C-Surv 之间的数据准备时间。得出的结论是，采用通用数据模型作为研究队列数据的发现和分析的数据标准具有多种好处。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c05b/9905161/f9864d47a44e/10654_2022_916_Fig1_HTML.jpg

相似文献

Research-ready data: the C-Surv data model.

Eur J Epidemiol. 2023 Feb;38(2):179-187. doi: 10.1007/s10654-022-00916-y. Epub 2023 Jan 7.

Evaluating the harmonisation potential of diverse cohort datasets.

Eur J Epidemiol. 2023 Jun;38(6):605-615. doi: 10.1007/s10654-023-00997-3. Epub 2023 Apr 26.

Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

VAE-Surv: A novel approach for genetic-based clustering and prognosis prediction in myelodysplastic syndromes.

Comput Methods Programs Biomed. 2025 Apr;261:108605. doi: 10.1016/j.cmpb.2025.108605. Epub 2025 Jan 20.

The National Sleep Research Resource: towards a sleep data commons.

J Am Med Inform Assoc. 2018 Oct 1;25(10):1351-1358. doi: 10.1093/jamia/ocy064.

Hearing loss before and after cisplatin-based chemotherapy in testicular cancer survivors: a longitudinal study.

Acta Oncol. 2018 Aug;57(8):1075-1083. doi: 10.1080/0284186X.2018.1433323. Epub 2018 Jan 31.

Mortality and Morbidity Effects of Long-Term Exposure to Low-Level PM, BC, NO, and O: An Analysis of European Cohorts in the ELAPSE Project.

Res Rep Health Eff Inst. 2021 Sep;2021(208):1-127.

The Dementias Platform UK (DPUK) Data Portal.

Eur J Epidemiol. 2020 Jun;35(6):601-611. doi: 10.1007/s10654-020-00633-4. Epub 2020 Apr 23.

Identifying miRNA-mRNA Integration Set Associated With Survival Time.

Front Genet. 2021 Jun 29;12:634922. doi: 10.3389/fgene.2021.634922. eCollection 2021.

引用本文的文献

A natural language processing approach to support biomedical data harmonization: Leveraging large language models.

PLoS One. 2025 Jul 24;20(7):e0328262. doi: 10.1371/journal.pone.0328262. eCollection 2025.

We must discuss research environments.

R Soc Open Sci. 2024 Jun 26;11(6):231742. doi: 10.1098/rsos.231742. eCollection 2024 Jun.

National and international collaborations to advance research into vascular contributions to cognitive decline.

Cereb Circ Cogn Behav. 2023 Dec 14;6:100195. doi: 10.1016/j.cccb.2023.100195. eCollection 2024.

The pursuit of approaches to federate data to accelerate Alzheimer's disease and related dementia research: GAAIN, DPUK, and ADDI.

Front Neuroinform. 2023 May 25;17:1175689. doi: 10.3389/fninf.2023.1175689. eCollection 2023.

Evaluating the harmonisation potential of diverse cohort datasets.

Eur J Epidemiol. 2023 Jun;38(6):605-615. doi: 10.1007/s10654-023-00997-3. Epub 2023 Apr 26.

Neurodegenerative disease of the brain: a survey of interdisciplinary approaches.

J R Soc Interface. 2023 Jan;20(198):20220406. doi: 10.1098/rsif.2022.0406. Epub 2023 Jan 18.

本文引用的文献

Developing a Dementia Platform Databank Using Multiple Existing Cohorts.

Yonsei Med J. 2021 Nov;62(11):1062-1068. doi: 10.3349/ymj.2021.62.11.1062.

The Dementias Platform UK (DPUK) Data Portal.

Eur J Epidemiol. 2020 Jun;35(6):601-611. doi: 10.1007/s10654-020-00633-4. Epub 2020 Apr 23.

Deep and Frequent Phenotyping study protocol: an observational study in prodromal Alzheimer's disease.

BMJ Open. 2019 Mar 23;9(3):e024498. doi: 10.1136/bmjopen-2018-024498.

Cognitive and imaging markers in non-demented subjects attending a memory clinic: study design and baseline findings of the MEMENTO cohort.

Alzheimers Res Ther. 2017 Aug 29;9(1):67. doi: 10.1186/s13195-017-0288-0.

De novo sequencing, assembly and analysis of eight different transcriptomes from the Malayan pangolin.

Sci Rep. 2016 Sep 13;6:28199. doi: 10.1038/srep28199.

Parkinson's Disease Subtypes in the Oxford Parkinson Disease Centre (OPDC) Discovery Cohort.

J Parkinsons Dis. 2015;5(2):269-79. doi: 10.3233/JPD-140523.

The Global Alzheimer's Association Interactive Network.

Alzheimers Dement. 2016 Jan;12(1):49-54. doi: 10.1016/j.jalz.2015.06.1896. Epub 2015 Aug 28.

UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age.

PLoS Med. 2015 Mar 31;12(3):e1001779. doi: 10.1371/journal.pmed.1001779. eCollection 2015 Mar.

The Airwave Health Monitoring Study of police officers and staff in Great Britain: rationale, design and methods.

Environ Res. 2014 Oct;134:280-5. doi: 10.1016/j.envres.2014.07.025. Epub 2014 Sep 6.

Characterizing mild cognitive impairment in incident Parkinson disease: the ICICLE-PD study.

Neurology. 2014 Jan 28;82(4):308-16. doi: 10.1212/WNL.0000000000000066. Epub 2013 Dec 20.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

研究就绪数据：C-Surv 数据模型。

Research-ready data: the C-Surv data model.

机构信息

Department of Psychiatry, University of Oxford, Oxford, United Kingdom.

Swansea University Medical School, Swansea University, Swansea, United Kingdom.

出版信息

Eur J Epidemiol. 2023 Feb;38(2):179-187. doi: 10.1007/s10654-022-00916-y. Epub 2023 Jan 7.

DOI:10.1007/s10654-022-00916-y

PMID:36609896

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9825071/

Abstract

摘要

研究就绪数据：C-Surv 数据模型。

Research-ready data: the C-Surv data model.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

研究就绪数据：C-Surv 数据模型。

Research-ready data: the C-Surv data model.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献