Standardized Architecture for a Mega-Biobank Phenomic Library: The Million Veteran Program (MVP)

Authors

Knight Kathryn E, Honerlaw Jacqueline, Danciu Ioana, Linares Franciel, Ho Yuk-Lam, Gagnon David R, Rush Everett, Gaziano J Michael, Begoli Edmon, Cho Kelly

Affiliations

Oak Ridge National Laboratory, Oak Ridge, TN.

Division of Population Health and Data Science, Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), VA Boston Healthcare System, Boston, MA.

Publication information

AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:326-334. eCollection 2020.

PMID: 32477652

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC7233040/

Abstract

Electronic health records (EHRs) provide a wealth of data for phenotype development in population health studies, and researchers invest considerable time in curating data elements and validating disease definitions. The ability to reproduce well-defined phenotypes increases data quality and the comparability of results, and it expedites research. In this paper, we present a standardized approach to organizing and capturing phenotype definitions, resulting in the creation of an open, online repository of phenotypes. This resource captures phenotype development, provenance, and process from the Million Veteran Program, a national mega-biobank embedded in the Veterans Health Administration (VHA). To ensure that the repository is searchable, extendable, and sustainable, it is necessary to develop both a proper digital catalog architecture and an underlying metadata infrastructure to enable effective management of the data fields required to define each phenotype. Our methods provide a resource for VHA investigators and a roadmap for researchers interested in standardizing their phenotype definitions to increase portability.
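
To make the idea of a searchable phenotype catalog backed by per-phenotype metadata more concrete, the following is a minimal sketch in Python. It is not the MVP implementation: the class names, the field names (`data_sources`, `code_sets`, `validation`, `provenance`, `keywords`), and the in-memory search are all assumptions chosen for illustration, and the abstract does not specify the repository's actual schema.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class PhenotypeDefinition:
    """One catalog entry. Every field name here is hypothetical."""

    name: str
    description: str
    data_sources: list[str] = field(default_factory=list)  # EHR domains used
    code_sets: dict[str, list[str]] = field(default_factory=dict)  # coding system -> codes
    validation: str = ""  # how the definition was validated
    provenance: str = ""  # who developed it, and for which study
    keywords: list[str] = field(default_factory=list)


class PhenotypeCatalog:
    """In-memory catalog keyed by phenotype name, with simple text search."""

    def __init__(self) -> None:
        self._entries: dict[str, PhenotypeDefinition] = {}

    def add(self, entry: PhenotypeDefinition) -> None:
        # Reject duplicates so each phenotype name maps to one definition.
        if entry.name in self._entries:
            raise ValueError(f"duplicate phenotype: {entry.name}")
        self._entries[entry.name] = entry

    def search(self, term: str) -> list[PhenotypeDefinition]:
        # Case-insensitive match against name, description, and keywords.
        term = term.lower()
        return [
            e
            for e in self._entries.values()
            if term in e.name.lower()
            or term in e.description.lower()
            or any(term in k.lower() for k in e.keywords)
        ]


if __name__ == "__main__":
    catalog = PhenotypeCatalog()
    catalog.add(
        PhenotypeDefinition(
            name="example_phenotype",  # placeholder, not a real MVP phenotype
            description="Placeholder entry for illustration",
            code_sets={"EXAMPLE_SYSTEM": ["CODE-001", "CODE-002"]},
            keywords=["demo"],
        )
    )
    for hit in catalog.search("demo"):
        print(hit.name, hit.code_sets)
```

Keeping provenance and validation as fields on the same record as the code sets is one way to let a definition travel with the context needed to reproduce it. A production repository would more likely sit on a database with a controlled metadata vocabulary.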
