• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LabWAS:来自两个独立生物库中临床实验室的荟萃分析的新发现和研究设计建议。

LabWAS: Novel findings and study design recommendations from a meta-analysis of clinical labs in two independent biobanks.

机构信息

Department of Pathology, Northwestern University Feinberg School of Medicine, Chicago Illinois, United States of America.

Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, United States of America.

出版信息

PLoS Genet. 2020 Nov 11;16(11):e1009077. doi: 10.1371/journal.pgen.1009077. eCollection 2020 Nov.

DOI:10.1371/journal.pgen.1009077
PMID:33175840
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7682892/
Abstract

Phenotypes extracted from Electronic Health Records (EHRs) are increasingly prevalent in genetic studies. EHRs contain hundreds of distinct clinical laboratory test results, providing a trove of health data beyond diagnoses. Such lab data is complex and lacks a ubiquitous coding scheme, making it more challenging than diagnosis data. Here we describe the first large-scale cross-health system genome-wide association study (GWAS) of EHR-based quantitative laboratory-derived phenotypes. We meta-analyzed 70 lab traits matched between the BioVU cohort from the Vanderbilt University Health System and the Michigan Genomics Initiative (MGI) cohort from Michigan Medicine. We show high replication of known association for these traits, validating EHR-based measurements as high-quality phenotypes for genetic analysis. Notably, our analysis provides the first replication for 699 previous GWAS associations across 46 different traits. We discovered 31 novel associations at genome-wide significance for 22 distinct traits, including the first reported associations for two lab-based traits. We replicated 22 of these novel associations in an independent tranche of BioVU samples. The summary statistics for all association tests are freely available to benefit other researchers. Finally, we performed mirrored analyses in BioVU and MGI to assess competing analytic practices for EHR lab traits. We find that using the mean of all available lab measurements provides a robust summary value, but alternate summarizations can improve power in certain circumstances. This study provides a proof-of-principle for cross health system GWAS and is a framework for future studies of quantitative EHR lab traits.

摘要

从电子健康记录(EHR)中提取的表型在遗传研究中越来越普遍。EHR 包含数百种不同的临床实验室测试结果,提供了超越诊断的丰富健康数据。这些实验室数据很复杂,缺乏普遍的编码方案,因此比诊断数据更具挑战性。在这里,我们描述了第一个基于电子病历的大规模跨健康系统全基因组关联研究(GWAS),用于研究基于实验室的定量衍生表型。我们对范德比尔特大学健康系统的 BioVU 队列和密歇根医学的密歇根基因组倡议(MGI)队列之间匹配的 70 种实验室特征进行了荟萃分析。我们展示了这些特征的已知关联的高度复制,验证了基于电子病历的测量值作为遗传分析的高质量表型。值得注意的是,我们的分析为 46 种不同特征的 699 个先前 GWAS 关联提供了首次复制。我们在 22 种不同特征中发现了 31 个具有全基因组意义的新关联,包括两种基于实验室的特征的首次报道的关联。我们在 BioVU 的另一批独立样本中复制了其中 22 个新的关联。所有关联测试的汇总统计信息均可免费获得,以惠益其他研究人员。最后,我们在 BioVU 和 MGI 中进行了镜像分析,以评估电子病历实验室特征的竞争分析实践。我们发现,使用所有可用实验室测量值的平均值提供了稳健的汇总值,但在某些情况下,其他汇总方法可以提高功效。这项研究为跨健康系统 GWAS 提供了原理证明,并且为未来研究定量 EHR 实验室特征提供了框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10f2/7682892/7063c2ed557f/pgen.1009077.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10f2/7682892/d3f5e853c8d7/pgen.1009077.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10f2/7682892/1e7dc056bf89/pgen.1009077.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10f2/7682892/7793a39e7020/pgen.1009077.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10f2/7682892/7063c2ed557f/pgen.1009077.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10f2/7682892/d3f5e853c8d7/pgen.1009077.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10f2/7682892/1e7dc056bf89/pgen.1009077.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10f2/7682892/7793a39e7020/pgen.1009077.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10f2/7682892/7063c2ed557f/pgen.1009077.g004.jpg

相似文献

1
LabWAS: Novel findings and study design recommendations from a meta-analysis of clinical labs in two independent biobanks.LabWAS:来自两个独立生物库中临床实验室的荟萃分析的新发现和研究设计建议。
PLoS Genet. 2020 Nov 11;16(11):e1009077. doi: 10.1371/journal.pgen.1009077. eCollection 2020 Nov.
2
Clinical laboratory test-wide association scan of polygenic scores identifies biomarkers of complex disease.临床实验室检测全基因组关联扫描识别复杂疾病的生物标志物。
Genome Med. 2021 Jan 13;13(1):6. doi: 10.1186/s13073-020-00820-8.
3
IDENTIFYING GENETIC ASSOCIATIONS WITH VARIABILITY IN METABOLIC HEALTH AND BLOOD COUNT LABORATORY VALUES: DIVING INTO THE QUANTITATIVE TRAITS BY LEVERAGING LONGITUDINAL DATA FROM AN EHR.识别与代谢健康和血细胞计数实验室值变异性相关的基因关联:利用电子健康记录中的纵向数据深入研究数量性状。
Pac Symp Biocomput. 2017;22:533-544. doi: 10.1142/9789813207813_0049.
4
INTEGRATING CLINICAL LABORATORY MEASURES AND ICD-9 CODE DIAGNOSES IN PHENOME-WIDE ASSOCIATION STUDIES.在全表型关联研究中整合临床实验室检测指标与ICD - 9编码诊断信息
Pac Symp Biocomput. 2016;21:168-79.
5
Phenotype risk scores (PheRS) for pancreatic cancer using time-stamped electronic health record data: Discovery and validation in two large biobanks.利用带时间戳的电子健康记录数据构建胰腺癌的表型风险评分(PheRS):在两个大型生物样本库中的发现与验证
J Biomed Inform. 2021 Jan;113:103652. doi: 10.1016/j.jbi.2020.103652. Epub 2020 Dec 3.
6
Investigation of multi-trait associations using pathway-based analysis of GWAS summary statistics.基于 GWAS 汇总统计数据的通路分析探究多性状关联。
BMC Genomics. 2019 Feb 4;20(Suppl 1):79. doi: 10.1186/s12864-018-5373-7.
7
Exploration of haplotype research consortium imputation for genome-wide association studies in 20,032 Generation Scotland participants.对20,032名苏格兰世代研究参与者进行全基因组关联研究的单倍型研究联盟归因分析探索。
Genome Med. 2017 Mar 7;9(1):23. doi: 10.1186/s13073-017-0414-4.
8
Association of Polygenic Risk Scores for Multiple Cancers in a Phenome-wide Study: Results from The Michigan Genomics Initiative.多癌种多基因风险评分在表型全基因组研究中的关联:密歇根基因组倡议的结果。
Am J Hum Genet. 2018 Jun 7;102(6):1048-1061. doi: 10.1016/j.ajhg.2018.04.001. Epub 2018 May 17.
9
Cancer PRSweb: An Online Repository with Polygenic Risk Scores for Major Cancer Traits and Their Evaluation in Two Independent Biobanks.癌症 PRSweb:一个具有主要癌症特征多基因风险评分的在线知识库及其在两个独立生物库中的评估。
Am J Hum Genet. 2020 Nov 5;107(5):815-836. doi: 10.1016/j.ajhg.2020.08.025. Epub 2020 Sep 28.
10
Exploring various polygenic risk scores for skin cancer in the phenomes of the Michigan genomics initiative and the UK Biobank with a visual catalog: PRSWeb.探索密歇根基因组倡议和英国生物库表型中用于皮肤癌的多种多基因风险评分:PRSWeb。
PLoS Genet. 2019 Jun 13;15(6):e1008202. doi: 10.1371/journal.pgen.1008202. eCollection 2019 Jun.

引用本文的文献

1
Exploring beyond diagnoses in electronic health records to improve discovery: a review of the phenome-wide association study.探索电子健康记录中的诊断之外的信息以改善发现:全表型关联研究综述
JAMIA Open. 2025 Feb 28;8(1):ooaf006. doi: 10.1093/jamiaopen/ooaf006. eCollection 2025 Feb.
2
Implications of gene × environment interactions in post-traumatic stress disorder risk and treatment.基因×环境相互作用对创伤后应激障碍风险及治疗的影响
J Clin Invest. 2025 Mar 3;135(5):e185102. doi: 10.1172/JCI185102.
3
Genetic determinants and phenotypic consequences of blood T-cell proportions in 207,000 diverse individuals.

本文引用的文献

1
Exploring and visualizing large-scale genetic associations by using PheWeb.使用PheWeb探索和可视化大规模基因关联。
Nat Genet. 2020 Jun;52(6):550-552. doi: 10.1038/s41588-020-0622-5.
2
The Korea Biobank Array: Design and Identification of Coding Variants Associated with Blood Biochemical Traits.韩国生物银行阵列:与血液生化特征相关的编码变异的设计和鉴定。
Sci Rep. 2019 Feb 4;9(1):1382. doi: 10.1038/s41598-018-37832-9.
3
Leveraging Polygenic Functional Enrichment to Improve GWAS Power.利用多基因功能富集提高 GWAS 效力。
207000 名不同个体的血液 T 细胞比例的遗传决定因素和表型后果。
Nat Commun. 2024 Aug 7;15(1):6732. doi: 10.1038/s41467-024-51095-1.
4
Characterising the genetic architecture of changes in adiposity during adulthood using electronic health records.利用电子健康记录描述成年期体脂变化的遗传结构。
Nat Commun. 2024 Jul 10;15(1):5801. doi: 10.1038/s41467-024-49998-0.
5
Using electronic health records for clinical pharmacology research: Challenges and considerations.利用电子健康记录进行临床药理学研究:挑战与考虑。
Clin Transl Sci. 2024 Jul;17(7):e13871. doi: 10.1111/cts.13871.
6
To weight or not to weight? The effect of selection bias in 3 large electronic health record-linked biobanks and recommendations for practice.是否要进行体重测量?3 个大型电子健康记录相关生物库中的选择偏倚效应及其实践建议。
J Am Med Inform Assoc. 2024 Jun 20;31(7):1479-1492. doi: 10.1093/jamia/ocae098.
7
Phenomewide Association Study of Health Outcomes Associated With the Genetic Correlates of 25 Hydroxyvitamin D Concentration and Vitamin D Binding Protein Concentration.与 25 羟维生素 D 浓度和维生素 D 结合蛋白浓度的遗传相关物相关的健康结果的全表型关联研究。
Twin Res Hum Genet. 2024 Apr;27(2):69-79. doi: 10.1017/thg.2024.19. Epub 2024 Apr 22.
8
To weight or not to weight? Studying the effect of selection bias in three large EHR-linked biobanks.是否加权?研究三个大型电子健康记录关联生物样本库中选择偏倚的影响。
medRxiv. 2024 Feb 13:2024.02.12.24302710. doi: 10.1101/2024.02.12.24302710.
9
The phenotype-genotype reference map: Improving biobank data science through replication.表型-基因型参考图谱:通过复制提高生物库数据科学。
Am J Hum Genet. 2023 Sep 7;110(9):1522-1533. doi: 10.1016/j.ajhg.2023.07.012. Epub 2023 Aug 21.
10
A longitudinal genome-wide association study of bone mineral density mean and variability in the UK Biobank.英国生物库中骨密度均值和变异性的纵向全基因组关联研究。
Osteoporos Int. 2023 Nov;34(11):1907-1916. doi: 10.1007/s00198-023-06852-1. Epub 2023 Jul 27.
Am J Hum Genet. 2019 Jan 3;104(1):65-75. doi: 10.1016/j.ajhg.2018.11.008. Epub 2018 Dec 27.
4
The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019.NHGRI-EBI GWAS Catalog 于 2019 年发布的已发表全基因组关联研究、靶向基因芯片和汇总统计数据
Nucleic Acids Res. 2019 Jan 8;47(D1):D1005-D1012. doi: 10.1093/nar/gky1120.
5
Genetics of blood lipids among ~300,000 multi-ethnic participants of the Million Veteran Program.《百万退伍军人计划中约 30 万多民族参与者的血脂遗传学》。
Nat Genet. 2018 Nov;50(11):1514-1523. doi: 10.1038/s41588-018-0222-9. Epub 2018 Oct 1.
6
Genome-wide association analyses identify 143 risk variants and putative regulatory mechanisms for type 2 diabetes.全基因组关联分析鉴定出 143 个 2 型糖尿病风险变异和潜在调控机制。
Nat Commun. 2018 Jul 27;9(1):2941. doi: 10.1038/s41467-018-04951-w.
7
Genomic atlas of the human plasma proteome.人类血浆蛋白质组基因组图谱。
Nature. 2018 Jun;558(7708):73-79. doi: 10.1038/s41586-018-0175-2. Epub 2018 Jun 6.
8
Association of Polygenic Risk Scores for Multiple Cancers in a Phenome-wide Study: Results from The Michigan Genomics Initiative.多癌种多基因风险评分在表型全基因组研究中的关联:密歇根基因组倡议的结果。
Am J Hum Genet. 2018 Jun 7;102(6):1048-1061. doi: 10.1016/j.ajhg.2018.04.001. Epub 2018 May 17.
9
PheWAS and Beyond: The Landscape of Associations with Medical Diagnoses and Clinical Measures across 38,662 Individuals from Geisinger.PheWAS 及其他:在来自 Geisinger 的 38662 个人中,与医疗诊断和临床指标相关的关联全景。
Am J Hum Genet. 2018 Apr 5;102(4):592-608. doi: 10.1016/j.ajhg.2018.02.017. Epub 2018 Mar 29.
10
Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases.在日本人群中对数量性状的遗传分析将细胞类型与复杂的人类疾病联系起来。
Nat Genet. 2018 Mar;50(3):390-400. doi: 10.1038/s41588-018-0047-6. Epub 2018 Feb 5.