• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

开启临床实验室检测大规模用于关联研究的大门:探索定义表型的不同方法。

OPENING THE DOOR TO THE LARGE SCALE USE OF CLINICAL LAB MEASURES FOR ASSOCIATION TESTING: EXPLORING DIFFERENT METHODS FOR DEFINING PHENOTYPES.

作者信息

Bauer Christopher R, Lavage Daniel, Snyder John, Leader Joseph, Mahoney J Matthew, Pendergrass Sarah A

机构信息

Biomedical & Translational Informatics, Geisinger Health System, 100 N. Academy Ave. Danville, PA 17821, USA,

出版信息

Pac Symp Biocomput. 2017;22:356-367. doi: 10.1142/9789813207813_0034.

DOI:10.1142/9789813207813_0034
PMID:27896989
Abstract

The past decade has seen exponential growth in the numbers of sequenced and genotyped individuals and a corresponding increase in our ability of collect and catalogue phenotypic data for use in the clinic. We now face the challenge of integrating these diverse data in new ways new that can provide useful diagnostics and precise medical interventions for individual patients. One of the first steps in this process is to accurately map the phenotypic consequences of the genetic variation in human populations. The most common approach for this is the genome wide association study (GWAS). While this technique is relatively simple to implement for a given phenotype, the choice of how to define a phenotype is critical. It is becoming increasingly common for each individual in a GWAS cohort to have a large profile of quantitative measures. The standard approach is to test for associations with one measure at a time; however, there are many justifiable ways to define a set of phenotypes, and the genetic associations that are revealed will vary based on these definitions. Some phenotypes may only show a significant genetic association signal when considered together, such as through principle components analysis (PCA). Combining correlated measures may increase the power to detect association by reducing the noise present in individual variables and reduce the multiple hypothesis testing burden. Here we show that PCA and k-means clustering are two complimentary methods for identifying novel genotype-phenotype relationships within a set of quantitative human traits derived from the Geisinger Health System electronic health record (EHR). Using a diverse set of approaches for defining phenotype may yield more insights into the genetic architecture of complex traits and the findings presented here highlight a clear need for further investigation into other methods for defining the most relevant phenotypes in a set of variables. As the data of EHR continue to grow, addressing these issues will become increasingly important in our efforts to use genomic data effectively in medicine.

摘要

在过去十年中,测序和基因分型个体的数量呈指数增长,我们收集和编目临床使用的表型数据的能力也相应提高。我们现在面临着以新的方式整合这些多样数据的挑战,以便能够为个体患者提供有用的诊断和精准的医疗干预。这一过程的首要步骤之一是准确描绘人类群体中基因变异的表型后果。对此最常见的方法是全基因组关联研究(GWAS)。虽然对于给定的表型,这种技术相对容易实施,但如何定义表型的选择至关重要。在GWAS队列中,每个个体拥有大量定量测量数据的情况越来越普遍。标准方法是一次针对一项测量进行关联测试;然而,有许多合理的方式来定义一组表型,并且根据这些定义所揭示的基因关联会有所不同。一些表型可能只有在综合考虑时才会显示出显著的基因关联信号,比如通过主成分分析(PCA)。合并相关测量可能会通过减少单个变量中的噪声来提高检测关联的能力,并减轻多重假设检验的负担。在这里,我们表明主成分分析和k均值聚类是两种互补的方法,用于在源自盖辛格健康系统电子健康记录(EHR)的一组人类定量性状中识别新的基因型 - 表型关系。使用多种不同的方法来定义表型可能会对复杂性状的遗传结构有更多的深入了解,并且这里呈现的研究结果凸显了对进一步研究其他方法以在一组变量中定义最相关表型的明确需求。随着电子健康记录数据的持续增长,在我们有效利用基因组数据进行医学研究的努力中,解决这些问题将变得越来越重要。

相似文献

1
OPENING THE DOOR TO THE LARGE SCALE USE OF CLINICAL LAB MEASURES FOR ASSOCIATION TESTING: EXPLORING DIFFERENT METHODS FOR DEFINING PHENOTYPES.开启临床实验室检测大规模用于关联研究的大门:探索定义表型的不同方法。
Pac Symp Biocomput. 2017;22:356-367. doi: 10.1142/9789813207813_0034.
2
INTEGRATING CLINICAL LABORATORY MEASURES AND ICD-9 CODE DIAGNOSES IN PHENOME-WIDE ASSOCIATION STUDIES.在全表型关联研究中整合临床实验室检测指标与ICD - 9编码诊断信息
Pac Symp Biocomput. 2016;21:168-79.
3
IDENTIFYING GENETIC ASSOCIATIONS WITH VARIABILITY IN METABOLIC HEALTH AND BLOOD COUNT LABORATORY VALUES: DIVING INTO THE QUANTITATIVE TRAITS BY LEVERAGING LONGITUDINAL DATA FROM AN EHR.识别与代谢健康和血细胞计数实验室值变异性相关的基因关联:利用电子健康记录中的纵向数据深入研究数量性状。
Pac Symp Biocomput. 2017;22:533-544. doi: 10.1142/9789813207813_0049.
4
LabWAS: Novel findings and study design recommendations from a meta-analysis of clinical labs in two independent biobanks.LabWAS:来自两个独立生物库中临床实验室的荟萃分析的新发现和研究设计建议。
PLoS Genet. 2020 Nov 11;16(11):e1009077. doi: 10.1371/journal.pgen.1009077. eCollection 2020 Nov.
5
Translating genome wide association study results to associations among common diseases: in silico study with an electronic medical record.将全基因组关联研究结果转化为常见疾病之间的关联:基于电子病历的计算机模拟研究。
Int J Med Inform. 2013 Sep;82(9):864-74. doi: 10.1016/j.ijmedinf.2013.05.003. Epub 2013 Jun 3.
6
The use of phenome-wide association studies (PheWAS) for exploration of novel genotype-phenotype relationships and pleiotropy discovery.利用表型全基因组关联研究(PheWAS)探索新的基因型-表型关系和多效性发现。
Genet Epidemiol. 2011 Jul;35(5):410-22. doi: 10.1002/gepi.20589. Epub 2011 May 18.
7
Next-generation analysis of cataracts: determining knowledge driven gene-gene interactions using Biofilter, and gene-environment interactions using the PhenX Toolkit.白内障的下一代分析:使用生物过滤器确定知识驱动的基因-基因相互作用,以及使用PhenX工具包确定基因-环境相互作用。
Pac Symp Biocomput. 2013:147-58.
8
Next-generation analysis of cataracts: determining knowledge driven gene-gene interactions using biofilter, and gene-environment interactions using the Phenx Toolkit*.白内障的下一代分析:使用生物过滤器确定知识驱动的基因-基因相互作用,以及使用Phenx工具包确定基因-环境相互作用*。
Pac Symp Biocomput. 2015:495-505.
9
Identifying the genetic variation of gene expression using gene sets: application of novel gene Set eQTL approach to PharmGKB and KEGG.利用基因集识别基因表达的遗传变异:新型基因集 eQTL 方法在 PharmGKB 和 KEGG 中的应用。
PLoS One. 2012;7(8):e43301. doi: 10.1371/journal.pone.0043301. Epub 2012 Aug 14.
10
PSEA: Phenotype Set Enrichment Analysis--a new method for analysis of multiple phenotypes.PSEA:表型集富集分析——一种分析多种表型的新方法。
Genet Epidemiol. 2012 Apr;36(3):244-52. doi: 10.1002/gepi.21617.

引用本文的文献

1
Characterization of genetic and phenotypic heterogeneity of obstructive sleep apnea using electronic health records.利用电子健康记录对阻塞性睡眠呼吸暂停的遗传和表型异质性进行特征描述。
BMC Med Genomics. 2020 Jul 25;13(1):105. doi: 10.1186/s12920-020-00755-4.
2
Interpretation of machine learning predictions for patient outcomes in electronic health records.电子健康记录中患者预后机器学习预测的解读。
AMIA Annu Symp Proc. 2020 Mar 4;2019:572-581. eCollection 2019.
3
Using Electronic Health Records To Generate Phenotypes For Research.利用电子健康记录生成用于研究的表型。
Curr Protoc Hum Genet. 2019 Jan;100(1):e80. doi: 10.1002/cphg.80. Epub 2018 Dec 5.
4
Rare variants in drug target genes contributing to complex diseases, phenome-wide.药物靶点基因中的罕见变异与复杂疾病表型全基因组关联研究
Sci Rep. 2018 Mar 15;8(1):4624. doi: 10.1038/s41598-018-22834-4.