• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用潜在变量结构方程模型评估千人基因组计划外显子测序数据中罕见和常见变异的总体效应。

Evaluating aggregate effects of rare and common variants in the 1000 Genomes Project exon sequencing data using latent variable structural equation modeling.

作者信息

Nock Nl, Zhang Lx

机构信息

Department of Epidemiology and Biostatistics, Case Western Reserve University, 2103 Cornell Road, Cleveland, OH 44106-7281, USA.

出版信息

BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S47. doi: 10.1186/1753-6561-5-S9-S47.

DOI:10.1186/1753-6561-5-S9-S47
PMID:22373404
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3287884/
Abstract

Methods that can evaluate aggregate effects of rare and common variants are limited. Therefore, we applied a two-stage approach to evaluate aggregate gene effects in the 1000 Genomes Project data, which contain 24,487 single-nucleotide polymorphisms (SNPs) in 697 unrelated individuals from 7 populations. In stage 1, we identified potentially interesting genes (PIGs) as those having at least one SNP meeting Bonferroni correction using univariate, multiple regression models. In stage 2, we evaluate aggregate PIG effects on trait, Q1, by modeling each gene as a latent construct, which is defined by multiple common and rare variants, using the multivariate statistical framework of structural equation modeling (SEM). In stage 1, we found that PIGs varied markedly between a randomly selected replicate (replicate 137) and 100 other replicates, with the exception of FLT1. In stage 1, collapsing rare variants decreased false positives but increased false negatives. In stage 2, we developed a good-fitting SEM model that included all nine genes simulated to affect Q1 (FLT1, KDR, ARNT, ELAV4, FLT4, HIF1A, HIF3A, VEGFA, VEGFC) and found that FLT1 had the largest effect on Q1 (βstd = 0.33 ± 0.05). Using replicate 137 estimates as population values, we found that the mean relative bias in the parameters (loadings, paths, residuals) and their standard errors across 100 replicates was on average, less than 5%. Our latent variable SEM approach provides a viable framework for modeling aggregate effects of rare and common variants in multiple genes, but more elegant methods are needed in stage 1 to minimize type I and type II error.

摘要

能够评估罕见和常见变异总体效应的方法有限。因此,我们采用两阶段方法在千人基因组计划数据中评估总体基因效应,该数据包含来自7个群体的697名无关个体中的24487个单核苷酸多态性(SNP)。在第一阶段,我们将潜在有趣基因(PIG)定义为使用单变量、多元回归模型至少有一个SNP符合Bonferroni校正的基因。在第二阶段,我们通过使用结构方程模型(SEM)的多元统计框架,将每个基因建模为一个由多个常见和罕见变异定义的潜在结构,来评估PIG对性状Q1的总体效应。在第一阶段,我们发现除FLT1外,在随机选择的一个重复样本(重复样本137)和其他100个重复样本之间,PIG有显著差异。在第一阶段,合并罕见变异减少了假阳性,但增加了假阴性。在第二阶段,我们开发了一个拟合良好的SEM模型,该模型包括模拟影响Q1的所有九个基因(FLT1、KDR、ARNT、ELAV4、FLT4、HIF1A、HIF3A、VEGFA、VEGFC),并发现FLT1对Q1的影响最大(βstd = 0.33 ± 0.05)。使用重复样本137的估计值作为总体值,我们发现在100个重复样本中,参数(负荷、路径、残差)及其标准误差的平均相对偏差平均小于5%。我们的潜在变量SEM方法为建模多个基因中罕见和常见变异的总体效应提供了一个可行的框架,但在第一阶段需要更完善的方法来最小化I型和II型错误。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af09/3287884/1c176f9289d2/1753-6561-5-S9-S47-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af09/3287884/e5c3961fd105/1753-6561-5-S9-S47-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af09/3287884/b8f99d100bce/1753-6561-5-S9-S47-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af09/3287884/1c176f9289d2/1753-6561-5-S9-S47-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af09/3287884/e5c3961fd105/1753-6561-5-S9-S47-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af09/3287884/b8f99d100bce/1753-6561-5-S9-S47-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af09/3287884/1c176f9289d2/1753-6561-5-S9-S47-3.jpg

相似文献

1
Evaluating aggregate effects of rare and common variants in the 1000 Genomes Project exon sequencing data using latent variable structural equation modeling.使用潜在变量结构方程模型评估千人基因组计划外显子测序数据中罕见和常见变异的总体效应。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S47. doi: 10.1186/1753-6561-5-S9-S47.
2
Application of collapsing methods for continuous traits to the Genetic Analysis Workshop 17 exome sequence data.将连续性状的压缩方法应用于遗传分析研讨会17外显子组序列数据。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S121. doi: 10.1186/1753-6561-5-S9-S121.
3
Modeling the complex gene x environment interplay in the simulated rheumatoid arthritis GAW15 data using latent variable structural equation modeling.使用潜在变量结构方程模型对模拟类风湿性关节炎GAW15数据中的复杂基因与环境相互作用进行建模。
BMC Proc. 2007;1 Suppl 1(Suppl 1):S118. doi: 10.1186/1753-6561-1-s1-s118. Epub 2007 Dec 18.
4
Two-stage extreme phenotype sequencing design for discovering and testing common and rare genetic variants: efficiency and power.用于发现和测试常见及罕见遗传变异的两阶段极端表型测序设计:效率与效能
Hum Hered. 2012;73(3):139-47. doi: 10.1159/000337300. Epub 2012 Jun 7.
5
Use of principal components to aggregate rare variants in case-control and family-based association studies in the presence of multiple covariates.在存在多个协变量的情况下,在病例对照研究和基于家系的关联研究中使用主成分来汇总罕见变异。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S29. doi: 10.1186/1753-6561-5-S9-S29.
6
Identifying causal rare variants of disease through family-based analysis of Genetics Analysis Workshop 17 data set.通过对遗传分析研讨会17数据集进行基于家系的分析来识别疾病的因果罕见变异。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S21. doi: 10.1186/1753-6561-5-S9-S21.
7
Are quantitative trait-dependent sampling designs cost-effective for analysis of rare and common variants?对于罕见变异和常见变异的分析而言,数量性状依赖抽样设计是否具有成本效益?
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S111. doi: 10.1186/1753-6561-5-S9-S111.
8
Identification of genes associated with complex traits by testing the genetic dissimilarity between individuals.通过检测个体间的遗传差异来鉴定与复杂性状相关的基因。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S120. doi: 10.1186/1753-6561-5-S9-S120.
9
GENOME-WIDE ASSOCIATION MAPPING AND RARE ALLELES: FROM POPULATION GENOMICS TO PERSONALIZED MEDICINE - Session Introduction.全基因组关联图谱与罕见等位基因:从群体基因组学到个性化医学——会议介绍
Pac Symp Biocomput. 2011:74-5. doi: 10.1142/9789814335058_0008.
10
Evaluation of a LASSO regression approach on the unrelated samples of Genetic Analysis Workshop 17.对遗传分析研讨会17的非相关样本进行套索回归方法评估。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S12. doi: 10.1186/1753-6561-5-S9-S12.

引用本文的文献

1
Including Phenotypic Causal Networks in Genome-Wide Association Studies Using Mixed Effects Structural Equation Models.使用混合效应结构方程模型将表型因果网络纳入全基因组关联研究
Front Genet. 2018 Oct 9;9:455. doi: 10.3389/fgene.2018.00455. eCollection 2018.
2
Analysis of Gene Expression Variance in Schizophrenia Using Structural Equation Modeling.使用结构方程模型分析精神分裂症中的基因表达变异
Front Mol Neurosci. 2018 Jun 11;11:192. doi: 10.3389/fnmol.2018.00192. eCollection 2018.
3
Inflated type I error rates when using aggregation methods to analyze rare variants in the 1000 Genomes Project exon sequencing data in unrelated individuals: summary results from Group 7 at Genetic Analysis Workshop 17.

本文引用的文献

1
Interrogating population structure and its impact on association tests.探究群体结构及其对关联检验的影响。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S25. doi: 10.1186/1753-6561-5-S9-S25.
2
Genetic Analysis Workshop 17 mini-exome simulation.遗传分析研讨会17小型外显子模拟
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S2. doi: 10.1186/1753-6561-5-S9-S2.
3
Statistical analysis of rare sequence variants: an overview of collapsing methods.稀有序列变异的统计分析:压缩方法概述。
在分析无关个体的 1000 基因组项目外显子测序数据中的罕见变异时,使用聚合方法会导致膨胀的Ⅰ型错误率:来自第 17 届遗传分析研讨会第 7 组的总结结果。
Genet Epidemiol. 2011;35 Suppl 1(Suppl 1):S56-60. doi: 10.1002/gepi.20650.
Genet Epidemiol. 2011;35 Suppl 1(Suppl 1):S12-7. doi: 10.1002/gepi.20643.
4
Interrogating local population structure for fine mapping in genome-wide association studies.全基因组关联研究中精细定位的局域人群结构分析。
Bioinformatics. 2010 Dec 1;26(23):2961-8. doi: 10.1093/bioinformatics/btq560. Epub 2010 Sep 30.
5
Defining genetic determinants of the Metabolic Syndrome in the Framingham Heart Study using association and structural equation modeling methods.在弗雷明汉心脏研究中,运用关联分析和结构方程模型方法确定代谢综合征的遗传决定因素。
BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S50. doi: 10.1186/1753-6561-3-s7-s50.
6
Modeling the complex gene x environment interplay in the simulated rheumatoid arthritis GAW15 data using latent variable structural equation modeling.使用潜在变量结构方程模型对模拟类风湿性关节炎GAW15数据中的复杂基因与环境相互作用进行建模。
BMC Proc. 2007;1 Suppl 1(Suppl 1):S118. doi: 10.1186/1753-6561-1-s1-s118. Epub 2007 Dec 18.
7
PLINK: a tool set for whole-genome association and population-based linkage analyses.PLINK:一个用于全基因组关联分析和基于群体的连锁分析的工具集。
Am J Hum Genet. 2007 Sep;81(3):559-75. doi: 10.1086/519795. Epub 2007 Jul 25.