
Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure.

Authors

Yoo Yun Joo, Sun Lei, Poirier Julia G, Paterson Andrew D, Bull Shelley B

Affiliations

Department of Mathematics Education, Seoul National University, Seoul, South Korea.

Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea.

Publication information

Genet Epidemiol. 2017 Feb;41(2):108-121. doi: 10.1002/gepi.22024. Epub 2016 Nov 25.

DOI: 10.1002/gepi.22024
PMID: 27885705
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5245123/

Abstract

By jointly analyzing multiple variants within a gene, instead of one at a time, gene-based multiple regression can improve power, robustness, and interpretation in genetic association analysis. We investigate multiple linear combination (MLC) test statistics for analysis of common variants under realistic trait models with linkage disequilibrium (LD) based on HapMap Asian haplotypes. MLC is a directional test that exploits LD structure in a gene to construct clusters of closely correlated variants recoded such that the majority of pairwise correlations are positive. It combines variant effects within the same cluster linearly, and aggregates cluster-specific effects in a quadratic sum of squares and cross-products, producing a test statistic with reduced degrees of freedom (df) equal to the number of clusters. By simulation studies of 1000 genes from across the genome, we demonstrate that MLC is a well-powered and robust choice among existing methods across a broad range of gene structures. Compared to minimum P-value, variance-component, and principal-component methods, the mean power of MLC is never much lower than that of other methods, and can be higher, particularly with multiple causal variants. Moreover, the variation in gene-specific MLC test size and power across 1000 genes is less than that of other methods, suggesting it is a complementary approach for discovery in genome-wide analysis. The cluster construction of the MLC test statistics helps reveal within-gene LD structure, allowing interpretation of clustered variants as haplotypic effects, while multiple regression helps to distinguish direct and indirect associations.
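
The construction described in the abstract (linear combination within clusters, quadratic aggregation across clusters, df equal to the number of clusters) can be illustrated with a minimal sketch. The abstract does not give the formula, so the inverse-covariance weighting used here is an assumption, and the function name `mlc_statistic` and its arguments are hypothetical. The inputs are taken to be multiple-regression coefficient estimates, their covariance matrix, and one cluster label per variant.

```python
import numpy as np


def mlc_statistic(beta, cov, clusters):
    """Illustrative MLC-style statistic (assumed form, not taken from the abstract).

    beta     : (K,) multiple-regression coefficient estimates, one per variant
    cov      : (K, K) estimated covariance matrix of beta
    clusters : length-K sequence of integer cluster labels in 0..C-1
    Returns (statistic, df), where df is the number of clusters C.
    """
    beta = np.asarray(beta, dtype=float)
    cov = np.asarray(cov, dtype=float)
    labels = np.asarray(clusters, dtype=int)
    n_clusters = int(labels.max()) + 1

    # J is the K x C indicator matrix: J[k, c] = 1 if variant k is in cluster c.
    J = np.zeros((labels.size, n_clusters))
    J[np.arange(labels.size), labels] = 1.0

    cov_inv = np.linalg.inv(cov)
    u = J.T @ cov_inv @ beta   # within-cluster linear combinations of effects
    V = J.T @ cov_inv @ J      # covariance of those combinations (assumed weighting)

    # Quadratic form over clusters: squares and cross-products of cluster effects.
    stat = float(u @ np.linalg.solve(V, u))
    return stat, n_clusters


# Two variants with unit, uncorrelated variances (toy numbers, not real data).
cov = np.eye(2)
same_sign = mlc_statistic([1.0, 1.0], cov, [0, 0])   # one cluster, effects agree
opposite = mlc_statistic([1.0, -1.0], cov, [0, 0])   # one cluster, effects cancel
separate = mlc_statistic([1.0, -1.0], cov, [0, 1])   # each variant its own cluster
print(same_sign, opposite, separate)
```

In this toy setting, effects of opposite sign cancel when they share a cluster, which is why the abstract's recoding step (making most pairwise correlations positive) matters for a directional test. When every variant forms its own cluster, the statistic reduces to the usual Wald quadratic form with df equal to the number of variants.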

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c57d/5245123/c16ba80cd600/GEPI-41-108-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c57d/5245123/0f9d35c77b89/GEPI-41-108-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c57d/5245123/262f1e00a161/GEPI-41-108-g002.jpg
