• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 PCA 的多重 SNP 基因-疾病关联的 bootstrap 置信区间检验。

PCA-based bootstrap confidence interval tests for gene-disease association involving multiple SNPs.

机构信息

Department of Epidemiology and Health Statistics, School of Public Health, Shandong University, Jinan 250012, PR China.

出版信息

BMC Genet. 2010 Jan 26;11:6. doi: 10.1186/1471-2156-11-6.

DOI:10.1186/1471-2156-11-6
PMID:20100356
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2825231/
Abstract

BACKGROUND

Genetic association study is currently the primary vehicle for identification and characterization of disease-predisposing variant(s) which usually involves multiple single-nucleotide polymorphisms (SNPs) available. However, SNP-wise association tests raise concerns over multiple testing. Haplotype-based methods have the advantage of being able to account for correlations between neighbouring SNPs, yet assuming Hardy-Weinberg equilibrium (HWE) and potentially large number degrees of freedom can harm its statistical power and robustness. Approaches based on principal component analysis (PCA) are preferable in this regard but their performance varies with methods of extracting principal components (PCs).

RESULTS

PCA-based bootstrap confidence interval test (PCA-BCIT), which directly uses the PC scores to assess gene-disease association, was developed and evaluated for three ways of extracting PCs, i.e., cases only(CAES), controls only(COES) and cases and controls combined(CES). Extraction of PCs with COES is preferred to that with CAES and CES. Performance of the test was examined via simulations as well as analyses on data of rheumatoid arthritis and heroin addiction, which maintains nominal level under null hypothesis and showed comparable performance with permutation test.

CONCLUSIONS

PCA-BCIT is a valid and powerful method for assessing gene-disease association involving multiple SNPs.

摘要

背景

遗传关联研究目前是鉴定和描述疾病易感变异的主要手段,通常涉及多个可用的单核苷酸多态性(SNP)。然而,SNP 关联测试引发了对多重检验的关注。基于单倍型的方法具有能够解释相邻 SNP 之间相关性的优势,但假设 Hardy-Weinberg 平衡(HWE)和潜在的大量自由度可能会损害其统计功效和稳健性。基于主成分分析(PCA)的方法在这方面更可取,但它们的性能因提取主成分(PC)的方法而异。

结果

开发了基于 PCA 的引导置信区间检验(PCA-BCIT),该检验直接使用 PC 分数来评估基因与疾病的关联,并针对提取 PC 的三种方法进行了评估,即仅病例(CAES)、仅对照(COES)和病例和对照合并(CES)。与 CAES 和 CES 相比,COES 提取 PC 的效果更好。通过模拟以及对类风湿关节炎和海洛因成瘾数据的分析来检验检验的性能,该检验在零假设下保持了名义水平,并表现出与置换检验相当的性能。

结论

PCA-BCIT 是一种有效且强大的方法,可用于评估涉及多个 SNP 的基因与疾病的关联。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9eb/2825231/c47d209e8809/1471-2156-11-6-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9eb/2825231/c8b21023346b/1471-2156-11-6-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9eb/2825231/dc7f37ae7326/1471-2156-11-6-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9eb/2825231/c47d209e8809/1471-2156-11-6-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9eb/2825231/c8b21023346b/1471-2156-11-6-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9eb/2825231/dc7f37ae7326/1471-2156-11-6-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9eb/2825231/c47d209e8809/1471-2156-11-6-3.jpg

相似文献

1
PCA-based bootstrap confidence interval tests for gene-disease association involving multiple SNPs.基于 PCA 的多重 SNP 基因-疾病关联的 bootstrap 置信区间检验。
BMC Genet. 2010 Jan 26;11:6. doi: 10.1186/1471-2156-11-6.
2
Testing association between disease and multiple SNPs in a candidate gene.检测候选基因中疾病与多个单核苷酸多态性之间的关联。
Genet Epidemiol. 2007 Jul;31(5):383-95. doi: 10.1002/gepi.20219.
3
A powerful score-based test statistic for detecting gene-gene co-association.一种用于检测基因-基因共关联的基于分数的强大检验统计量。
BMC Genet. 2016 Jan 29;17:31. doi: 10.1186/s12863-016-0331-3.
4
Tests of association between quantitative traits and haplotypes in a reduced-dimensional space.数量性状与降维空间中单体型之间的关联测试。
Ann Hum Genet. 2005 Nov;69(Pt 6):715-32. doi: 10.1111/j.1529-8817.2005.00216.x.
5
Principal component analysis for selection of optimal SNP-sets that capture intragenic genetic variation.用于选择捕获基因内遗传变异的最佳单核苷酸多态性(SNP)集的主成分分析。
Genet Epidemiol. 2004 Jan;26(1):11-21. doi: 10.1002/gepi.10292.
6
Association between two mu-opioid receptor gene (OPRM1) haplotype blocks and drug or alcohol dependence.两种μ-阿片受体基因(OPRM1)单倍型模块与药物或酒精依赖之间的关联。
Hum Mol Genet. 2006 Mar 15;15(6):807-19. doi: 10.1093/hmg/ddl024. Epub 2006 Feb 13.
7
Resampling-based multiple hypothesis testing procedures for genetic case-control association studies.基于重采样的遗传病例对照关联研究多重假设检验程序。
Genet Epidemiol. 2006 Sep;30(6):495-507. doi: 10.1002/gepi.20162.
8
Maximizing the power of principal-component analysis of correlated phenotypes in genome-wide association studies.最大化全基因组关联研究中相关表型主成分分析的功效。
Am J Hum Genet. 2014 May 1;94(5):662-76. doi: 10.1016/j.ajhg.2014.03.016. Epub 2014 Apr 17.
9
Weighted SNP set analysis in genome-wide association study.基于全基因组关联研究的加权 SNP 集分析。
PLoS One. 2013 Sep 30;8(9):e75897. doi: 10.1371/journal.pone.0075897. eCollection 2013.
10
Association test based on SNP set: logistic kernel machine based test vs. principal component analysis.基于 SNP 集的关联测试:逻辑核机器测试与主成分分析。
PLoS One. 2012;7(9):e44978. doi: 10.1371/journal.pone.0044978. Epub 2012 Sep 13.

引用本文的文献

1
Combined Use of Univariate and Multivariate Approaches to Detect Selection Signatures Associated with Milk or Meat Production in Cattle.单变量和多变量方法联合使用以检测与奶牛产奶或产肉相关的选择印记
Genes (Basel). 2024 Nov 26;15(12):1516. doi: 10.3390/genes15121516.
2
Principal Component Analysis Enhanced with Bootstrapped Confidence Interval for the Classification of Parkinsonian Patients Using Gaussian Mixture Model and Gait Initiation Parameters.基于 Bootstrap 置信区间的主成分分析增强用于基于高斯混合模型和步态启动参数的帕金森病患者分类。
Sensors (Basel). 2024 Mar 15;24(6):1885. doi: 10.3390/s24061885.
3
Gene-Based Genome-Wide Association Study Identified Genes for Agronomic Traits in Maize.

本文引用的文献

1
An approach to incorporate linkage disequilibrium structure into genomic association analysis.一种将连锁不平衡结构纳入基因组关联分析的方法。
J Genet Genomics. 2008 Jun;35(6):381-5. doi: 10.1016/S1673-8527(08)60055-7.
2
Association tests based on the principal-component analysis.基于主成分分析的关联测试。
BMC Proc. 2007;1 Suppl 1(Suppl 1):S130. doi: 10.1186/1753-6561-1-s1-s130. Epub 2007 Dec 18.
3
TRAF1-C5 as a risk locus for rheumatoid arthritis--a genomewide study.全基因组研究:TRAF1-C5作为类风湿关节炎的一个风险基因座
基于基因的全基因组关联研究鉴定出玉米农艺性状相关基因。
Biology (Basel). 2022 Nov 11;11(11):1649. doi: 10.3390/biology11111649.
4
A machine learning-based framework to identify type 2 diabetes through electronic health records.一种基于机器学习的通过电子健康记录识别2型糖尿病的框架。
Int J Med Inform. 2017 Jan;97:120-127. doi: 10.1016/j.ijmedinf.2016.09.014. Epub 2016 Oct 1.
5
Molecular mechanical differences between isoforms of contractile actin in the presence of isoforms of smooth muscle tropomyosin.收缩性肌动蛋白同工型与平滑肌原肌球蛋白同工型存在时的分子力学差异。
PLoS Comput Biol. 2013 Oct;9(10):e1003273. doi: 10.1371/journal.pcbi.1003273. Epub 2013 Oct 24.
6
Heritability of objectively assessed daily physical activity and sedentary behavior.客观评估的日常体力活动和久坐行为的遗传力。
Am J Clin Nutr. 2013 Nov;98(5):1317-25. doi: 10.3945/ajcn.113.069849. Epub 2013 Sep 18.
7
SNP set association analysis for genome-wide association studies.SNP 集关联分析在全基因组关联研究中的应用。
PLoS One. 2013 May 3;8(5):e62495. doi: 10.1371/journal.pone.0062495. Print 2013.
8
A latent variable partial least squares path modeling approach to regional association and polygenic effect with applications to a human obesity study.一种潜在变量偏最小二乘路径建模方法,用于区域关联和多基因效应,并应用于人类肥胖研究。
PLoS One. 2012;7(2):e31927. doi: 10.1371/journal.pone.0031927. Epub 2012 Feb 27.
9
Gene- or region-based association study via kernel principal component analysis.基于基因或区域的核主成分分析关联研究。
BMC Genet. 2011 Aug 26;12:75. doi: 10.1186/1471-2156-12-75.
N Engl J Med. 2007 Sep 20;357(12):1199-209. doi: 10.1056/NEJMoa073491. Epub 2007 Sep 5.
4
Linear trend tests for case-control genetic association that incorporate random phenotype and genotype misclassification error.纳入随机表型和基因型错误分类误差的病例对照基因关联线性趋势检验。
Genet Epidemiol. 2007 Dec;31(8):853-70. doi: 10.1002/gepi.20246.
5
Gene-gene and gene-environment interactions involving HLA-DRB1, PTPN22, and smoking in two subsets of rheumatoid arthritis.类风湿关节炎两个亚组中涉及HLA-DRB1、PTPN22和吸烟的基因-基因及基因-环境相互作用
Am J Hum Genet. 2007 May;80(5):867-75. doi: 10.1086/516736. Epub 2007 Apr 2.
6
Testing association between disease and multiple SNPs in a candidate gene.检测候选基因中疾病与多个单核苷酸多态性之间的关联。
Genet Epidemiol. 2007 Jul;31(5):383-95. doi: 10.1002/gepi.20219.
7
Improved power by use of a weighted score test for linkage disequilibrium mapping.通过使用加权评分检验进行连锁不平衡定位来提高功效。
Am J Hum Genet. 2007 Feb;80(2):353-60. doi: 10.1086/511312. Epub 2006 Dec 21.
8
Effect of mu-opioid receptor gene polymorphisms on heroin-induced subjective responses in a Chinese population.μ-阿片受体基因多态性对中国人群中海洛因所致主观反应的影响。
Biol Psychiatry. 2007 Jun 1;61(11):1244-51. doi: 10.1016/j.biopsych.2006.07.012. Epub 2006 Dec 8.
9
A tutorial on statistical methods for population association studies.群体关联研究统计方法教程。
Nat Rev Genet. 2006 Oct;7(10):781-91. doi: 10.1038/nrg1916.
10
Replication of putative candidate-gene associations with rheumatoid arthritis in >4,000 samples from North America and Sweden: association of susceptibility with PTPN22, CTLA4, and PADI4.在来自北美和瑞典的4000多个样本中对类风湿关节炎假定候选基因关联进行复制研究:易感性与蛋白酪氨酸磷酸酶非受体型22(PTPN22)、细胞毒性T淋巴细胞相关抗原4(CTLA4)和肽基精氨酸脱亚氨酶4(PADI4)的关联。
Am J Hum Genet. 2005 Dec;77(6):1044-60. doi: 10.1086/498651. Epub 2005 Nov 1.