遗传关联研究中多个单核苷酸多态性的分析：三种多位点方法在单核苷酸多态性优先级排序和选择方面的比较

Analysis of multiple SNPs in genetic association studies: comparison of three multi-locus methods to prioritize and select SNPs.

作者信息

Heidema A Geert, Feskens Edith J M, Doevendans Pieter A F M, Ruven Henk J T, van Houwelingen Hans C, Mariman Edwin C M, Boer Jolanda M A

机构信息

Centre for Nutrition and Health, National Institute for Public Health and the Environment, Bilthoven, The Netherlands.

出版信息

Genet Epidemiol. 2007 Dec;31(8):910-21. doi: 10.1002/gepi.20251.

DOI:10.1002/gepi.20251

PMID:17615573

Abstract

Nonparametric approaches have been developed that are able to analyze large numbers of single nucleotide polymorphisms (SNPs) in modest sample sizes. These approaches have different selection features and may not provide similar results when applied to the same dataset. Therefore, we compared the results of three approaches (set association, random forests and multifactor dimensionality reduction [MDR]) to select from a total of 93 candidate SNPs a subset of SNPs that are important in determining high-density lipoprotein (HDL)-cholesterol levels. The study population consisted of a random sample from a Dutch monitoring project for cardiovascular disease risk factors and was dichotomized into cases (low HDL-cholesterol, n = 533) and non-cases (high HDL-cholesterol, n = 545) based on gender-specific median values for HDL cholesterol. Clearly, all three approaches prioritized three SNPs as important (CETP Taq1B, CETP-629 C/A and LPL Ser447X). Two SNPs with weaker main effects were additionally prioritized by random forests (APOC3 3175 G/C and CCR2 Val62Ile), whereas MTHFR 677 C/T was selected in combination with CETP Taq1B as best model by MDR. Obtained p-values for the selected models were significant for the set association approach (p =.0019), random forests (p<.01) and MDR (p<.02). In conclusion, the application of a combination of multi-locus methods is a useful approach in genetic association studies to select a well-defined set of important SNPs for further statistical and epidemiological interpretation, providing increased confidence and more information compared with the application of only one method.

摘要

已经开发出非参数方法，能够在样本量适中的情况下分析大量单核苷酸多态性（SNP）。这些方法具有不同的选择特征，应用于同一数据集时可能不会产生相似的结果。因此，我们比较了三种方法（集合关联、随机森林和多因素降维法[MDR]）的结果，以便从总共93个候选SNP中选出一组对确定高密度脂蛋白（HDL）胆固醇水平至关重要的SNP子集。研究人群是从荷兰心血管疾病危险因素监测项目中随机抽取的样本，并根据HDL胆固醇的性别特异性中位数，分为病例组（HDL胆固醇水平低，n = 533）和非病例组（HDL胆固醇水平高，n = 545）。显然，所有三种方法都将三个SNP列为重要SNP（CETP Taq1B、CETP - 629 C/A和LPL Ser447X）。随机森林法还额外将两个主效应较弱的SNP列为重要SNP（APOC3 3175 G/C和CCR2 Val62Ile），而MDR法将MTHFR 677 C/T与CETP Taq1B组合选为最佳模型。所选模型的p值对于集合关联法（p = 0.0019）、随机森林法（p < 0.01）和MDR法（p < 0.02）均具有显著性。总之，在基因关联研究中，应用多种多位点方法的组合是一种有用的方法，可用于选择一组明确的重要SNP，以便进行进一步的统计和流行病学解释，与仅应用一种方法相比，能提供更高的可信度和更多信息。

相似文献

Analysis of multiple SNPs in genetic association studies: comparison of three multi-locus methods to prioritize and select SNPs.

Genet Epidemiol. 2007 Dec;31(8):910-21. doi: 10.1002/gepi.20251.

Evaluating the ability of tree-based methods and logistic regression for the detection of SNP-SNP interaction.

Ann Hum Genet. 2009 May;73(Pt 3):360-9. doi: 10.1111/j.1469-1809.2009.00511.x. Epub 2009 Mar 8.

Identifying SNPs predictive of phenotype using random forests.

Genet Epidemiol. 2005 Feb;28(2):171-82. doi: 10.1002/gepi.20041.

Association of extreme blood lipid profile phenotypic variation with 11 reverse cholesterol transport genes and 10 non-genetic cardiovascular disease risk factors.

Hum Mol Genet. 2003 Nov 1;12(21):2733-43. doi: 10.1093/hmg/ddg314. Epub 2003 Sep 9.

Screening large-scale association study data: exploiting interactions using random forests.

BMC Genet. 2004 Dec 10;5:32. doi: 10.1186/1471-2156-5-32.

Genome-wide association analysis of high-density lipoprotein cholesterol in the population-based KORA study sheds new light on intergenic regions.

Circ Cardiovasc Genet. 2008 Oct;1(1):10-20. doi: 10.1161/CIRCGENETICS.108.776708.

Haplotypes and SNPs in 13 lipid-relevant genes explain most of the genetic variance in high-density lipoprotein and low-density lipoprotein cholesterol.

Hum Mol Genet. 2004 May 15;13(10):993-1004. doi: 10.1093/hmg/ddh119. Epub 2004 Mar 25.

Association of cholesteryl ester transfer protein -629C > A polymorphism with high-density lipoprotein cholesterol levels in coronary artery disease patients.

Cell Biochem Funct. 2009 Oct;27(7):452-7. doi: 10.1002/cbf.1593.

Ability of epistatic interactions of cytokine single-nucleotide polymorphisms to predict susceptibility to disease subsets in systemic sclerosis patients.

Arthritis Rheum. 2008 Jul 15;59(7):974-83. doi: 10.1002/art.23836.

Bayesian variable and model selection methods for genetic association studies.

Genet Epidemiol. 2009 Jan;33(1):27-37. doi: 10.1002/gepi.20353.

引用本文的文献

A genetic interaction of NRXN2 with GABRE, SYT1 and CASK in migraine patients: a case-control study.

J Headache Pain. 2021 Jun 14;22(1):57. doi: 10.1186/s10194-021-01266-y.

Machine Learning Predicts Accurately Drug Resistance From Whole Genome Sequencing Data.

Front Genet. 2019 Sep 26;10:922. doi: 10.3389/fgene.2019.00922. eCollection 2019.

Interactions between polymorphisms in the 3'untranslated region of the cyclin dependent kinase 6 gene and the human papillomavirus infection, and risk of cervical precancerous lesions.

Biomed Rep. 2017 Jun;6(6):640-648. doi: 10.3892/br.2017.898. Epub 2017 May 3.

Transdisciplinary approaches enhance the production of translational knowledge.

Transl Res. 2017 Apr;182:123-134. doi: 10.1016/j.trsl.2016.11.002. Epub 2016 Nov 10.

Interaction between susceptibility loci in cGAS-STING pathway, MHC gene and HPV infection on the risk of cervical precancerous lesions in Chinese population.

Oncotarget. 2016 Dec 20;7(51):84228-84238. doi: 10.18632/oncotarget.12399.

Multivariate Methods for Genetic Variants Selection and Risk Prediction in Cardiovascular Diseases.

Front Cardiovasc Med. 2016 Jun 8;3:17. doi: 10.3389/fcvm.2016.00017. eCollection 2016.

Association of cholesteryl ester transfer protein genotypes with paraoxonase-1 activity, lipid profile and oxidative stress in type 2 diabetes mellitus: A study in San Luis, Argentina.

J Diabetes Investig. 2015 Jan;6(1):67-77. doi: 10.1111/jdi.12256. Epub 2014 Jul 25.

Gender-specific genetic associations of polymorphisms in ACE, AKR1C2, FTO and MMP2 with weight gain over a 10-year period.

Genes Nutr. 2014 Nov;9(6):434. doi: 10.1007/s12263-014-0434-2. Epub 2014 Oct 17.

Beyond the fourth wave of genome-wide obesity association studies.

Nutr Diabetes. 2012 Jul 30;2(7):e37. doi: 10.1038/nutd.2012.9.

Investigation of homocysteine-pathway-related variants in essential hypertension.

Int J Hypertens. 2012;2012:190923. doi: 10.1155/2012/190923. Epub 2012 Oct 23.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

遗传关联研究中多个单核苷酸多态性的分析：三种多位点方法在单核苷酸多态性优先级排序和选择方面的比较

Analysis of multiple SNPs in genetic association studies: comparison of three multi-locus methods to prioritize and select SNPs.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献