Trimming, weighting, and grouping SNPs in human case-control association studies.

Authors

Hoh J, Wille A, Ott J

Affiliation

Laboratory of Statistical Genetics, Rockefeller University, New York, New York 10021, USA.

Publication

Genome Res. 2001 Dec;11(12):2115-9. doi: 10.1101/gr.204001.

DOI:10.1101/gr.204001

PMID:11731502

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC311222/

Abstract

The search for genes underlying complex traits has been difficult and often disappointing. The main reason for these difficulties is that several genes, each with rather small effect, might be interacting to produce the trait. Therefore, we must search the whole genome for a good chance to find these genes. Doing this with tens of thousands of SNP markers, however, greatly increases the overall probability of false-positive results, and current methods limiting such error probabilities to acceptable levels tend to reduce the power of detecting weak genes. Investigating large numbers of SNPs inevitably introduces errors (e.g., in genotyping), which will distort analysis results. Here we propose a simple strategy that circumvents many of these problems. We develop a set-association method to blend relevant sources of information such as allelic association and Hardy-Weinberg disequilibrium. Information is combined over multiple markers and genes in the genome, quality control is improved by trimming, and an appropriate testing strategy limits the overall false-positive rate. In contrast to other available methods, our method to detect association to sets of SNP markers in different genes in a real data application has shown remarkable success.
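The abstract names the ingredients of the set-association approach (allelic association, Hardy-Weinberg disequilibrium, trimming, combining markers, and a test that controls the overall false-positive rate) but not the formulas. The following is a minimal Python sketch of one way these pieces could fit together, not the authors' implementation. Every specific choice here is an assumption made for illustration: the per-SNP score as the product of an allelic chi-square and the Hardy-Weinberg chi-square in cases, trimming by zeroing SNPs whose controls deviate strongly from Hardy-Weinberg equilibrium, summing the largest `n` scores, and a label-permutation test on the smallest p-value over `n`. The function names, the 0/1/2 genotype coding, and the threshold 10.83 are likewise hypothetical.

```python
import random


def allelic_chi2(case_g, ctrl_g):
    """1-df chi-square on the 2x2 allele-count table (genotypes coded 0/1/2)."""
    a1 = sum(case_g)
    a0 = 2 * len(case_g) - a1
    b1 = sum(ctrl_g)
    b0 = 2 * len(ctrl_g) - b1
    n = a1 + a0 + b1 + b0
    denom = (a1 + a0) * (b1 + b0) * (a1 + b1) * (a0 + b0)
    return 0.0 if denom == 0 else n * (a1 * b0 - a0 * b1) ** 2 / denom


def hwd_chi2(g):
    """1-df chi-square for departure from Hardy-Weinberg proportions."""
    n = len(g)
    if n == 0:
        return 0.0
    p = sum(g) / (2 * n)
    if p == 0.0 or p == 1.0:
        return 0.0  # monomorphic sample: no departure can be measured
    q = 1.0 - p
    expected = [n * q * q, 2 * n * p * q, n * p * p]
    observed = [g.count(0), g.count(1), g.count(2)]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def snp_scores(geno, is_case, trim_threshold):
    """Score each SNP; geno is a list of per-SNP genotype lists."""
    scores = []
    for snp in geno:
        cases = [g for g, c in zip(snp, is_case) if c]
        ctrls = [g for g, c in zip(snp, is_case) if not c]
        if hwd_chi2(ctrls) > trim_threshold:
            scores.append(0.0)  # trimmed: treated as a likely genotyping problem
        else:
            # Assumed weighting: association statistic times HWD in cases.
            scores.append(allelic_chi2(cases, ctrls) * hwd_chi2(cases))
    return scores


def set_association(geno, is_case, max_n=3, n_perm=200,
                    trim_threshold=10.83, seed=0):
    """Return (best_n, smallest per-n p-value, overall permutation p-value)."""
    rng = random.Random(seed)
    k = min(max_n, len(geno))

    def sums(labels):
        # Cumulative sums of the k largest scores: S_1, S_2, ..., S_k.
        top = sorted(snp_scores(geno, labels, trim_threshold), reverse=True)[:k]
        out, total = [], 0.0
        for value in top:
            total += value
            out.append(total)
        return out

    labels = list(is_case)
    all_stats = [sums(labels)]  # index 0 holds the observed data
    for _ in range(n_perm):
        rng.shuffle(labels)  # permute case/control labels
        all_stats.append(sums(labels))

    def min_p(stat):
        # Per-n p-values against the pooled observed + permuted datasets.
        pvals = [sum(o[n] >= stat[n] for o in all_stats) / len(all_stats)
                 for n in range(k)]
        best = min(pvals)
        return best, pvals.index(best) + 1

    obs_min, best_n = min_p(all_stats[0])
    # Correct for having chosen the best n: compare minimum p-values.
    overall = sum(min_p(s)[0] <= obs_min for s in all_stats) / len(all_stats)
    return best_n, obs_min, overall
```

In this sketch the whole procedure, including trimming, is repeated on every permuted dataset, so the overall p-value accounts both for the number of SNPs examined and for the choice of `n`. A call such as `set_association(geno, is_case, max_n=5, n_perm=1000)` would take `geno` as one list of 0/1/2 genotypes per SNP and `is_case` as a matching list of booleans.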

Similar articles

Quantification of the power of Hardy-Weinberg equilibrium testing to detect genotyping error.

Hum Hered. 2006;61(1):10-4. doi: 10.1159/000091787. Epub 2006 Mar 1.

Detecting susceptibility genes in case-control studies using set association.

BMC Genet. 2003 Dec 31;4 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2156-4-S1-S9.

Detection of genotyping errors and pseudo-SNPs via deviations from Hardy-Weinberg equilibrium.

Genet Epidemiol. 2005 Nov;29(3):204-14. doi: 10.1002/gepi.20086.

Statistical multilocus methods for disequilibrium analysis in complex traits.

Hum Mutat. 2001 Apr;17(4):285-8. doi: 10.1002/humu.25.

Testing Hardy-Weinberg disequilibrium using the generalized linear model.

Genet Res (Camb). 2012 Dec;94(6):319-30. doi: 10.1017/S0016672312000511. Epub 2012 Dec 18.

The power of genome-wide association studies of complex disease genes: statistical limitations of indirect approaches using SNP markers.

J Hum Genet. 2001;46(8):478-82. doi: 10.1007/s100380170048.

Detection of genotyping errors by Hardy-Weinberg equilibrium testing.

Eur J Hum Genet. 2004 May;12(5):395-9. doi: 10.1038/sj.ejhg.5201164.

Physiology and Endocrinology Symposium: How single nucleotide polymorphism chips will advance our knowledge of factors controlling puberty and aid in selecting replacement beef females.

J Anim Sci. 2012 Apr;90(4):1152-65. doi: 10.2527/jas.2011-4581. Epub 2011 Oct 28.

The impact of missing and erroneous genotypes on tagging SNP selection and power of subsequent association tests.

Hum Hered. 2006;61(1):31-44. doi: 10.1159/000092141. Epub 2006 Mar 23.

Cited by

Synergistic Epistasis and Systems Biology Approaches to Uncover a Pharmacogenomic Map Linked to Pain, Anti-Inflammatory and Immunomodulating Agents (PAIma) in a Healthy Cohort.

Cell Mol Neurobiol. 2024 Nov 6;44(1):74. doi: 10.1007/s10571-024-01504-2.

Simultaneous detection of novel genes and SNPs by adaptive p-value combination.

Front Genet. 2022 Nov 17;13:1009428. doi: 10.3389/fgene.2022.1009428. eCollection 2022.

Identification of key gene signatures for the overall survival of ovarian cancer.

J Ovarian Res. 2022 Jan 20;15(1):12. doi: 10.1186/s13048-022-00942-0.

Genotype Pattern Mining for Pairs of Interacting Variants Underlying Digenic Traits.

Genes (Basel). 2021 Jul 28;12(8):1160. doi: 10.3390/genes12081160.

Detecting Weak Signals by Combining Small P-Values in Genetic Association Studies.

Front Genet. 2019 Nov 20;10:1051. doi: 10.3389/fgene.2019.01051. eCollection 2019.

A 3' UTR SNP rs885863, a cis-eQTL for the circadian gene VIPR2 and lincRNA 689, is associated with opioid addiction.

PLoS One. 2019 Nov 5;14(11):e0224399. doi: 10.1371/journal.pone.0224399. eCollection 2019.

A non-coding CRHR2 SNP rs255105, a cis-eQTL for a downstream lincRNA AC005154.6, is associated with heroin addiction.

PLoS One. 2018 Jun 28;13(6):e0199951. doi: 10.1371/journal.pone.0199951. eCollection 2018.

Clustering of ABCB1 and CYP2C19 Genetic Variants Predicts Risk of Major Bleeding and Thrombotic Events in Elderly Patients with Acute Coronary Syndrome Receiving Dual Antiplatelet Therapy with Aspirin and Clopidogrel.

Drugs Aging. 2018 Jul;35(7):649-656. doi: 10.1007/s40266-018-0555-1.

Genetic variations in genes of the stress response pathway are associated with prolonged abstinence from heroin.

Pharmacogenomics. 2018 Mar;19(4):333-341. doi: 10.2217/pgs-2017-0179. Epub 2018 Feb 21.

Dopamine gene variants in opioid addiction: comparison of dependent patients, nondependent users and healthy controls.

Pharmacogenomics. 2018 Jan;19(2):95-104. doi: 10.2217/pgs-2017-0134. Epub 2017 Dec 6.

References

A transmission/disequilibrium test that allows for genotyping errors in the analysis of single-nucleotide polymorphism data.

Am J Hum Genet. 2001 Aug;69(2):371-80. doi: 10.1086/321981. Epub 2001 Jul 5.

The effect that genotyping errors have on the robustness of common linkage-disequilibrium measures.

Am J Hum Genet. 2001 Jun;68(6):1447-56. doi: 10.1086/320607. Epub 2001 May 16.

A confidence-set approach for finding tightly linked genomic regions.

Am J Hum Genet. 2001 May;68(5):1219-28. doi: 10.1086/320116. Epub 2001 Apr 13.

Selecting SNPs in two-stage analysis of disease association data: a model-free approach.

Ann Hum Genet. 2000 Sep;64(Pt 5):413-7. doi: 10.1046/j.1469-1809.2000.6450413.x.

The beanbag lives on.

Nature. 2001 Feb 15;409(6822):771. doi: 10.1038/35057409.

A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation.

Genome Res. 2001 Mar;11(3):458-70. doi: 10.1101/gr.172901.

Genetic analysis of case/control data using estimated haplotype frequencies: application to APOE locus variation and Alzheimer's disease.

Genome Res. 2001 Jan;11(1):143-51. doi: 10.1101/gr.148401.

Variance component methods for detecting complex trait loci.

Adv Genet. 2001;42:151-81. doi: 10.1016/s0065-2660(01)42021-9.

Scan statistics to scan markers for susceptibility genes.

Proc Natl Acad Sci U S A. 2000 Aug 15;97(17):9615-7. doi: 10.1073/pnas.170179197.

The power of genomic control.

Am J Hum Genet. 2000 Jun;66(6):1933-44. doi: 10.1086/302929. Epub 2000 May 8.
