Large upward bias in estimation of locus-specific effects from genomewide scans.

Author information

Göring H H, Terwilliger J D, Blangero J

Affiliation

Department of Genetics, Southwest Foundation for Biomedical Research, San Antonio, TX 78245-0549, USA.

Publication information

Am J Hum Genet. 2001 Dec;69(6):1357-69. doi: 10.1086/324471. Epub 2001 Oct 9.

DOI:10.1086/324471

PMID:11593451

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1235546/

Abstract

The primary goal of a genomewide scan is to estimate the genomic locations of genes influencing a trait of interest. It is sometimes said that a secondary goal is to estimate the phenotypic effects of each identified locus. Here, it is shown that these two objectives cannot be met reliably by use of a single data set of a currently realistic size. Simulation and analytical results, based on variance-components linkage analysis as an example, demonstrate that estimates of locus-specific effect size at genomewide LOD score peaks tend to be grossly inflated and can even be virtually independent of the true effect size, even for studies on large samples when the true effect size is small. However, the bias diminishes asymptotically. The explanation for the bias is that the LOD score is a function of the locus-specific effect-size estimate, such that there is a high correlation between the observed statistical significance and the effect-size estimate. When the LOD score is maximized over the many pointwise tests being conducted throughout the genome, the locus-specific effect-size estimate is therefore effectively maximized as well. We argue that attempts at bias correction give unsatisfactory results, and that pointwise estimation in an independent data set may be the only way of obtaining reliable estimates of locus-specific effect, and then only if one does not condition on statistical significance being obtained. We further show that the same factors causing this bias are responsible for frequent failures to replicate initial claims of linkage or association for complex traits, even when the initial localization is, in fact, correct. The findings of this study have wide-ranging implications, as they apply to all statistical methods of gene localization. It is hoped that, by keeping this bias in mind, we will more realistically interpret and extrapolate from the results of genomewide scans.
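The selection mechanism described in the abstract can be illustrated with a small simulation. This is a minimal sketch under simplifying assumptions, not the authors' variance-components analysis: each locus gets a normally distributed effect estimate with a common standard error, one locus has a true effect and the rest are null, and the "peak" is simply the locus with the largest estimate. The function names, the number of loci, and the standard error are all hypothetical choices made for illustration.

```python
import random
import statistics


def simulate_peak_estimate(true_effect, n_null_loci, std_error, rng):
    """Return the effect estimate at the locus with the largest test statistic.

    Toy model (an assumption, not the paper's method): every pointwise
    estimate is unbiased and normal with the same standard error, so the
    largest statistic coincides with the largest estimate.
    """
    # One causal locus whose pointwise estimate is unbiased.
    estimates = [rng.gauss(true_effect, std_error)]
    # Null loci with a true effect of zero.
    estimates += [rng.gauss(0.0, std_error) for _ in range(n_null_loci)]
    # Maximizing the statistic over the scan also maximizes the estimate.
    return max(estimates)


def mean_peak_estimate(true_effect, n_null_loci=400, std_error=0.1,
                       n_reps=2000, seed=1):
    """Average the peak estimate over many simulated scans."""
    rng = random.Random(seed)
    peaks = [
        simulate_peak_estimate(true_effect, n_null_loci, std_error, rng)
        for _ in range(n_reps)
    ]
    return statistics.fmean(peaks)


if __name__ == "__main__":
    for effect in (0.0, 0.05, 0.1):
        scan = mean_peak_estimate(effect)
        pointwise = mean_peak_estimate(effect, n_null_loci=0)
        print(f"true={effect:.2f}  scan peak={scan:.3f}  single locus={pointwise:.3f}")
```

Running the script prints, for several true effect sizes, the average estimate at the scan-wide peak next to the average estimate from a single prespecified locus. Under these assumptions the single-locus estimate stays close to the true value, while the peak estimate is pushed upward by the maximization step.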


Similar articles

Reduction of selection bias in genomewide studies by resampling.

Genet Epidemiol. 2005 May;28(4):352-67. doi: 10.1002/gepi.20068.

Replication of small effect quantitative trait loci for behavioral traits facilitated by estimation of effect size from independent cohorts.

Genes Brain Behav. 2006 Jul;5(5):404-12. doi: 10.1111/j.1601-183X.2005.00174.x.

Linkage analysis in the presence of errors II: marker-locus genotyping errors modeled with hypercomplex recombination fractions.

Am J Hum Genet. 2000 Mar;66(3):1107-18. doi: 10.1086/302798.

A genomewide search using an original pairwise sampling approach for large genealogies identifies a new locus for total and low-density lipoprotein cholesterol in two genetically differentiated isolates of Sardinia.

Am J Hum Genet. 2004 Dec;75(6):1015-31. doi: 10.1086/426155. Epub 2004 Oct 11.

Locus-specific heritability estimation via the bootstrap in linkage scans for quantitative trait loci.

Hum Hered. 2006;62(2):84-96. doi: 10.1159/000096096. Epub 2006 Oct 12.

Evaluating the results of genomewide linkage scans of complex traits by locus counting.

Am J Hum Genet. 2002 Nov;71(5):1175-82. doi: 10.1086/342976. Epub 2002 Sep 25.

A study comparing precision of the maximum multipoint heterogeneity LOD statistic to three model-free multipoint linkage methods.

Genet Epidemiol. 2001 Dec;21(4):315-25. doi: 10.1002/gepi.1037.

Genomewide linkage scan for nicotine dependence: identification of a chromosome 5 risk locus.

Biol Psychiatry. 2007 Jan 1;61(1):119-26. doi: 10.1016/j.biopsych.2006.08.023. Epub 2006 Nov 1.

A confidence-set approach for finding tightly linked genomic regions.

Am J Hum Genet. 2001 May;68(5):1219-28. doi: 10.1086/320116. Epub 2001 Apr 13.

Cited by

Winner's curse in rare variant analysis: effect size estimation bias depends on effect direction and the association method used.

Front Genet. 2025 Aug 8;16:1416673. doi: 10.3389/fgene.2025.1416673. eCollection 2025.

Genetic regulation of sperm DNA methylation in cattle through meQTL mapping.

BMC Genomics. 2025 Aug 22;26(1):771. doi: 10.1186/s12864-025-11934-x.

Trans-regulatory loci shape natural variation of gene expression plasticity in Arabidopsis.

Genetics. 2025 Aug 6;230(4). doi: 10.1093/genetics/iyaf116.

DyNDG: Identifying Leukemia-related Genes Based on Time-series Dynamic Network by Integrating Differential Genes.

Genomics Proteomics Bioinformatics. 2025 May 30;23(2). doi: 10.1093/gpbjnl/qzaf037.

Evaluating the roles of drift and selection in trait loss along an elevational gradient.

Evolution. 2025 Jul 18;79(7):1322-1333. doi: 10.1093/evolut/qpaf078.

Investigating Motor Coordination Using BXD Recombinant Inbred Mice to Model the Genetic Underpinnings of Developmental Coordination Disorder.

Genes Brain Behav. 2025 Apr;24(2):e70014. doi: 10.1111/gbb.70014.

Estimation of total mediation effect for a binary trait in a case-control study for high-dimensional omics mediators.

bioRxiv. 2025 Feb 2:2025.01.28.635396. doi: 10.1101/2025.01.28.635396.

Improving Replication in Endometrial Omics: Understanding the Influence of the Menstrual Cycle.

Int J Mol Sci. 2025 Jan 20;26(2):857. doi: 10.3390/ijms26020857.

Associations Between TCF7L2, PPARγ, and KCNJ11 Genotypes and Insulin Response to an Oral Glucose Tolerance Test: A Systematic Review.

Mol Nutr Food Res. 2025 Feb;69(3):e202400561. doi: 10.1002/mnfr.202400561. Epub 2025 Jan 19.

Evaluating the Roles of Drift and Selection in Trait Loss along an Elevational Gradient.

bioRxiv. 2024 Nov 18:2024.06.12.598645. doi: 10.1101/2024.06.12.598645.

References

Sequential tests for the detection of linkage.

Am J Hum Genet. 1955 Sep;7(3):277-318.

Power of variance component linkage analysis to detect quantitative trait loci.

Ann Hum Genet. 1999 Nov;63(Pt 6):545-63. doi: 10.1017/S0003480099007848.

Variance component methods for detecting complex trait loci.

Adv Genet. 2001;42:151-81. doi: 10.1016/s0065-2660(01)42021-9.

How many diseases does it take to map a gene with SNPs?

Nat Genet. 2000 Oct;26(2):151-7. doi: 10.1038/79866.

Bias and Sampling Error of the Estimated Proportion of Genotypic Variance Explained by Quantitative Trait Loci Determined From Experimental Data in Maize Using Cross Validation and Validation With Independent Samples.

Genetics. 2000 Apr;154(3):1839-1849.

Linkage analysis in the presence of errors IV: joint pseudomarker analysis of linkage and/or linkage disequilibrium on a mixture of pedigrees and singletons when the mode of inheritance cannot be accurately specified.

Am J Hum Genet. 2000 Apr;66(4):1310-27. doi: 10.1086/302845. Epub 2000 Mar 23.

Gene mapping in the 20th and 21st centuries: statistical methods, data analysis, and experimental design.

Hum Biol. 2000 Feb;72(1):63-132.

Quantitative trait locus mapping using human pedigrees.

Hum Biol. 2000 Feb;72(1):35-62.

Linkage analysis in the presence of errors II: marker-locus genotyping errors modeled with hypercomplex recombination fractions.

Am J Hum Genet. 2000 Mar;66(3):1107-18. doi: 10.1086/302798.

Linkage analysis in the presence of errors I: complex-valued recombination fractions and complex phenotypes.

Am J Hum Genet. 2000 Mar;66(3):1095-106. doi: 10.1086/302797.
