Sample reproducibility of genetic association using different multimarker TDTs in genome-wide association studies: characterization and a new approach.

Affiliation

Departamento de Lenguajes y Sistemas Informáticos, ETS Ingeniera Informática y de Telecomunicaciones-CITIC, Universidad de Granada, Granada, Spain.

Publication information

PLoS One. 2012;7(2):e29613. doi: 10.1371/journal.pone.0029613. Epub 2012 Feb 17.

DOI:10.1371/journal.pone.0029613

PMID:22363405

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3281822/

Abstract

Multimarker Transmission/Disequilibrium Tests (TDTs) are association tests that are very robust to population admixture and structure, and they may be used to identify susceptibility loci in genome-wide association studies. Multimarker TDTs using several markers may increase power by capturing high-degree associations. However, there is also a risk of spurious associations and of power reduction due to the increase in degrees of freedom. In this study we show that associations found by tests built on simple null hypotheses are highly reproducible in a second independent data set regardless of the number of markers. As a test exhibiting this feature to its maximum, we introduce the multimarker 2-Groups TDT (mTDT(2G)), a test which, under the hypothesis of no linkage, asymptotically follows a χ² distribution with 1 degree of freedom regardless of the number of markers. The statistic requires the division of parental haplotypes into two groups: a disease-susceptibility group and a disease-protective group. We assessed the behavior of the test by performing an extensive simulation study as well as a real-data study using several data sets from two complex diseases. We show that the mTDT(2G) test is highly efficient and achieves the highest power among all the tests used, even when the null hypothesis is tested in a second independent data set. Therefore, mTDT(2G) is a very promising multimarker TDT for genome-wide searches for disease susceptibility loci, and it may be used as a preprocessing step in the construction of more accurate genetic models to predict individual susceptibility to complex diseases.
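To illustrate why a two-group split can yield a statistic with 1 degree of freedom whatever the number of markers, here is a minimal sketch that applies the classic McNemar-type TDT form, (b − c)² / (b + c), to parental haplotypes already labeled as susceptibility ("S") or protective ("P"). It is not the paper's exact mTDT(2G) definition: the abstract does not say how haplotypes are assigned to groups, so the labels, the input layout, and the function name `two_group_tdt` are all assumptions made for illustration.

```python
import math


def two_group_tdt(transmissions):
    """McNemar-type TDT on two haplotype groups (illustrative sketch only).

    transmissions: iterable of (transmitted, untransmitted) pairs, one per
    parent, where each element is the hypothetical group label "S"
    (susceptibility) or "P" (protective) of that parental haplotype.
    """
    # Only parents carrying one haplotype from each group are informative.
    b = sum(1 for t, u in transmissions if t == "S" and u == "P")
    c = sum(1 for t, u in transmissions if t == "P" and u == "S")
    n = b + c
    if n == 0:
        raise ValueError("no informative (S/P) parents")
    stat = (b - c) ** 2 / n
    # Upper tail of a chi-square with 1 df: P(X > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Toy counts, not data from the study: 30 S-transmitting, 10 P-transmitting
# informative parents, plus 5 uninformative S/S parents that are ignored.
toy = [("S", "P")] * 30 + [("P", "S")] * 10 + [("S", "S")] * 5
stat, p_value = two_group_tdt(toy)
print(stat, p_value)
```

Because every haplotype, however many markers it spans, is collapsed to one of two labels, the comparison always reduces to a single contrast between two counts, which is what keeps the reference distribution at 1 degree of freedom in this simplified form.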

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f26e/3281822/9e708172d6f9/pone.0029613.g001.jpg

Similar articles

Genome-wide association filtering using a highly locus-specific transmission/disequilibrium test.

Hum Genet. 2010 Sep;128(3):325-44. doi: 10.1007/s00439-010-0854-z. Epub 2010 Jul 6.

Increasing power by using haplotype similarity in a multimarker transmission/disequilibrium test.

J Bioinform Comput Biol. 2013 Apr;11(2):1250014. doi: 10.1142/S021972001250014X. Epub 2012 Jul 11.

Transmission/disequilibrium test based on haplotype sharing for tightly linked markers.

Am J Hum Genet. 2003 Sep;73(3):566-79. doi: 10.1086/378205. Epub 2003 Aug 15.

Power of transmission/disequilibrium tests in admixed populations.

Genet Epidemiol. 2008 Jul;32(5):434-44. doi: 10.1002/gepi.20316.

Haplotype sharing transmission/disequilibrium tests that allow for genotyping errors.

Genet Epidemiol. 2005 May;28(4):341-51. doi: 10.1002/gepi.20066.

Comparison of multimarker logistic regression models, with application to a genomewide scan of schizophrenia.

BMC Genet. 2010 Sep 9;11:80. doi: 10.1186/1471-2156-11-80.

High Resolution Haplotype Analyses of Classical HLA Genes in Families With Multiple Sclerosis Highlights the Role of HLA-DP Alleles in Disease Susceptibility.

Front Immunol. 2021 May 25;12:644838. doi: 10.3389/fimmu.2021.644838. eCollection 2021.

The effect of missing data on linkage disequilibrium mapping and haplotype association analysis in the GAW14 simulated datasets.

BMC Genet. 2005 Dec 30;6 Suppl 1(Suppl 1):S151. doi: 10.1186/1471-2156-6-S1-S151.

Multimarker analysis and imputation of multiple platform pooling-based genome-wide association studies.

Bioinformatics. 2008 Sep 1;24(17):1896-902. doi: 10.1093/bioinformatics/btn333. Epub 2008 Jul 10.

Cited by

A comparison of genomic profiles of complex diseases under different models.

BMC Med Genomics. 2016 Jan 19;9:3. doi: 10.1186/s12920-015-0157-2.

Current strategies for mutation detection in phenotype-driven screens utilising next generation sequencing.

Mamm Genome. 2015 Oct;26(9-10):486-500. doi: 10.1007/s00335-015-9603-x. Epub 2015 Oct 8.
