A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation.

Authors

Nelson M R, Kardia S L, Ferrell R E, Sing C F

Affiliation

Department of Human Genetics, University of Michigan, Ann Arbor, Michigan 48109-0618, USA.

Publication information

Genome Res. 2001 Mar;11(3):458-70. doi: 10.1101/gr.172901.

PMID:11230170

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC311041/

Abstract

Recent advances in genome research have accelerated the process of locating candidate genes and the variable sites within them and have simplified the task of genotype measurement. The development of statistical and computational strategies to utilize information on hundreds (soon thousands) of variable loci to investigate the relationships between genome variation and phenotypic variation has not kept pace, particularly for quantitative traits that do not follow simple Mendelian patterns of inheritance. We present here the combinatorial partitioning method (CPM) that examines multiple genes, each containing multiple variable loci, to identify partitions of multilocus genotypes that predict interindividual variation in quantitative trait levels. We illustrate this method with an application to plasma triglyceride levels collected on 188 males, ages 20-60 yr, ascertained without regard to health status, from Rochester, Minnesota. Genotype information included measurements at 18 diallelic loci in six coronary heart disease candidate susceptibility gene regions: APOA1-C3-A4, APOB, APOE, LDLR, LPL, and PON1. To illustrate the CPM, we evaluated all possible partitions of two-locus genotypes into two to nine partitions (approximately 10^6 evaluations). We found that many combinations of loci are involved in sets of genotypic partitions that predict triglyceride variability and that the most predictive sets show nonadditivity. These results suggest that traditional methods of building multilocus models that rely on statistically significant marginal, single-locus effects may fail to identify combinations of loci that best predict trait variability. The CPM offers a strategy for exploring the high-dimensional genotype state space so as to predict the quantitative trait variation in the population at large that does not require the conditioning of the analysis on a prespecified genetic model.
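
The search described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that each pair of diallelic loci yields nine observed two-locus genotypes, and that a partition is scored by the proportion of trait variance explained by the partition means; the abstract states neither detail. The function names set_partitions and variance_explained are hypothetical.

```python
from math import comb


def set_partitions(items, k):
    """Yield every partition of `items` into exactly k non-empty groups.

    Requires 1 <= k <= len(items).
    """
    items = list(items)
    if k == 1:
        yield [items]
        return
    if len(items) == k:
        yield [[x] for x in items]
        return
    first, rest = items[0], items[1:]
    # Case 1: `first` joins one of the groups of a k-partition of the rest.
    for part in set_partitions(rest, k):
        for i in range(len(part)):
            yield part[:i] + [[first] + part[i]] + part[i + 1:]
    # Case 2: `first` forms a group of its own.
    for part in set_partitions(rest, k - 1):
        yield [[first]] + part


def variance_explained(genotypes, trait, partition):
    """Between-group sum of squares divided by total sum of squares.

    ASSUMPTION: this scoring rule is illustrative; the abstract does not
    specify how partitions are evaluated.
    """
    group_of = {g: i for i, grp in enumerate(partition) for g in grp}
    n = len(trait)
    grand_mean = sum(trait) / n
    ss_total = sum((y - grand_mean) ** 2 for y in trait)
    sums, counts = {}, {}
    for g, y in zip(genotypes, trait):
        i = group_of[g]
        sums[i] = sums.get(i, 0.0) + y
        counts[i] = counts.get(i, 0) + 1
    ss_between = sum(
        counts[i] * (sums[i] / counts[i] - grand_mean) ** 2 for i in counts
    )
    return ss_between / ss_total


# Partitions of nine two-locus genotypes into two to nine groups.
per_pair = sum(1 for k in range(2, 10) for _ in set_partitions(range(9), k))
# Number of distinct pairs among 18 loci.
locus_pairs = comb(18, 2)
print(per_pair, locus_pairs, per_pair * locus_pairs)
```

Under these assumptions there are 21,146 candidate partitions per locus pair and 153 locus pairs, which gives about 3.2 million evaluations. That is the same order of magnitude as the "approximately 10^6" quoted above; the exact count in the paper may differ, for example if some two-locus genotypes are not observed in the sample.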

Similar articles

Contrasting multi-site genotypic distributions among discordant quantitative phenotypes: the APOA1/C3/A4/A5 gene cluster and cardiovascular disease risk factors.

Genet Epidemiol. 2006 Sep;30(6):508-18. doi: 10.1002/gepi.20163.

A genome-wide association study identified loci for yield component traits in sugarcane (Saccharum spp.).

PLoS One. 2019 Jul 18;14(7):e0219843. doi: 10.1371/journal.pone.0219843. eCollection 2019.

Linkage analysis of quantitative traits in randomly ascertained pedigrees: comparison of penetrance-based and variance component analysis.

Genet Epidemiol. 2001;21 Suppl 1:S783-8. doi: 10.1002/gepi.2001.21.s1.s783.

Ionizing radiation and genetic risks. VI. Chronic multifactorial diseases: a review of epidemiological and genetical aspects of coronary heart disease, essential hypertension and diabetes mellitus.

Mutat Res. 1999 Jan;436(1):21-57. doi: 10.1016/s1383-5742(98)00017-9.

Patterns of genetic polymorphism maintained by fluctuating selection with overlapping generations.

Theor Popul Biol. 1996 Aug;50(1):31-65. doi: 10.1006/tpbi.1996.0022.

On coding genotypes for genetic markers with multiple alleles in genetic association study of quantitative traits.

BMC Genet. 2011 Sep 21;12:82. doi: 10.1186/1471-2156-12-82.

Molecular-marker-facilitated investigations of quantitative-trait loci in maize. I. Numbers, genomic distribution and types of gene action.

Genetics. 1987 May;116(1):113-25. doi: 10.1093/genetics/116.1.113.

Accuracy of prediction of simulated polygenic phenotypes and their underlying quantitative trait loci genotypes using real or imputed whole-genome markers in cattle.

Genet Sel Evol. 2015 Dec 23;47:99. doi: 10.1186/s12711-015-0179-4.

Genetic structure of five susceptibility gene regions for coronary artery disease: disequilibria within and among regions.

Hum Genet. 1998 Sep;103(3):346-54. doi: 10.1007/s004390050828.

Cited by

Distinct network patterns emerge from Cartesian and XOR epistasis models: a comparative network science analysis.

BioData Min. 2024 Dec 28;17(1):61. doi: 10.1186/s13040-024-00413-w.

Distinct Network Patterns Emerge from Cartesian and XOR Epistasis Models: A Comparative Network Science Analysis.

Res Sq. 2024 May 23:rs.3.rs-4392123. doi: 10.21203/rs.3.rs-4392123/v1.

Poor statistical power in population-based association study of gene interaction.

BMC Med Genomics. 2024 Apr 27;17(1):111. doi: 10.1186/s12920-024-01884-w.

Interaction models matter: an efficient, flexible computational framework for model-specific investigation of epistasis.

BioData Min. 2024 Feb 28;17(1):7. doi: 10.1186/s13040-024-00358-0.

A Review of Feature Selection Methods for Machine Learning-Based Disease Risk Prediction.

Front Bioinform. 2022 Jun 27;2:927312. doi: 10.3389/fbinf.2022.927312. eCollection 2022.

An Application of the Patient Rule-Induction Method to Detect Clinically Meaningful Subgroups from Failed Phase III Clinical Trials.

Int J Clin Biostat Biom. 2021;7(1). doi: 10.23937/2469-5831/1510038. Epub 2021 Jun 28.

MIDESP: Mutual Information-Based Detection of Epistatic SNP Pairs for Qualitative and Quantitative Phenotypes.

Biology (Basel). 2021 Sep 16;10(9):921. doi: 10.3390/biology10090921.

Genotype Pattern Mining for Pairs of Interacting Variants Underlying Digenic Traits.

Genes (Basel). 2021 Jul 28;12(8):1160. doi: 10.3390/genes12081160.

Machine learning approaches for the prediction of bone mineral density by using genomic and phenotypic data of 5130 older men.

Sci Rep. 2021 Feb 24;11(1):4482. doi: 10.1038/s41598-021-83828-3.

JS-MA: A Jensen-Shannon Divergence Based Method for Mapping Genome-Wide Associations on Multiple Diseases.

Front Genet. 2020 Oct 30;11:507038. doi: 10.3389/fgene.2020.507038. eCollection 2020.
