Suppr超能文献

连锁不平衡对不完全家系连锁分析的影响。

The effect of linkage disequilibrium on linkage analysis of incomplete pedigrees.

机构信息

Department of Psychiatry, University of Pennsylvania School of Medicine, 353 Market Street, Philadelphia, PA, USA.

出版信息

BMC Genet. 2005 Dec 30;6 Suppl 1(Suppl 1):S6. doi: 10.1186/1471-2156-6-S1-S6.

Abstract

Dense SNP maps can be highly informative for linkage studies. But when parental genotypes are missing, multipoint linkage scores can be inflated in regions with substantial marker-marker linkage disequilibrium (LD). Such regions were observed in the Affymetrix SNP genotypes for the Genetic Analysis Workshop 14 (GAW14) Collaborative Study on the Genetics of Alcoholism (COGA) dataset, providing an opportunity to test a novel simulation strategy for studying this problem. First, an inheritance vector (with or without linkage present) is simulated for each replicate, i.e., locations of recombinations and transmission of parental chromosomes are determined for each meiosis. Then, two sets of founder haplotypes are superimposed onto the inheritance vector: one set that is inferred from the actual data and which contains the pattern of LD; and one set created by randomly selecting parental alleles based on the known allele frequencies, with no correlation (LD) between markers. Applying this strategy to a map of 176 SNPs (66 Mb of chromosome 7) for 100 replicates of 116 sibling pairs, significant inflation of multipoint linkage scores was observed in regions of high LD when parental genotypes were set to missing, with no linkage present. Similar inflation was observed in analyses of the COGA data for these affected sib pairs with parental genotypes set to missing, but not after reducing the marker map until r2 between any pair of markers was <or= 0.05. Additional simulation studies of affected sib pairs assuming uniform LD throughout a marker map demonstrated inflation of significance levels at r2 values greater than 0.05. When genotypes are available only from two affected siblings in many families in a sample, trimming SNP maps to limit r2 to 0-0.05 for all marker pairs will prevent inflation of linkage scores without sacrificing substantial linkage information. Simulation studies on the observed pedigree structures and map can also be used to determine the effect of LD on a particular study.

摘要

高密度 SNP 图谱对于连锁研究非常有用。但是当双亲基因型缺失时,在存在大量标记-标记连锁不平衡(LD)的区域,多点连锁评分会偏高。在基因分析研讨会 14(GAW14)酒精中毒遗传合作研究(COGA)数据集的 Affymetrix SNP 基因型中观察到了这样的区域,这为测试一种研究这个问题的新模拟策略提供了机会。首先,为每个重复模拟一个遗传向量(存在或不存在连锁),即确定每个减数分裂中重组和双亲染色体传递的位置。然后,将两组创始单倍型叠加到遗传向量上:一组是根据实际数据推断的,包含 LD 模式;另一组是根据已知的等位基因频率随机选择双亲等位基因创建的,标记之间没有相关性(LD)。将该策略应用于 176 个 SNP(7 号染色体 66Mb)的图谱,对 116 对同胞的 100 个重复进行分析,当设置双亲基因型缺失但不存在连锁时,在 LD 较高的区域观察到多点连锁评分显著偏高。在对这些缺失双亲基因型的 COGA 数据的分析中也观察到了类似的偏倚,但在减少标记图谱直到任何一对标记之间的 r2 值<或=0.05 后则没有。对假定整个标记图谱具有均匀 LD 的受影响同胞对进行的附加模拟研究表明,在 r2 值大于 0.05 时,显著水平会膨胀。在一个样本中,许多家庭中只有两个受影响的兄弟姐妹提供基因型时,修剪 SNP 图谱以将所有标记对的 r2 限制在 0-0.05,可以防止连锁评分的膨胀,而不会牺牲大量的连锁信息。还可以使用对观察到的家系结构和图谱的模拟研究来确定 LD 对特定研究的影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0678/1866723/6e713f67bf9f/1471-2156-6-S1-S6-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验