Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, British Columbia, Canada.
Department of Mathematics and Statistics, Dalhousie University, Halifax, Nova Scotia, Canada.
Genet Epidemiol. 2023 Feb;47(1):78-94. doi: 10.1002/gepi.22502. Epub 2022 Sep 1.
Linkage analysis maps genetic loci for a heritable trait by identifying genomic regions with excess relatedness among individuals with similar trait values. Analysis may be conducted on related individuals from families, or on samples of unrelated individuals from a population. For allelically heterogeneous traits, population-based linkage analysis can be more powerful than genotypic-association analysis. Here, we focus on linkage analysis in a population sample, but use sequences rather than individuals as our unit of observation. Earlier investigations of sequence-based linkage mapping relied on known sequence relatedness, whereas we infer relatedness from the sequence data. We propose two ways to associate similarity in relatedness of sequences with similarity in their trait values and compare the resulting linkage methods to two genotypic-association methods. We also introduce a procedure to label case sequences as potential carriers or noncarriers of causal variants after an association has been found. This post hoc labeling of case sequences is based on inferred relatedness to other case sequences. Our simulation results indicate that methods based on sequence relatedness improve localization and perform as well as genotypic-association methods for detecting rare causal variants. Sequence-based linkage analysis therefore has potential to fine-map allelically heterogeneous disease traits.
连锁分析通过识别具有相似性状值的个体之间过度相关的基因组区域,为可遗传性状定位遗传基因座。分析可以在具有相似特征的家族内相关个体上进行,也可以在来自群体的无关联个体样本上进行。对于等位基因异质性性状,基于群体的连锁分析比基因型关联分析更有效。在这里,我们关注的是群体样本中的连锁分析,但使用序列而不是个体作为我们的观察单位。早期基于序列的连锁映射研究依赖于已知的序列相关性,而我们则从序列数据中推断相关性。我们提出了两种方法,将序列相关性的相似性与性状值的相似性联系起来,并将由此产生的连锁方法与两种基因型关联方法进行比较。我们还引入了一种在发现关联后将病例序列标记为潜在携带或不携带因果变异的程序。这种病例序列的事后标记是基于与其他病例序列的推断相关性。我们的模拟结果表明,基于序列相关性的方法可以改善定位,并与检测罕见因果变异的基因型关联方法一样有效。因此,基于序列的连锁分析有可能对等位基因异质性疾病特征进行精细映射。