Suppr超能文献

设计实验以精细定位家畜群体中的数量性状基因座。

Design of experiments for fine-mapping quantitative trait loci in livestock populations.

机构信息

Leibniz Institute for Farm Animal Biology, Institute of Genetics and Biometry, Dummerstorf, 18196, Germany.

University Medicine Greifswald, Department of Psychiatry and Psychotherapy, Greifswald, 17475, Germany.

出版信息

BMC Genet. 2020 Jun 29;21(1):66. doi: 10.1186/s12863-020-00871-1.

Abstract

BACKGROUND

Single nucleotide polymorphisms (SNPs) which capture a significant impact on a trait can be identified with genome-wide association studies. High linkage disequilibrium (LD) among SNPs makes it difficult to identify causative variants correctly. Thus, often target regions instead of single SNPs are reported. Sample size has not only a crucial impact on the precision of parameter estimates, it also ensures that a desired level of statistical power can be reached. We study the design of experiments for fine-mapping of signals of a quantitative trait locus in such a target region.

METHODS

A multi-locus model allows to identify causative variants simultaneously, to state their positions more precisely and to account for existing dependencies. Based on the commonly applied SNP-BLUP approach, we determine the z-score statistic for locally testing non-zero SNP effects and investigate its distribution under the alternative hypothesis. This quantity employs the theoretical instead of observed dependence between SNPs; it can be set up as a function of paternal and maternal LD for any given population structure.

RESULTS

We simulated multiple paternal half-sib families and considered a target region of 1 Mbp. A bimodal distribution of estimated sample size was observed, particularly if more than two causative variants were assumed. The median of estimates constituted the final proposal of optimal sample size; it was consistently less than sample size estimated from single-SNP investigation which was used as a baseline approach. The second mode pointed to inflated sample sizes and could be explained by blocks of varying linkage phases leading to negative correlations between SNPs. Optimal sample size increased almost linearly with number of signals to be identified but depended much stronger on the assumption on heritability. For instance, three times as many samples were required if heritability was 0.1 compared to 0.3. An R package is provided that comprises all required tools.

CONCLUSIONS

Our approach incorporates information about the population structure into the design of experiments. Compared to a conventional method, this leads to a reduced estimate of sample size enabling the resource-saving design of future experiments for fine-mapping of candidate variants.

摘要

背景

单核苷酸多态性(SNP)可以通过全基因组关联研究来识别对性状有显著影响的 SNP。SNP 之间的高度连锁不平衡(LD)使得正确识别因果变异变得困难。因此,通常报告的是目标区域而不是单个 SNP。样本量不仅对参数估计的精度有至关重要的影响,而且还确保可以达到所需的统计功效水平。我们研究了在目标区域中精细映射数量性状基因座信号的实验设计。

方法

多基因模型允许同时识别因果变异,更精确地确定其位置,并考虑到现有依赖关系。基于常用的 SNP-BLUP 方法,我们确定了用于局部检验非零 SNP 效应的 z 分数统计量,并在替代假设下研究了其分布。该数量使用 SNP 之间的理论而不是观察到的依赖关系;它可以为任何给定的群体结构设置为父本和母本 LD 的函数。

结果

我们模拟了多个父本半同胞家庭,并考虑了 1 Mbp 的目标区域。观察到估计样本量的双峰分布,特别是如果假设了两个以上的因果变异。中位数的估计构成了最终的最佳样本量建议;它始终小于用作基线方法的单 SNP 研究的估计样本量。第二个模式指向膨胀的样本量,可以用不同连锁阶段的块来解释,导致 SNP 之间的负相关。最佳样本量几乎与要识别的信号数量呈线性增加,但更多地取决于遗传力的假设。例如,如果遗传力为 0.1,则需要三倍的样本量,而不是 0.3。提供了一个包含所有必需工具的 R 包。

结论

我们的方法将关于群体结构的信息纳入实验设计中。与传统方法相比,这会导致样本量的估计减少,从而能够节省资源,设计未来精细映射候选变体的实验。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddd9/7324978/90f56b7e22d1/12863_2020_871_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验