• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

连锁不平衡度量估计量的比较。

Comparison of estimators for measures of linkage disequilibrium.

作者信息

Scholz Markus, Hasenclever Dirk

机构信息

University of Leipzig, Germany.

出版信息

Int J Biostat. 2010;6(1):Article 1. doi: 10.2202/1557-4679.1162.

DOI:10.2202/1557-4679.1162
PMID:21969963
Abstract

The measurement of biallelic pair-wise association called linkage disequilibrium (LD) is an important issue in order to understand the genomic architecture. A plethora of measures of association in two by two tables have been proposed in the literature. Beside the problem of choosing an appropriate measure, the problem of their estimation has been neglected in the literature. It needs to be emphasized that the definition of a measure and the choice of an estimator function for it are conceptually unrelated tasks. In this paper, we compare the performance of various estimators for the three popular LD measures D', r and Y in a simulation study for small to moderate samples sizes (N<=500). The usual frequency-plug-in estimators can lead to unreliable or undefined estimates. Estimators based on the computationally expensive volume measures have been proposed recently as a remedy to this well-known problem. We confirm that volume estimators have better expected mean square error than the naive plug-in estimators. But they are outperformed by estimators plugging-in easy to calculate non-informative Bayesian probability estimates into the theoretical formulae for the measures. Fully Bayesian estimators with non-informative Dirichlet priors have comparable accuracy but are computationally more expensive. We recommend the use of non-informative Bayesian plug-in estimators based on Jeffreys' prior, in particular when dealing with SNP array data where the occurrence of small table entries and table margins is likely.

摘要

为了理解基因组结构,对称为连锁不平衡(LD)的双等位基因成对关联进行测量是一个重要问题。文献中已经提出了大量用于二乘二表格的关联测量方法。除了选择合适测量方法的问题外,其估计问题在文献中一直被忽视。需要强调的是,测量方法的定义及其估计函数的选择在概念上是不相关的任务。在本文中,我们在一个针对小到中等样本量(N<=500)的模拟研究中,比较了三种常用LD测量方法D'、r和Y的各种估计器的性能。通常的频率代入估计器可能会导致不可靠或未定义的估计。最近有人提出基于计算成本高昂的体积测量的估计器来解决这个众所周知的问题。我们证实,体积估计器的期望均方误差比简单的代入估计器更好。但是,将易于计算的非信息性贝叶斯概率估计代入测量方法的理论公式中的估计器表现更优。具有非信息性狄利克雷先验的完全贝叶斯估计器具有相当的准确性,但计算成本更高。我们建议使用基于杰弗里斯先验的非信息性贝叶斯代入估计器,特别是在处理可能出现小表格条目和表格边缘的单核苷酸多态性(SNP)阵列数据时。

相似文献

1
Comparison of estimators for measures of linkage disequilibrium.连锁不平衡度量估计量的比较。
Int J Biostat. 2010;6(1):Article 1. doi: 10.2202/1557-4679.1162.
2
Bayesian estimates of linkage disequilibrium.连锁不平衡的贝叶斯估计。
BMC Genet. 2007 Jun 25;8:36. doi: 10.1186/1471-2156-8-36.
3
An evaluation of a novel estimator of linkage disequilibrium.一种新型连锁不平衡估计量的评估。
Heredity (Edinb). 2013 Oct;111(4):275-85. doi: 10.1038/hdy.2013.46. Epub 2013 Aug 7.
4
A comparison of different strategies for computing confidence intervals of the linkage disequilibrium measure D'.连锁不平衡度量D'的置信区间计算的不同策略比较。
Pac Symp Biocomput. 2004:128-39. doi: 10.1142/9789812704856_0013.
5
Detecting Recombination Hotspots from Patterns of Linkage Disequilibrium.从连锁不平衡模式中检测重组热点
G3 (Bethesda). 2016 Aug 9;6(8):2265-71. doi: 10.1534/g3.116.029587.
6
Efficient Estimation of Realized Kinship from Single Nucleotide Polymorphism Genotypes.基于单核苷酸多态性基因型的实现亲缘关系的有效估计
Genetics. 2017 Mar;205(3):1063-1078. doi: 10.1534/genetics.116.197004. Epub 2017 Jan 18.
7
Inbreeding coefficient estimation with dense SNP data: comparison of strategies and application to HapMap III.利用高密度单核苷酸多态性(SNP)数据估计近亲繁殖系数:策略比较及对HapMap III的应用
Hum Hered. 2014;77(1-4):49-62. doi: 10.1159/000358224. Epub 2014 Jul 29.
8
Shrinkage estimation of effect sizes as an alternative to hypothesis testing followed by estimation in high-dimensional biology: applications to differential gene expression.作为高维生物学中假设检验后进行估计的替代方法的效应量收缩估计:在差异基因表达中的应用
Stat Appl Genet Mol Biol. 2010;9:Article23. doi: 10.2202/1544-6115.1504. Epub 2010 Jun 8.
9
Linkage disequilibrium assessment via log-linear modeling of SNP haplotype frequencies.通过单核苷酸多态性(SNP)单倍型频率的对数线性模型进行连锁不平衡评估。
Genet Epidemiol. 2003 Sep;25(2):106-14. doi: 10.1002/gepi.10254.
10
Linkage disequilibrium across two different single-nucleotide polymorphism genome scans.两个不同单核苷酸多态性全基因组扫描的连锁不平衡。
BMC Genet. 2005 Dec 30;6 Suppl 1(Suppl 1):S86. doi: 10.1186/1471-2156-6-S1-S86.

引用本文的文献

1
A recombined allele of the lipase gene CEL and its pseudogene CELP confers susceptibility to chronic pancreatitis.脂肪酶基因CEL及其假基因CELP的一个重组等位基因会使人易患慢性胰腺炎。
Nat Genet. 2015 May;47(5):518-522. doi: 10.1038/ng.3249. Epub 2015 Mar 16.
2
Population-genetic comparison of the Sorbian isolate population in Germany with the German KORA population using genome-wide SNP arrays.利用全基因组 SNP 芯片对德国索布族隔离人群与德国 KORA 人群进行群体遗传学比较。
BMC Genet. 2011 Jul 28;12:67. doi: 10.1186/1471-2156-12-67.