• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

合并微卫星数据。

Merging microsatellite data.

作者信息

Presson Angela P, Sobel Eric, Lange Kenneth, Papp Jeanette C

机构信息

Department of Statistics, University of California, Los Angeles, Los Angeles, CA 90095, USA.

出版信息

J Comput Biol. 2006 Jul-Aug;13(6):1131-47. doi: 10.1089/cmb.2006.13.1131.

DOI:10.1089/cmb.2006.13.1131
PMID:16901233
Abstract

Genotype calling procedures vary from laboratory to laboratory for many microsatellite markers. Even within the same laboratory, application of different experimental protocols often leads to ambiguities. The impact of these ambiguities ranges from irksome to devastating. Resolving the ambiguities can increase effective sample size and preserve evidence in favor of disease-marker associations. Because different data sets may contain different numbers of alleles, merging is unfortunately not a simple process of matching alleles one to one. Merging data sets manually is difficult, time-consuming, and error-prone due to differences in genotyping hardware, binning methods, molecular weight standards, and curve fitting algorithms. Merging is particularly difficult if few or no samples occur in common, or if samples are drawn from ethnic groups with widely varying allele frequencies. It is dangerous to align alleles simply by adding a constant number of base pairs to the alleles of one of the data sets. To address these issues, we have developed a Bayesian model and a Markov chain Monte Carlo (MCMC) algorithm for sampling the posterior distribution under the model. Our computer program, MicroMerge, implements the algorithm and almost always accurately and efficiently finds the most likely correct alignment. Common allele frequencies across laboratories in the same ethnic group are the single most important cue in the model. MicroMerge computes the allelic alignments with the greatest posterior probabilities under several merging options. It also reports when data sets cannot be confidently merged. These features are emphasized in our analysis of simulated and real data.

摘要

对于许多微卫星标记物,不同实验室的基因型分型程序各不相同。即使在同一实验室中,应用不同的实验方案也常常会导致结果不明确。这些不明确性的影响范围从令人厌烦到具有毁灭性。解决这些不明确性可以增加有效样本量,并保留有利于疾病标记物关联的证据。由于不同的数据集可能包含不同数量的等位基因,不幸的是,合并并不是一个简单的一对一匹配等位基因的过程。由于基因分型硬件、分箱方法、分子量标准和曲线拟合算法的差异,手动合并数据集既困难、耗时,又容易出错。如果很少有或没有共同的样本,或者样本来自等位基因频率差异很大的种族群体,合并就会特别困难。仅仅通过给其中一个数据集的等位基因添加固定数量的碱基对来对齐等位基因是很危险的。为了解决这些问题,我们开发了一种贝叶斯模型和一种马尔可夫链蒙特卡罗(MCMC)算法,用于对模型下的后验分布进行采样。我们的计算机程序MicroMerge实现了该算法,并且几乎总是能够准确、高效地找到最可能正确的对齐方式。同一民族不同实验室之间的常见等位基因频率是该模型中最重要的单一线索。MicroMerge在几种合并选项下计算具有最大后验概率的等位基因对齐方式。它还会报告数据集何时无法可靠合并。在我们对模拟数据和真实数据的分析中强调了这些特征。

相似文献

1
Merging microsatellite data.合并微卫星数据。
J Comput Biol. 2006 Jul-Aug;13(6):1131-47. doi: 10.1089/cmb.2006.13.1131.
2
Merging microsatellite data: enhanced methodology and software to combine genotype data for linkage and association analysis.合并微卫星数据:用于连锁和关联分析的合并基因型数据的增强方法与软件
BMC Bioinformatics. 2008 Jul 21;9:317. doi: 10.1186/1471-2105-9-317.
3
A Bayesian approach to ordering gene markers.一种用于排列基因标记的贝叶斯方法。
Biometrics. 1999 Jun;55(2):419-29. doi: 10.1111/j.0006-341x.1999.00419.x.
4
Reconstruction of pedigrees in clonal plant populations.克隆植物种群的谱系重建。
Theor Popul Biol. 2010 Sep;78(2):109-17. doi: 10.1016/j.tpb.2010.05.002. Epub 2010 May 23.
5
Exact transmission-disequilibrium tests with multiallelic markers.使用多等位基因标记的精确传递不平衡检验。
Genet Epidemiol. 1997;14(4):337-47. doi: 10.1002/(SICI)1098-2272(1997)14:4<337::AID-GEPI1>3.0.CO;2-0.
6
A Markov chain Monte Carlo technique for identification of combinations of allelic variants underlying complex diseases in humans.一种用于识别人类复杂疾病潜在等位基因变异组合的马尔可夫链蒙特卡罗技术。
Genetics. 2005 Dec;171(4):2113-21. doi: 10.1534/genetics.105.048090. Epub 2005 Aug 22.
7
Maximum-likelihood and markov chain monte carlo approaches to estimate inbreeding and effective size from allele frequency changes.基于等位基因频率变化估计近亲繁殖和有效种群大小的最大似然法和马尔可夫链蒙特卡罗方法
Genetics. 2003 Jul;164(3):1189-204. doi: 10.1093/genetics/164.3.1189.
8
Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计
BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.
9
Parallel Metropolis coupled Markov chain Monte Carlo for Bayesian phylogenetic inference.用于贝叶斯系统发育推断的并行 metropolis 耦合马尔可夫链蒙特卡罗方法
Bioinformatics. 2004 Feb 12;20(3):407-15. doi: 10.1093/bioinformatics/btg427. Epub 2004 Jan 22.
10
A simple method for automated allele binning in microsatellite markers.一种用于微卫星标记中自动等位基因分型的简单方法。
Genome Res. 1997 Nov;7(11):1104-9. doi: 10.1101/gr.7.11.1104.

引用本文的文献

1
A continent-wide high genetic load in African buffalo revealed by clines in the frequency of deleterious alleles, genetic hitchhiking and linkage disequilibrium.非洲野牛在整个大陆上存在高遗传负荷,这是由有害等位基因频率的渐变、遗传搭便车和连锁不平衡揭示的。
PLoS One. 2021 Dec 9;16(12):e0259685. doi: 10.1371/journal.pone.0259685. eCollection 2021.
2
Linkage disequilibrium matches forensic genetic records to disjoint genomic marker sets.连锁不平衡将法医遗传记录与不相交的基因组标记集匹配。
Proc Natl Acad Sci U S A. 2017 May 30;114(22):5671-5676. doi: 10.1073/pnas.1619944114. Epub 2017 May 15.
3
Population structure in a comprehensive genomic data set on human microsatellite variation.
人类微卫星变异综合基因组数据集的种群结构。
G3 (Bethesda). 2013 May 20;3(5):891-907. doi: 10.1534/g3.113.005728.
4
Susceptibility loci for adiposity phenotypes on 8p, 9p, and 16q in American Samoa and Samoa.美属萨摩亚和萨摩亚8号染色体短臂、9号染色体短臂及16号染色体长臂上肥胖相关表型的易感基因座。
Obesity (Silver Spring). 2009 Mar;17(3):518-24. doi: 10.1038/oby.2008.558. Epub 2008 Dec 18.
5
Merging microsatellite data: enhanced methodology and software to combine genotype data for linkage and association analysis.合并微卫星数据:用于连锁和关联分析的合并基因型数据的增强方法与软件
BMC Bioinformatics. 2008 Jul 21;9:317. doi: 10.1186/1471-2105-9-317.
6
A cautionary tale: Lack of consistency in allele sizes between two laboratories for a published multilocus microsatellite typing system.一则警示故事:已发表的多位点微卫星分型系统在两个实验室之间等位基因大小缺乏一致性。
J Clin Microbiol. 2007 Feb;45(2):522-8. doi: 10.1128/JCM.02136-06. Epub 2006 Dec 13.
7
Accommodating chromosome inversions in linkage analysis.连锁分析中可容纳的染色体倒位
Am J Hum Genet. 2006 Aug;79(2):238-51. doi: 10.1086/505540. Epub 2006 Jun 6.