Merging microsatellite data: enhanced methodology and software to combine genotype data for linkage and association analysis.

Author information

Presson Angela P, Sobel Eric M, Pajukanta Paivi, Plaisier Christopher, Weeks Daniel E, Aberg Karolina, Papp Jeanette C

Affiliation

Department of Human Genetics, University of California, Los Angeles, CA 90095, USA.

Publication information

BMC Bioinformatics. 2008 Jul 21;9:317. doi: 10.1186/1471-2105-9-317.

DOI:10.1186/1471-2105-9-317
PMID:18644149
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2515855/
Abstract

BACKGROUND

Correctly merged data sets that have been independently genotyped can increase statistical power in linkage and association studies. However, alleles from microsatellite data sets genotyped with different experimental protocols or platforms cannot be accurately matched using base-pair size information alone. In a previous publication we introduced a statistical model for merging microsatellite data by matching allele frequencies between data sets. These methods are implemented in our software MicroMerge version 1 (v1). While MicroMerge v1 output can be analyzed by some genetic analysis programs, many programs cannot analyze alignments that do not match alleles one-to-one between data sets. A consequence of such alignments is that codominant genotypes must often be analyzed as phenotypes. In this paper we describe several extensions that are implemented in MicroMerge version 2 (v2).
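The idea of matching alleles by frequency rather than by reported size can be illustrated with a toy example. The sketch below is not the MicroMerge statistical model. It assumes, purely for illustration, that two laboratories report the same alleles with a constant base-pair offset, and it picks the offset that minimizes the squared difference between the two allele-frequency distributions. The function names, the `max_shift` parameter, and the sample data are all hypothetical.

```python
from collections import Counter


def allele_frequencies(alleles):
    """Return {allele size in bp: relative frequency} for a list of allele calls."""
    counts = Counter(alleles)
    total = sum(counts.values())
    return {size: n / total for size, n in counts.items()}


def best_shift(freq_a, freq_b, max_shift=4):
    """Find the integer bp offset for data set B that best matches data set A.

    Toy criterion (an assumption, not the published model): minimize the sum
    of squared frequency differences after shifting every B allele by `shift`.
    """
    def cost(shift):
        sizes = set(freq_a) | {s + shift for s in freq_b}
        return sum(
            (freq_a.get(s, 0.0) - freq_b.get(s - shift, 0.0)) ** 2
            for s in sizes
        )

    return min(range(-max_shift, max_shift + 1), key=cost)


# Hypothetical allele calls: lab B reports every allele 1 bp larger than lab A.
lab_a = [150] * 10 + [152] * 60 + [154] * 30
lab_b = [151] * 12 + [153] * 58 + [155] * 30

shift = best_shift(allele_frequencies(lab_a), allele_frequencies(lab_b))
print(shift)  # offset to add to lab B sizes so they line up with lab A
```

A constant offset is the simplest possible alignment. Real data can need alignments that are not a uniform shift, which is why a frequency-based statistical model is used rather than a fixed size correction.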

RESULTS

Notably, MicroMerge v2 includes a new one-to-one alignment option that creates merged pedigree and locus files that can be handled by most genetic analysis software. Other features in MicroMerge v2 enhance the following aspects of control: 1) optimizing the algorithm for different merging scenarios, such as data sets with very different sample sizes or multiple data sets, 2) merging small data sets when a reliable set of allele frequencies is available, 3) improving the quantity of merged data, and 4) improving the quality of merged data. We present results from simulated and real microsatellite genotype data sets, and conclude with an association analysis of three familial dyslipidemia (FD) study samples genotyped at different laboratories. Independent analysis of each FD data set did not yield consistent results, but analysis of the merged data sets identified strong association at locus D11S2002.
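To show why a one-to-one alignment matters for downstream software, here is a minimal sketch of recoding one data set's genotypes into a shared allele labeling. It does not reproduce MicroMerge's file formats or logic. The function names and the allele map are hypothetical, and treating genotypes with unaligned alleles as missing is an assumption made for this example.

```python
def is_one_to_one(allele_map):
    """True if no two source alleles map to the same merged allele label."""
    return len(set(allele_map.values())) == len(allele_map)


def recode_genotypes(genotypes, allele_map):
    """Recode (allele1, allele2) pairs into merged labels.

    Genotypes containing an allele absent from the map are returned as None
    (treated as missing here; this handling is an assumption of the sketch).
    """
    merged = []
    for a1, a2 in genotypes:
        if a1 in allele_map and a2 in allele_map:
            merged.append((allele_map[a1], allele_map[a2]))
        else:
            merged.append(None)
    return merged


# Hypothetical alignment of lab B allele sizes (bp) to merged allele labels.
lab_b_map = {151: 1, 153: 2, 155: 3}
lab_b_genotypes = [(151, 153), (153, 153), (155, 157)]

assert is_one_to_one(lab_b_map)
recoded = recode_genotypes(lab_b_genotypes, lab_b_map)
print(recoded)
```

When the map is one-to-one, every recoded genotype is still an ordinary pair of codominant alleles, so the merged data can be passed to programs that expect standard genotype input.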

CONCLUSION

The MicroMerge v2 features will enable merging for a variety of genotype data sets, which in turn will facilitate meta-analyses for powering association analysis.

