• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

不同单倍型块划分算法的一致性分析

Analysis of concordance of different haplotype block partitioning algorithms.

作者信息

Indap Amit R, Marth Gabor T, Struble Craig A, Tonellato Peter, Olivier Michael

机构信息

Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, USA.

出版信息

BMC Bioinformatics. 2005 Dec 15;6:303. doi: 10.1186/1471-2105-6-303.

DOI:10.1186/1471-2105-6-303
PMID:16356172
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1343594/
Abstract

BACKGROUND

Different classes of haplotype block algorithms exist and the ideal dataset to assess their performance would be to comprehensively re-sequence a large genomic region in a large population. Such data sets are expensive to collect. Alternatively, we performed coalescent simulations to generate haplotypes with a high marker density and compared block partitioning results from diversity based, LD based, and information theoretic algorithms under different values of SNP density and allele frequency.

RESULTS

We simulated 1000 haplotypes using the standard coalescent for three world populations--European, African American, and East Asian--and applied three classes of block partitioning algorithms--diversity based, LD based, and information theoretic. We assessed algorithm differences in number, size, and coverage of blocks inferred under different conditions of SNP density, allele frequency, and sample size. Each algorithm inferred blocks differing in number, size, and coverage under different density and allele frequency conditions. Different partitions had few if any matching block boundaries. However they still overlapped and a high percentage of total chromosomal region was common to all methods. This percentage was generally higher with a higher density of SNPs and when rarer markers were included.

CONCLUSION

A gold standard definition of a haplotype block is difficult to achieve, but collecting haplotypes covered with a high density of SNPs, partitioning them with a variety of block algorithms, and identifying regions common to all methods may be the best way to identify genomic regions that harbor SNP variants that cause disease.

摘要

背景

存在不同类别的单倍型块算法,而评估其性能的理想数据集是在大量人群中对大的基因组区域进行全面重测序。收集这样的数据集成本很高。作为替代方案,我们进行了合并模拟以生成具有高标记密度的单倍型,并比较了基于多样性、基于连锁不平衡(LD)和信息论算法在不同单核苷酸多态性(SNP)密度和等位基因频率值下的块划分结果。

结果

我们使用标准合并方法为欧洲、非裔美国人和东亚这三个人群模拟了1000个单倍型,并应用了三类块划分算法——基于多样性、基于LD和信息论算法。我们评估了在SNP密度、等位基因频率和样本量的不同条件下推断出的块在数量、大小和覆盖范围方面的算法差异。在不同的密度和等位基因频率条件下,每种算法推断出的块在数量、大小和覆盖范围上都有所不同。不同的划分几乎没有匹配的块边界。然而它们仍然有重叠,并且所有方法共有的总染色体区域的百分比很高。当SNP密度较高且包含较罕见的标记时,这个百分比通常更高。

结论

单倍型块的金标准定义很难实现,但收集覆盖高密度SNP的单倍型,用多种块算法对其进行划分,并识别所有方法共有的区域,可能是识别含有导致疾病的SNP变异的基因组区域的最佳方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/99a96299dcab/1471-2105-6-303-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/cddfd3843195/1471-2105-6-303-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/4743c170c18d/1471-2105-6-303-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/36c3b8dd546d/1471-2105-6-303-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/de699a7d449e/1471-2105-6-303-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/e0bd84062ae6/1471-2105-6-303-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/e974e7ac42b1/1471-2105-6-303-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/99a96299dcab/1471-2105-6-303-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/cddfd3843195/1471-2105-6-303-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/4743c170c18d/1471-2105-6-303-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/36c3b8dd546d/1471-2105-6-303-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/de699a7d449e/1471-2105-6-303-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/e0bd84062ae6/1471-2105-6-303-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/e974e7ac42b1/1471-2105-6-303-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf09/1343594/99a96299dcab/1471-2105-6-303-7.jpg

相似文献

1
Analysis of concordance of different haplotype block partitioning algorithms.不同单倍型块划分算法的一致性分析
BMC Bioinformatics. 2005 Dec 15;6:303. doi: 10.1186/1471-2105-6-303.
2
Haplotype block partitioning and tag SNP selection using genotype data and their applications to association studies.利用基因型数据进行单倍型块划分和标签单核苷酸多态性选择及其在关联研究中的应用。
Genome Res. 2004 May;14(5):908-16. doi: 10.1101/gr.1837404. Epub 2004 Apr 12.
3
The effect of single-nucleotide polymorphism marker selection on patterns of haplotype blocks and haplotype frequency estimates.单核苷酸多态性标记选择对单倍型块模式及单倍型频率估计的影响。
Am J Hum Genet. 2005 Dec;77(6):988-98. doi: 10.1086/498175. Epub 2005 Oct 19.
4
Efficient haplotype block recognition of very long and dense genetic sequences.高效识别非常长且密集的遗传序列的单倍型块。
BMC Bioinformatics. 2014 Jan 14;15:10. doi: 10.1186/1471-2105-15-10.
5
The effect of haplotype-block definitions on inference of haplotype-block structure and htSNPs selection.单倍型块定义对单倍型块结构推断和htSNP选择的影响。
Mol Biol Evol. 2005 Jan;22(1):148-59. doi: 10.1093/molbev/msh266. Epub 2004 Sep 15.
6
The structure of haplotype blocks in the human genome.人类基因组中单倍型块的结构。
Science. 2002 Jun 21;296(5576):2225-9. doi: 10.1126/science.1069424. Epub 2002 May 23.
7
Haplotype block partitioning as a tool for dimensionality reduction in SNP association studies.单倍型块划分作为SNP关联研究中降维的一种工具。
BMC Genomics. 2008 Aug 29;9:405. doi: 10.1186/1471-2164-9-405.
8
htSNPer1.0: software for haplotype block partition and htSNPs selection.htSNPer1.0:用于单倍型块划分和htSNP选择的软件。
BMC Bioinformatics. 2005 Mar 1;6:38. doi: 10.1186/1471-2105-6-38.
9
A novel efficient dynamic programming algorithm for haplotype block partitioning.一种用于单倍型块划分的新颖高效动态规划算法。
J Theor Biol. 2010 Nov 21;267(2):164-70. doi: 10.1016/j.jtbi.2010.08.019. Epub 2010 Aug 20.
10
The impact of SNP density on fine-scale patterns of linkage disequilibrium.单核苷酸多态性(SNP)密度对连锁不平衡精细尺度模式的影响。
Hum Mol Genet. 2004 Mar 15;13(6):577-88. doi: 10.1093/hmg/ddh060. Epub 2004 Jan 20.

引用本文的文献

1
Determinant-based grouping of SNPs and its application for detecting disease-associated genomic loci.基于决定因素的单核苷酸多态性分组及其在检测疾病相关基因组位点中的应用。
NAR Genom Bioinform. 2025 Mar 18;7(1):lqaf024. doi: 10.1093/nargab/lqaf024. eCollection 2025 Mar.
2
PIP-SNP: a pipeline for processing SNP data featured as linkage disequilibrium bin mapping, genotype imputing and marker synthesizing.PIP-SNP:一种用于处理SNP数据的流程,其特点是连锁不平衡区间映射、基因型填充和标记合成。
NAR Genom Bioinform. 2021 Jul 5;3(3):lqab060. doi: 10.1093/nargab/lqab060. eCollection 2021 Sep.
3
Cloud computing-based TagSNP selection algorithm for human genome data.

本文引用的文献

1
Genome-wide association studies for common diseases and complex traits.常见疾病和复杂性状的全基因组关联研究。
Nat Rev Genet. 2005 Feb;6(2):95-108. doi: 10.1038/nrg1521.
2
The ENCODE (ENCyclopedia Of DNA Elements) Project.DNA 元件百科全书(ENCODE)计划
Science. 2004 Oct 22;306(5696):636-40. doi: 10.1126/science.1105136.
3
The effect of haplotype-block definitions on inference of haplotype-block structure and htSNPs selection.单倍型块定义对单倍型块结构推断和htSNP选择的影响。
基于云计算的人类基因组数据标签单核苷酸多态性选择算法
Int J Mol Sci. 2015 Jan 5;16(1):1096-110. doi: 10.3390/ijms16011096.
4
Annotation of functional variation within non-MHC MS susceptibility loci through bioinformatics analysis.通过生物信息学分析对非MHC多发性硬化症易感基因座内的功能变异进行注释。
Genes Immun. 2014 Oct;15(7):466-76. doi: 10.1038/gene.2014.37. Epub 2014 Jul 17.
5
Genome-wide association studies using single-nucleotide polymorphisms versus haplotypes: an empirical comparison with data from the North American Rheumatoid Arthritis Consortium.使用单核苷酸多态性与单倍型的全基因组关联研究:与北美类风湿关节炎联盟数据的实证比较
BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S35. doi: 10.1186/1753-6561-3-s7-s35.
6
Global haplotype partitioning for maximal associated SNP pairs.全球单体型划分以获得最大关联 SNP 对。
BMC Bioinformatics. 2009 Aug 27;10:269. doi: 10.1186/1471-2105-10-269.
7
Haplotype block partitioning as a tool for dimensionality reduction in SNP association studies.单倍型块划分作为SNP关联研究中降维的一种工具。
BMC Genomics. 2008 Aug 29;9:405. doi: 10.1186/1471-2164-9-405.
8
Genetic polymorphisms in uridine diphospho-glucuronosyltransferase 1A1 and breast cancer risk in Africans.非洲人中尿苷二磷酸葡萄糖醛酸基转移酶1A1的基因多态性与乳腺癌风险
Breast Cancer Res Treat. 2008 Jul;110(2):367-76. doi: 10.1007/s10549-007-9720-7. Epub 2007 Oct 2.
9
Evaluation of sample size effect on the identification of haplotype blocks.样本量对单倍型块识别的影响评估。
BMC Bioinformatics. 2007 Jun 14;8:200. doi: 10.1186/1471-2105-8-200.
Mol Biol Evol. 2005 Jan;22(1):148-59. doi: 10.1093/molbev/msh266. Epub 2004 Sep 15.
4
HapBlock: haplotype block partitioning and tag SNP selection software using a set of dynamic programming algorithms.HapBlock:一种使用一组动态规划算法进行单倍型块划分和标签单核苷酸多态性选择的软件。
Bioinformatics. 2005 Jan 1;21(1):131-4. doi: 10.1093/bioinformatics/bth482. Epub 2004 Aug 27.
5
Haploview: analysis and visualization of LD and haplotype maps.Haploview:连锁不平衡(LD)和单倍型图谱的分析与可视化
Bioinformatics. 2005 Jan 15;21(2):263-5. doi: 10.1093/bioinformatics/bth457. Epub 2004 Aug 5.
6
Optimal haplotype block-free selection of tagging SNPs for genome-wide association studies.用于全基因组关联研究的标签单核苷酸多态性的最优无单倍型块选择
Genome Res. 2004 Aug;14(8):1633-40. doi: 10.1101/gr.2570004.
7
The impact of SNP density on fine-scale patterns of linkage disequilibrium.单核苷酸多态性(SNP)密度对连锁不平衡精细尺度模式的影响。
Hum Mol Genet. 2004 Mar 15;13(6):577-88. doi: 10.1093/hmg/ddh060. Epub 2004 Jan 20.
8
The International HapMap Project.国际人类基因组单体型图计划
Nature. 2003 Dec 18;426(6968):789-96. doi: 10.1038/nature02168.
9
Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium.利用连锁不平衡选择用于关联分析的信息量最大的单核苷酸多态性集合。
Am J Hum Genet. 2004 Jan;74(1):106-20. doi: 10.1086/381000. Epub 2003 Dec 15.
10
The UCSC Table Browser data retrieval tool.加州大学圣克鲁兹分校表格浏览器数据检索工具。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D493-6. doi: 10.1093/nar/gkh103.