• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于加权单核苷酸多态性片段和含误差基因型的个体单倍型分型问题的更高精度模型。

A model of higher accuracy for the individual haplotyping problem based on weighted SNP fragments and genotype with errors.

作者信息

Xie Minzhu, Wang Jianxin, Chen Jianer

机构信息

School of Information Science and Engineering, Central South University, Changsha 410083, China.

出版信息

Bioinformatics. 2008 Jul 1;24(13):i105-13. doi: 10.1093/bioinformatics/btn147.

DOI:10.1093/bioinformatics/btn147
PMID:18586702
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2718625/
Abstract

MOTIVATION

In genetic studies of complex diseases, haplotypes provide more information than genotypes. However, haplotyping is much more difficult than genotyping using biological techniques. Therefore effective computational techniques have been in demand. The individual haplotyping problem is the computational problem of inducing a pair of haplotypes from an individual's aligned SNP fragments. Based on various optimal criteria and including different extra information, many models for the problem have been proposed. Higher accuracy of the models has been an important issue in the study of haplotype reconstruction.

RESULTS

The current article proposes a highly accurate model for the single individual haplotyping problem based on weighted fragments and genotypes with errors. The model is proved to be NP-hard even with gapless fragments. Based on the characteristics of Single Nucleotide Polymorphism (SNP) fragments, a parameterized algorithm of time complexity O(nk(2)2(k(2)) + m log m + mk(1)) is developed, where m is the number of fragments, n is the number of SNP sites, k(1) is the maximum number of SNP sites that a fragment covers (no more than n and usually smaller than 10) and k(2) is the maximum number of the fragments covering a SNP site (usually no more than 19). Extensive experiments show that this model is more accurate in haplotype reconstruction than other models.

AVAILABILITY

The program of the parameterized algorithm can be obtained by sending an email to the corresponding author.

摘要

动机

在复杂疾病的基因研究中,单倍型比基因型提供的信息更多。然而,使用生物技术进行单倍型分型比基因分型要困难得多。因此,人们一直需要有效的计算技术。个体单倍型分型问题是从个体的对齐单核苷酸多态性(SNP)片段中推断出一对单倍型的计算问题。基于各种最优标准并包含不同的额外信息,已经提出了许多针对该问题的模型。模型的更高准确性一直是单倍型重建研究中的一个重要问题。

结果

本文基于带误差的加权片段和基因型,为单一个体单倍型分型问题提出了一个高精度模型。即使对于无间隙片段,该模型也被证明是NP难的。基于单核苷酸多态性(SNP)片段的特征,开发了一种时间复杂度为O(nk(2)2(k(2)) + m log m + mk(1))的参数化算法,其中m是片段数量,n是SNP位点数量,k(1)是一个片段覆盖的SNP位点的最大数量(不超过n且通常小于10),k(2)是覆盖一个SNP位点的片段的最大数量(通常不超过19)。大量实验表明,该模型在单倍型重建方面比其他模型更准确。

可用性

通过向通讯作者发送电子邮件可以获得参数化算法的程序。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/9b3757b3bced/btn147f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/a0e204086c07/btn147f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/2e16f0f446e2/btn147f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/89f8ff8abcb5/btn147f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/c7d669195a01/btn147f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/43469191408f/btn147f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/94c230d11cbc/btn147f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/e584d71859af/btn147f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/9b3757b3bced/btn147f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/a0e204086c07/btn147f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/2e16f0f446e2/btn147f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/89f8ff8abcb5/btn147f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/c7d669195a01/btn147f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/43469191408f/btn147f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/94c230d11cbc/btn147f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/e584d71859af/btn147f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/2718625/9b3757b3bced/btn147f8.jpg

相似文献

1
A model of higher accuracy for the individual haplotyping problem based on weighted SNP fragments and genotype with errors.一种基于加权单核苷酸多态性片段和含误差基因型的个体单倍型分型问题的更高精度模型。
Bioinformatics. 2008 Jul 1;24(13):i105-13. doi: 10.1093/bioinformatics/btn147.
2
Research on parameterized algorithms of the individual haplotyping problem.
J Bioinform Comput Biol. 2007 Jun;5(3):795-816. doi: 10.1142/s0219720007002710.
3
Haplotype reconstruction from SNP fragments by minimum error correction.通过最小错误校正从单核苷酸多态性(SNP)片段进行单倍型重建。
Bioinformatics. 2005 May 15;21(10):2456-62. doi: 10.1093/bioinformatics/bti352. Epub 2005 Feb 24.
4
SpeedHap: an accurate heuristic for the single individual SNP haplotyping problem with many gaps, high reading error rate and low coverage.SpeedHap:一种针对存在许多缺口、高读取错误率和低覆盖率的单一个体单核苷酸多态性单倍型分型问题的精确启发式算法。
IEEE/ACM Trans Comput Biol Bioinform. 2008 Oct-Dec;5(4):492-502. doi: 10.1109/TCBB.2008.67.
5
LGH: A Fast and Accurate Algorithm for Single Individual Haplotyping Based on a Two-Locus Linkage Graph.
IEEE/ACM Trans Comput Biol Bioinform. 2015 Nov-Dec;12(6):1255-66. doi: 10.1109/TCBB.2015.2430352.
6
A parsimonious tree-grow method for haplotype inference.一种用于单倍型推断的简约树生长方法。
Bioinformatics. 2005 Sep 1;21(17):3475-81. doi: 10.1093/bioinformatics/bti572. Epub 2005 Jul 7.
7
An efficient algorithm for Perfect Phylogeny Haplotyping.一种用于完美系统发育单倍型分型的高效算法。
Proc IEEE Comput Syst Bioinform Conf. 2005:103-10. doi: 10.1109/csb.2005.12.
8
A fast and accurate algorithm for diploid individual haplotype reconstruction.一种用于二倍体个体单倍型重建的快速准确算法。
J Bioinform Comput Biol. 2013 Aug;11(4):1350010. doi: 10.1142/S0219720013500108. Epub 2013 Jun 30.
9
Constrained hidden Markov models for population-based haplotyping.
BMC Bioinformatics. 2007 May 3;8 Suppl 2(Suppl 2):S9. doi: 10.1186/1471-2105-8-S2-S9.
10
Islands of tractability for parsimony haplotyping.简约单倍型分型的易处理区域
Proc IEEE Comput Syst Bioinform Conf. 2005:65-72. doi: 10.1109/csb.2005.37.

引用本文的文献

1
A fast and accurate enumeration-based algorithm for haplotyping a triploid individual.一种用于对三倍体个体进行单倍型分型的基于计数的快速准确算法。
Algorithms Mol Biol. 2018 Jun 1;13:10. doi: 10.1186/s13015-018-0129-0. eCollection 2018.
2
dipSPAdes: Assembler for Highly Polymorphic Diploid Genomes.dipSPAdes:用于高度多态二倍体基因组的组装器。
J Comput Biol. 2015 Jun;22(6):528-45. doi: 10.1089/cmb.2014.0153. Epub 2015 Mar 3.
3
A highly accurate heuristic algorithm for the haplotype assembly problem.一种用于单体型组装问题的高度精确启发式算法。

本文引用的文献

1
The diploid genome sequence of an individual human.某个人类个体的二倍体基因组序列。
PLoS Biol. 2007 Sep 4;5(10):e254. doi: 10.1371/journal.pbio.0050254.
2
Research on parameterized algorithms of the individual haplotyping problem.
J Bioinform Comput Biol. 2007 Jun;5(3):795-816. doi: 10.1142/s0219720007002710.
3
A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays.一种用于Affymetrix SNP微阵列的多阵列多SNP基因分型算法。
Bioinformatics. 2007 Jun 15;23(12):1459-67. doi: 10.1093/bioinformatics/btm131. Epub 2007 Apr 25.
BMC Genomics. 2013;14 Suppl 2(Suppl 2):S2. doi: 10.1186/1471-2164-14-S2-S2. Epub 2013 Feb 15.
4
A fast and accurate algorithm for single individual haplotyping.一种用于单一个体单倍型分型的快速准确算法。
BMC Syst Biol. 2012;6 Suppl 2(Suppl 2):S8. doi: 10.1186/1752-0509-6-S2-S8. Epub 2012 Dec 12.
4
Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data.高密度寡核苷酸单核苷酸多态性(SNP)阵列数据的探索、标准化及基因型分型
Biostatistics. 2007 Apr;8(2):485-99. doi: 10.1093/biostatistics/kxl042. Epub 2006 Dec 22.
5
A haplotype map of the human genome.人类基因组单倍型图谱。
Nature. 2005 Oct 27;437(7063):1299-320. doi: 10.1038/nature04226.
6
Haplotype assembly from aligned weighted SNP fragments.基于比对加权单核苷酸多态性片段的单倍型组装
Comput Biol Chem. 2005 Aug;29(4):281-7. doi: 10.1016/j.compbiolchem.2005.05.001.
7
Haplotype reconstruction from SNP fragments by minimum error correction.通过最小错误校正从单核苷酸多态性(SNP)片段进行单倍型重建。
Bioinformatics. 2005 May 15;21(10):2456-62. doi: 10.1093/bioinformatics/bti352. Epub 2005 Feb 24.
8
Comparison of the accuracy of methods of computational haplotype inference using a large empirical dataset.使用大型实证数据集对计算单倍型推断方法的准确性进行比较。
BMC Genet. 2004 Aug 3;5:22. doi: 10.1186/1471-2156-5-22.
9
Incorporating genotyping uncertainty in haplotype inference for single-nucleotide polymorphisms.在单核苷酸多态性的单倍型推断中纳入基因分型不确定性。
Am J Hum Genet. 2004 Mar;74(3):495-510. doi: 10.1086/382284. Epub 2004 Feb 13.
10
Algorithmic strategies for the single nucleotide polymorphism haplotype assembly problem.
Brief Bioinform. 2002 Mar;3(1):23-31. doi: 10.1093/bib/3.1.23.