• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HMEC:一种具有最小错误校正功能的个体单倍型启发式算法。

HMEC: A Heuristic Algorithm for Individual Haplotyping with Minimum Error Correction.

作者信息

Bayzid Md Shamsuzzoha, Alam Md Maksudul, Mueen Abdullah, Rahman Md Saidur

机构信息

Department of Computer Science, University of Texas at Austin, Austin, TX 78712, USA.

Department of Computer Science, Virginia Tech, Blacksburg, VA 24060, USA.

出版信息

ISRN Bioinform. 2013 Jan 28;2013:291741. doi: 10.1155/2013/291741. eCollection 2013.

DOI:10.1155/2013/291741
PMID:25969753
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4393065/
Abstract

Haplotype is a pattern of single nucleotide polymorphisms (SNPs) on a single chromosome. Constructing a pair of haplotypes from aligned and overlapping but intermixed and erroneous fragments of the chromosomal sequences is a nontrivial problem. Minimum error correction approach aims to minimize the number of errors to be corrected so that the pair of haplotypes can be constructed through consensus of the fragments. We give a heuristic algorithm (HMEC) that searches through alternative solutions using a gain measure and stops whenever no better solution can be achieved. Time complexity of each iteration is O(m (3) k) for an m × k SNP matrix where m and k are the number of fragments (number of rows) and number of SNP sites (number of columns), respectively, in an SNP matrix. Alternative gain measure is also given to reduce running time. We have compared our algorithm with other methods in terms of accuracy and running time on both simulated and real data, and our extensive experimental results indicate the superiority of our algorithm over others.

摘要

单倍型是指单条染色体上的单核苷酸多态性(SNP)模式。从染色体序列的比对、重叠但相互混合且存在错误的片段构建一对单倍型是一个具有挑战性的问题。最小错误校正方法旨在尽量减少需要校正的错误数量,以便通过片段的一致性构建单倍型对。我们给出一种启发式算法(HMEC),该算法使用增益度量在替代解决方案中进行搜索,并在无法获得更好的解决方案时停止。对于一个m×k的SNP矩阵,每次迭代的时间复杂度为O(m (3) k),其中m和k分别是SNP矩阵中片段的数量(行数)和SNP位点的数量(列数)。还给出了替代增益度量以减少运行时间。我们在模拟数据和真实数据上,从准确性和运行时间方面将我们的算法与其他方法进行了比较,我们广泛的实验结果表明我们的算法优于其他算法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/cbe8a3a5f0f6/ISRN.BIOINFORMATICS2013-291741.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/1d59d181e0dd/ISRN.BIOINFORMATICS2013-291741.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/cd66e00a8c11/ISRN.BIOINFORMATICS2013-291741.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/164bf2540e40/ISRN.BIOINFORMATICS2013-291741.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/9fb57bafffc6/ISRN.BIOINFORMATICS2013-291741.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/02e71ee1e7db/ISRN.BIOINFORMATICS2013-291741.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/5d209673b4d1/ISRN.BIOINFORMATICS2013-291741.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/3b94dda6323a/ISRN.BIOINFORMATICS2013-291741.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/cbe8a3a5f0f6/ISRN.BIOINFORMATICS2013-291741.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/1d59d181e0dd/ISRN.BIOINFORMATICS2013-291741.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/cd66e00a8c11/ISRN.BIOINFORMATICS2013-291741.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/164bf2540e40/ISRN.BIOINFORMATICS2013-291741.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/9fb57bafffc6/ISRN.BIOINFORMATICS2013-291741.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/02e71ee1e7db/ISRN.BIOINFORMATICS2013-291741.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/5d209673b4d1/ISRN.BIOINFORMATICS2013-291741.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/3b94dda6323a/ISRN.BIOINFORMATICS2013-291741.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32b0/4393065/cbe8a3a5f0f6/ISRN.BIOINFORMATICS2013-291741.alg.001.jpg

相似文献

1
HMEC: A Heuristic Algorithm for Individual Haplotyping with Minimum Error Correction.HMEC:一种具有最小错误校正功能的个体单倍型启发式算法。
ISRN Bioinform. 2013 Jan 28;2013:291741. doi: 10.1155/2013/291741. eCollection 2013.
2
A fast and accurate algorithm for single individual haplotyping.一种用于单一个体单倍型分型的快速准确算法。
BMC Syst Biol. 2012;6 Suppl 2(Suppl 2):S8. doi: 10.1186/1752-0509-6-S2-S8. Epub 2012 Dec 12.
3
A heuristic algorithm for haplotype reconstruction from aligned weighted SNP fragments.
Int J Bioinform Res Appl. 2013;9(1):13-24. doi: 10.1504/IJBRA.2013.050743.
4
Better ILP models for haplotype assembly.更好的用于单体型组装的 ILP 模型。
BMC Bioinformatics. 2018 Feb 19;19(Suppl 1):52. doi: 10.1186/s12859-018-2012-x.
5
A model of higher accuracy for the individual haplotyping problem based on weighted SNP fragments and genotype with errors.一种基于加权单核苷酸多态性片段和含误差基因型的个体单倍型分型问题的更高精度模型。
Bioinformatics. 2008 Jul 1;24(13):i105-13. doi: 10.1093/bioinformatics/btn147.
6
A highly accurate heuristic algorithm for the haplotype assembly problem.一种用于单体型组装问题的高度精确启发式算法。
BMC Genomics. 2013;14 Suppl 2(Suppl 2):S2. doi: 10.1186/1471-2164-14-S2-S2. Epub 2013 Feb 15.
7
LGH: A Fast and Accurate Algorithm for Single Individual Haplotyping Based on a Two-Locus Linkage Graph.
IEEE/ACM Trans Comput Biol Bioinform. 2015 Nov-Dec;12(6):1255-66. doi: 10.1109/TCBB.2015.2430352.
8
Efficient fast heuristic algorithms for minimum error correction haplotyping from SNP fragments.用于从单核苷酸多态性(SNP)片段进行最小错误校正单倍型分型的高效快速启发式算法。
Int J Comput Biol Drug Des. 2014;7(4):358-68. doi: 10.1504/IJCBDD.2014.066543. Epub 2014 Dec 25.
9
A fast and accurate enumeration-based algorithm for haplotyping a triploid individual.一种用于对三倍体个体进行单倍型分型的基于计数的快速准确算法。
Algorithms Mol Biol. 2018 Jun 1;13:10. doi: 10.1186/s13015-018-0129-0. eCollection 2018.
10
A fast and accurate algorithm for diploid individual haplotype reconstruction.一种用于二倍体个体单倍型重建的快速准确算法。
J Bioinform Comput Biol. 2013 Aug;11(4):1350010. doi: 10.1142/S0219720013500108. Epub 2013 Jun 30.

引用本文的文献

1
Joint haplotype assembly and genotype calling via sequential Monte Carlo algorithm.通过序贯蒙特卡罗算法进行联合单倍型组装和基因型分型
BMC Bioinformatics. 2015 Jul 16;16:223. doi: 10.1186/s12859-015-0651-8.

本文引用的文献

1
A comparison of several algorithms for the single individual SNP haplotyping reconstruction problem.几种算法在单个体 SNP 单体型重构问题上的比较。
Bioinformatics. 2010 Sep 15;26(18):2217-25. doi: 10.1093/bioinformatics/btq411. Epub 2010 Jul 11.
2
Minimum conflict individual haplotyping from SNP fragments and related genotype.最小冲突个体单倍型分型来自 SNP 片段和相关基因型。
Evol Bioinform Online. 2007 Feb 16;2:261-70.
3
SpeedHap: an accurate heuristic for the single individual SNP haplotyping problem with many gaps, high reading error rate and low coverage.
SpeedHap:一种针对存在许多缺口、高读取错误率和低覆盖率的单一个体单核苷酸多态性单倍型分型问题的精确启发式算法。
IEEE/ACM Trans Comput Biol Bioinform. 2008 Oct-Dec;5(4):492-502. doi: 10.1109/TCBB.2008.67.
4
HapCUT: an efficient and accurate algorithm for the haplotype assembly problem.HapCUT:一种用于单倍型组装问题的高效且准确的算法。
Bioinformatics. 2008 Aug 15;24(16):i153-9. doi: 10.1093/bioinformatics/btn298.
5
Linear time probabilistic algorithms for the singular haplotype reconstruction problem from SNP fragments.用于从单核苷酸多态性(SNP)片段进行奇异单倍型重建问题的线性时间概率算法。
J Comput Biol. 2008 Jun;15(5):535-46. doi: 10.1089/cmb.2008.0003.
6
The diploid genome sequence of an individual human.某个人类个体的二倍体基因组序列。
PLoS Biol. 2007 Sep 4;5(10):e254. doi: 10.1371/journal.pbio.0050254.
7
A clustering algorithm based on two distance functions for MEC model.一种基于两种距离函数的用于MEC模型的聚类算法。
Comput Biol Chem. 2007 Apr;31(2):148-50. doi: 10.1016/j.compbiolchem.2007.02.001. Epub 2007 Feb 9.
8
Haplotype assembly from aligned weighted SNP fragments.基于比对加权单核苷酸多态性片段的单倍型组装
Comput Biol Chem. 2005 Aug;29(4):281-7. doi: 10.1016/j.compbiolchem.2005.05.001.
9
Haplotype reconstruction from SNP fragments by minimum error correction.通过最小错误校正从单核苷酸多态性(SNP)片段进行单倍型重建。
Bioinformatics. 2005 May 15;21(10):2456-62. doi: 10.1093/bioinformatics/bti352. Epub 2005 Feb 24.
10
Algorithmic strategies for the single nucleotide polymorphism haplotype assembly problem.
Brief Bioinform. 2002 Mar;3(1):23-31. doi: 10.1093/bib/3.1.23.