• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用异或基因型提高单倍型组装。

Improved haplotype assembly using Xor genotypes.

机构信息

Department of Computer Engineering and Information Technology, Isfahan University of Technology, Isfahan 84156-83111, Iran.

出版信息

J Theor Biol. 2012 Apr 7;298:122-30. doi: 10.1016/j.jtbi.2012.01.003. Epub 2012 Jan 12.

DOI:10.1016/j.jtbi.2012.01.003
PMID:22251889
Abstract

Given a set of aligned fragments, haplotype assembly is the problem of finding the haplotypes from which the fragments have been read. The problem is important because haplotypes contain SNP information, which is essential to many genomic analyses such as the analysis of potential association between certain diseases and genetic variations. The current state-of-the-art haplotype assembly algorithm, HapSAT, does not exploit genotype information and only receives a read matrix as input. However, the imminent importance of haplotypes and inexpensiveness of genotype information motivate for exploiting genotype information to obtain more accurate haplotypes. In this paper, an improved haplotype assembly method, xGenHapSAT, is proposed, which exploits xor genotype information for more accurate haplotype assembly. Xor genotype information is even less expensive than full genotype information, e.g., using the Denaturing High-Performance Liquid Chromatography (DHPLC) technique. It is shown that using this inexpensively obtainable information significantly improves the accuracy of the assembled haplotypes. In addition, a new, more efficient, Max-2-SAT formulation is adopted in xGenHapSAT, which, on average, increases the speed of the algorithm. Moreover, the proposed xGenHapSAT method replaces the current state-of-the-art haplotype assembly method based on genotype information. Finally, our state-of-the-art haplotype assembly software, HapSoft, which includes both xGenHapSAT and HapSAT, is made freely available for research purposes.

摘要

给定一组对齐的片段,单倍型组装就是从这些片段中找到单倍型的问题。这个问题很重要,因为单倍型包含 SNP 信息,这对于许多基因组分析是必不可少的,如某些疾病和遗传变异之间潜在关联的分析。当前最先进的单倍型组装算法 HapSAT 没有利用基因型信息,只接收一个读取矩阵作为输入。然而,单倍型的迫切重要性和基因型信息的低廉价格促使我们利用基因型信息来获得更准确的单倍型。在本文中,提出了一种改进的单倍型组装方法 xGenHapSAT,该方法利用异或基因型信息进行更准确的单倍型组装。异或基因型信息甚至比全基因型信息更便宜,例如使用变性高效液相色谱(DHPLC)技术。结果表明,利用这种廉价可得的信息可以显著提高组装单倍型的准确性。此外,在 xGenHapSAT 中采用了一种新的、更有效的 Max-2-SAT 公式,这平均提高了算法的速度。此外,所提出的 xGenHapSAT 方法取代了基于基因型信息的当前最先进的单倍型组装方法。最后,我们的最先进的单倍型组装软件 HapSoft,其中包括 xGenHapSAT 和 HapSAT,为研究目的免费提供。

相似文献

1
Improved haplotype assembly using Xor genotypes.利用异或基因型提高单倍型组装。
J Theor Biol. 2012 Apr 7;298:122-30. doi: 10.1016/j.jtbi.2012.01.003. Epub 2012 Jan 12.
2
Haplotype assembly from aligned weighted SNP fragments.基于比对加权单核苷酸多态性片段的单倍型组装
Comput Biol Chem. 2005 Aug;29(4):281-7. doi: 10.1016/j.compbiolchem.2005.05.001.
3
Effective haplotype assembly via maximum Boolean satisfiability.通过最大布尔可满足性有效组装单倍型。
Biochem Biophys Res Commun. 2011 Jan 14;404(2):593-8. doi: 10.1016/j.bbrc.2010.12.001. Epub 2010 Dec 7.
4
Maximum likelihood model based on minor allele frequencies and weighted Max-SAT formulation for haplotype assembly.基于次要等位基因频率的最大似然模型和用于单倍型组装的加权最大可满足性公式
J Theor Biol. 2014 Jun 7;350:49-56. doi: 10.1016/j.jtbi.2014.01.036. Epub 2014 Jan 31.
5
An improved heuristic for haplotype inference.一种改进的单体型推断启发式方法。
Gene. 2012 Oct 10;507(2):177-82. doi: 10.1016/j.gene.2012.06.032. Epub 2012 Jul 7.
6
Algorithm for haplotype resolution and block partitioning for partial XOR-genotype data.部分 XOR 基因型数据的单体型分辨率和块分区算法。
J Biomed Inform. 2010 Feb;43(1):51-9. doi: 10.1016/j.jbi.2009.08.009. Epub 2009 Aug 20.
7
A Markov chain model for haplotype assembly from SNP fragments.一种用于从单核苷酸多态性(SNP)片段进行单倍型组装的马尔可夫链模型。
Genome Inform. 2006;17(2):162-71.
8
Estimating haplotype frequencies and standard errors for multiple single nucleotide polymorphisms.估计多个单核苷酸多态性的单倍型频率和标准误差。
Biostatistics. 2003 Oct;4(4):513-22. doi: 10.1093/biostatistics/4.4.513.
9
Tag SNP selection in genotype data for maximizing SNP prediction accuracy.在基因型数据中选择标签单核苷酸多态性以最大化单核苷酸多态性预测准确性。
Bioinformatics. 2005 Jun;21 Suppl 1:i195-203. doi: 10.1093/bioinformatics/bti1021.
10
Estimating population haplotype frequencies from pooled SNP data using incomplete database information.基于不完全的数据库信息,从合并的 SNP 数据中估计群体单体型频率。
Bioinformatics. 2009 Dec 15;25(24):3296-302. doi: 10.1093/bioinformatics/btp584. Epub 2009 Oct 27.