• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

最大简约性异或单倍型推断的稀疏字典选择方法。

Maximum parsimony xor haplotyping by sparse dictionary selection.

机构信息

Department of Electrical Engineering, Columbia University, 500 W 120th St, New York, 10027 NY, USA.

出版信息

BMC Genomics. 2013 Sep 23;14:645. doi: 10.1186/1471-2164-14-645.

DOI:10.1186/1471-2164-14-645
PMID:24059285
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3852077/
Abstract

BACKGROUND

Xor-genotype is a cost-effective alternative to the genotype sequence of an individual. Recent methods developed for haplotype inference have aimed at finding the solution based on xor-genotype data. Given the xor-genotypes of a group of unrelated individuals, it is possible to infer the haplotype pairs for each individual with the aid of a small number of regular genotypes.

RESULTS

We propose a framework of maximum parsimony inference of haplotypes based on the search of a sparse dictionary, and we present a greedy method that can effectively infer the haplotype pairs given a set of xor-genotypes augmented by a small number of regular genotypes. We test the performance of the proposed approach on synthetic data sets with different number of individuals and SNPs, and compare the performances with the state-of-the-art xor-haplotyping methods PPXH and XOR-HAPLOGEN.

CONCLUSIONS

Experimental results show good inference qualities for the proposed method under all circumstances, especially on large data sets. Results on a real database, CFTR, also demonstrate significantly better performance. The proposed algorithm is also capable of finding accurate solutions with missing data and/or typing errors.

摘要

背景

异或基因型是一种比个体基因型序列更具成本效益的选择。最近开发的用于单倍型推断的方法旨在基于异或基因型数据找到解决方案。给定一组无关个体的异或基因型,可以借助少数常规基因型来推断每个个体的单倍型对。

结果

我们提出了一种基于稀疏字典搜索的最大简约单倍型推断框架,并提出了一种贪婪方法,该方法可以在给定一组异或基因型并增加少量常规基因型的情况下有效地推断单倍型对。我们在具有不同个体和 SNP 数量的合成数据集上测试了所提出方法的性能,并将性能与最先进的异或单倍型方法 PPXH 和 XOR-HAPLOGEN 进行了比较。

结论

实验结果表明,该方法在所有情况下都具有良好的推断质量,尤其是在大型数据集上。在真实数据库 CFTR 上的结果也证明了其性能显著提高。该算法还能够在存在缺失数据和/或打字错误的情况下找到准确的解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/631ec34f47e4/1471-2164-14-645-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/96a6f301202b/1471-2164-14-645-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/b215b12a9bb1/1471-2164-14-645-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/e7f43d3cac0e/1471-2164-14-645-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/8730dd94a446/1471-2164-14-645-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/6bae273993cd/1471-2164-14-645-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/b81dfa15ca0a/1471-2164-14-645-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/020d8cbf0ad0/1471-2164-14-645-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/9683da0fb93f/1471-2164-14-645-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/6d2294b04168/1471-2164-14-645-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/985035685517/1471-2164-14-645-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/c9610f9d8fa0/1471-2164-14-645-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/631ec34f47e4/1471-2164-14-645-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/96a6f301202b/1471-2164-14-645-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/b215b12a9bb1/1471-2164-14-645-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/e7f43d3cac0e/1471-2164-14-645-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/8730dd94a446/1471-2164-14-645-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/6bae273993cd/1471-2164-14-645-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/b81dfa15ca0a/1471-2164-14-645-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/020d8cbf0ad0/1471-2164-14-645-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/9683da0fb93f/1471-2164-14-645-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/6d2294b04168/1471-2164-14-645-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/985035685517/1471-2164-14-645-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/c9610f9d8fa0/1471-2164-14-645-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90ba/3852077/631ec34f47e4/1471-2164-14-645-12.jpg

相似文献

1
Maximum parsimony xor haplotyping by sparse dictionary selection.最大简约性异或单倍型推断的稀疏字典选择方法。
BMC Genomics. 2013 Sep 23;14:645. doi: 10.1186/1471-2164-14-645.
2
Pure parsimony xor haplotyping.纯简约性或单体型分析。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Oct-Dec;7(4):598-610. doi: 10.1109/TCBB.2010.52.
3
Algorithm for haplotype resolution and block partitioning for partial XOR-genotype data.部分 XOR 基因型数据的单体型分辨率和块分区算法。
J Biomed Inform. 2010 Feb;43(1):51-9. doi: 10.1016/j.jbi.2009.08.009. Epub 2009 Aug 20.
4
Improved haplotype assembly using Xor genotypes.利用异或基因型提高单倍型组装。
J Theor Biol. 2012 Apr 7;298:122-30. doi: 10.1016/j.jtbi.2012.01.003. Epub 2012 Jan 12.
5
CollHaps: a heuristic approach to haplotype inference by parsimony.CollHaps:一种基于简约法的单倍型推断启发式方法。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Jul-Sep;7(3):511-23. doi: 10.1109/TCBB.2008.130.
6
An improved preprocessing algorithm for haplotype inference by pure parsimony.一种通过纯简约法进行单倍型推断的改进预处理算法。
J Bioinform Comput Biol. 2014 Aug;12(4):1450020. doi: 10.1142/S0219720014500206. Epub 2014 Aug 1.
7
Estimating haplotype frequencies and standard errors for multiple single nucleotide polymorphisms.估计多个单核苷酸多态性的单倍型频率和标准误差。
Biostatistics. 2003 Oct;4(4):513-22. doi: 10.1093/biostatistics/4.4.513.
8
Joint haplotype assembly and genotype calling via sequential Monte Carlo algorithm.通过序贯蒙特卡罗算法进行联合单倍型组装和基因型分型
BMC Bioinformatics. 2015 Jul 16;16:223. doi: 10.1186/s12859-015-0651-8.
9
Computational problems in perfect phylogeny haplotyping: typing without calling the allele.完美系统发育单倍型分型中的计算问题:无需确定等位基因的分型
IEEE/ACM Trans Comput Biol Bioinform. 2008 Jan-Mar;5(1):101-9. doi: 10.1109/TCBB.2007.1063.
10
A preprocessing procedure for haplotype inference by pure parsimony.基于简约法推断单体型的预处理过程。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Sep-Oct;8(5):1183-95. doi: 10.1109/TCBB.2010.125.

本文引用的文献

1
Current methods for high-throughput detection of novel DNA polymorphisms.
Drug Discov Today Technol. 2006 Summer;3(2):123-9. doi: 10.1016/j.ddtec.2006.05.002.
2
Hap-seq: an optimal algorithm for haplotype phasing with imputation using sequencing data.Hap-seq:一种利用测序数据进行单倍型定相及插补的优化算法。
J Comput Biol. 2013 Feb;20(2):80-92. doi: 10.1089/cmb.2012.0091.
3
A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree.用于在系谱上重建零重组单倍型结构的线性时间算法。
BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S19. doi: 10.1186/1471-2105-13-S17-S19. Epub 2012 Dec 13.
4
A unified framework for haplotype inference in nuclear families.核心家庭单倍型推断的统一框架。
Ann Hum Genet. 2012 Jul;76(4):312-25. doi: 10.1111/j.1469-1809.2012.00715.x. Epub 2012 May 21.
5
Algorithm for haplotype inference via galled-tree networks with simple galls.
J Comput Biol. 2012 Apr;19(4):439-54. doi: 10.1089/cmb.2010.0145.
6
A polymorphism in the chromosome 9p21 ANRIL locus is associated to Philadelphia positive acute lymphoblastic leukemia.9p21 染色体上 ANRIL 基因座的多态性与费城阳性急性淋巴细胞白血病相关。
Leuk Res. 2011 Aug;35(8):1052-9. doi: 10.1016/j.leukres.2011.02.020. Epub 2011 Mar 16.
7
MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes.MaCH:利用序列和基因型数据来估计单倍型和未观测基因型。
Genet Epidemiol. 2010 Dec;34(8):816-34. doi: 10.1002/gepi.20533.
8
CollHaps: a heuristic approach to haplotype inference by parsimony.CollHaps:一种基于简约法的单倍型推断启发式方法。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Jul-Sep;7(3):511-23. doi: 10.1109/TCBB.2008.130.
9
Optimal algorithms for haplotype assembly from whole-genome sequence data.从全基因组序列数据中进行单倍型组装的最优算法。
Bioinformatics. 2010 Jun 15;26(12):i183-90. doi: 10.1093/bioinformatics/btq215.
10
Pure parsimony xor haplotyping.纯简约性或单体型分析。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Oct-Dec;7(4):598-610. doi: 10.1109/TCBB.2010.52.