• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种快速实用的方法,用于在存在错误和不完整信息的家系中进行基因型相位和推断。

A fast and practical approach to genotype phasing and imputation on a pedigree with erroneous and incomplete information.

机构信息

Dipartimento di Informatica Sistemistica e Comunicazione (DISCo), Università degli Studi di Milano-Bicocca, V.le Sarca, 336, Milan 20126, Italy.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2012 Nov-Dec;9(6):1582-94. doi: 10.1109/TCBB.2012.100.

DOI:10.1109/TCBB.2012.100
PMID:22848137
Abstract

The MINIMUM-RECOMBINANT HAPLOTYPE CONFIGURATION problem (MRHC) has been highly successful in providing a sound combinatorial formulation for the important problem of genotype phasing on pedigrees. Despite several algorithmic advances that have improved the efficiency, its applicability to real data sets has been limited since it does not take into account some important phenomena such as mutations, genotyping errors, and missing data. In this work, we propose the MINIMUM-RECOMBINANT HAPLOTYPE CONFIGURATION WITH BOUNDED ERRORS problem (MRHCE), which extends the original MRHC formulation by incorporating the two most common characteristics of real data: errors and missing genotypes (including untyped individuals). We describe a practical algorithm for MRHCE that is based on a reduction to the well-known Satisfiability problem (SAT) and exploits recent advances in the constraint programming literature. An experimental analysis demonstrates the biological soundness of the phasing model and the effectiveness (on both accuracy and performance) of the algorithm under several scenarios. The analysis on real data and the comparison with state-of-the-art programs reveals that our approach couples better scalability to large and complex pedigrees with the explicit inclusion of genotyping errors into the model.

摘要

最小重组单倍型配置问题(MRHC)在为系谱上的基因型相位重要问题提供合理的组合公式方面非常成功。尽管已经有几项算法改进提高了效率,但由于它没有考虑到一些重要的现象,如突变、基因分型错误和缺失数据,因此其在实际数据集上的应用受到限制。在这项工作中,我们提出了具有边界错误的最小重组单倍型配置问题(MRHCE),该问题通过将真实数据的两个最常见特征(错误和缺失基因型(包括未分型个体))纳入原始 MRHC 公式来扩展。我们描述了一种基于著名可满足性问题(SAT)的实用算法,并利用约束编程文献中的最新进展。实验分析证明了相位模型的生物学合理性,以及在几种情况下算法的有效性(准确性和性能)。对真实数据的分析以及与最先进程序的比较表明,我们的方法将更好的可扩展性与对模型中基因分型错误的明确纳入相结合,适用于大型和复杂的系谱。

相似文献

1
A fast and practical approach to genotype phasing and imputation on a pedigree with erroneous and incomplete information.一种快速实用的方法,用于在存在错误和不完整信息的家系中进行基因型相位和推断。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Nov-Dec;9(6):1582-94. doi: 10.1109/TCBB.2012.100.
2
An efficient algorithm for haplotype inference on pedigrees with recombinations and mutations.一种用于存在重组和突变的家系中单体型推断的高效算法。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jan-Feb;9(1):12-25. doi: 10.1109/TCBB.2011.51. Epub 2011 Mar 3.
3
Computing the minimum recombinant haplotype configuration from incomplete genotype data on a pedigree by integer linear programming.通过整数线性规划从家系的不完整基因型数据计算最小重组单倍型构型。
J Comput Biol. 2005 Jul-Aug;12(6):719-39. doi: 10.1089/cmb.2005.12.719.
4
A comparison of different algorithms for phasing haplotypes using Holstein cattle genotypes and pedigree data.使用荷斯坦奶牛基因型和系谱数据对不同单倍型定相算法的比较。
J Dairy Sci. 2017 Apr;100(4):2837-2849. doi: 10.3168/jds.2016-11590. Epub 2017 Feb 1.
5
Inferring haplotypes from genotypes on a pedigree with mutations, genotyping errors and missing alleles.在存在突变、基因分型错误和等位基因缺失的家系中从基因型推断单倍型。
J Bioinform Comput Biol. 2011 Apr;9(2):339-65. doi: 10.1142/s0219720011005549.
6
HAPLORE: a program for haplotype reconstruction in general pedigrees without recombination.HAPLORE:一个用于在无重组的一般家系中进行单倍型重建的程序。
Bioinformatics. 2005 Jan 1;21(1):90-103. doi: 10.1093/bioinformatics/bth388. Epub 2004 Jul 1.
7
FNphasing: a novel fast heuristic algorithm for haplotype phasing based on flow network model.FNphasing:一种基于流网络模型的新型快速单体型相位推断启发式算法。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):372-82. doi: 10.1109/TCBB.2013.18.
8
Efficient inference of haplotypes from genotypes on a pedigree.从系谱中的基因型高效推断单倍型。
J Bioinform Comput Biol. 2003 Apr;1(1):41-69. doi: 10.1142/s0219720003000204.
9
Efficient inference of haplotypes from genotypes on a large animal pedigree.在大型动物谱系中从基因型高效推断单倍型。
Genetics. 2006 Mar;172(3):1757-65. doi: 10.1534/genetics.105.047134. Epub 2005 Dec 15.
10
A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree.用于在系谱上重建零重组单倍型结构的线性时间算法。
BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S19. doi: 10.1186/1471-2105-13-S17-S19. Epub 2012 Dec 13.