Suppr超能文献

HAT:使用短读长和易错长读进行单体型组装的工具。

HAT: haplotype assembly tool using short and error-prone long reads.

机构信息

Delft Bioinformatics Lab, Delft University of Technology Van Mourik, 2628 XE Delft, The Netherlands.

Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.

出版信息

Bioinformatics. 2022 Dec 13;38(24):5352-5359. doi: 10.1093/bioinformatics/btac702.

Abstract

MOTIVATION

Haplotypes are the set of alleles co-occurring on a single chromosome and inherited together to the next generation. Because a monoploid reference genome loses this co-occurrence information, it has limited use in associating phenotypes with allelic combinations of genotypes. Therefore, methods to reconstruct the complete haplotypes from DNA sequencing data are crucial. Recently, several attempts have been made at haplotype reconstructions, but significant limitations remain. High-quality continuous haplotypes cannot be created reliably, particularly when there are few differences between the homologous chromosomes.

RESULTS

Here, we introduce HAT, a haplotype assembly tool that exploits short and long reads along with a reference genome to reconstruct haplotypes. HAT tries to take advantage of the accuracy of short reads and the length of the long reads to reconstruct haplotypes. We tested HAT on the aneuploid yeast strain Saccharomyces pastorianus CBS1483 and multiple simulated polyploid datasets of the same strain, showing that it outperforms existing tools.

AVAILABILITY AND IMPLEMENTATION

https://github.com/AbeelLab/hat/.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

单倍型是指在同一染色体上共同发生并遗传到下一代的等位基因集合。由于单倍体参考基因组丢失了这种共同发生的信息,因此它在将表型与基因型的等位基因组合相关联方面的用途有限。因此,从 DNA 测序数据中重建完整单倍型的方法至关重要。最近,已经有几种尝试进行单倍型重建,但仍然存在显著的局限性。无法可靠地创建高质量的连续单倍型,尤其是当同源染色体之间的差异很少时。

结果

在这里,我们引入了 HAT,这是一种单倍型组装工具,它利用短读长和长读长以及参考基因组来重建单倍型。HAT 试图利用短读长的准确性和长读长的长度来重建单倍型。我们在非整倍体酵母菌株 Saccharomyces pastorianus CBS1483 以及同一菌株的多个模拟多倍体数据集上测试了 HAT,结果表明它优于现有工具。

可用性和实现

https://github.com/AbeelLab/hat/。

补充信息

补充数据可在 Bioinformatics 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad67/9750119/21dcfad9c971/btac702f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验