Reference-assisted chromosome assembly.

Affiliation

Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.

Publication information

Proc Natl Acad Sci U S A. 2013 Jan 29;110(5):1785-90. doi: 10.1073/pnas.1220349110. Epub 2013 Jan 10.

Abstract

One of the most difficult problems in modern genomics is the assembly of full-length chromosomes using next generation sequencing (NGS) data. To address this problem, we developed "reference-assisted chromosome assembly" (RACA), an algorithm to reliably order and orient sequence scaffolds generated by NGS and assemblers into longer chromosomal fragments using comparative genome information and paired-end reads. Evaluation of results using simulated and real genome assemblies indicates that our approach can substantially improve genomes generated by a wide variety of de novo assemblers if a good reference assembly of a closely related species and outgroup genomes are available. We used RACA to reconstruct 60 Tibetan antelope (Pantholops hodgsonii) chromosome fragments from 1,434 SOAPdenovo sequence scaffolds, of which 16 chromosome fragments were homologous to complete cattle chromosomes. Experimental validation by PCR showed that predictions made by RACA are highly accurate. Our results indicate that RACA will significantly facilitate the study of chromosome evolution and genome rearrangements for the large number of genomes being sequenced by NGS that do not have a genetic or physical map.
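The abstract describes ordering and orienting scaffolds into longer chromosomal fragments, but it does not spell out the procedure. As a rough illustration only, and not RACA's actual implementation, the sketch below assumes that each candidate join between two scaffold ends already carries a single score (imagined here as combining comparative-genome and paired-end support) and greedily accepts the best non-conflicting joins. The function name `chain_scaffolds`, the `min_score` threshold, and the score values are all hypothetical.

```python
from typing import Dict, List, Tuple

# A scaffold end is (scaffold name, "h" for head or "t" for tail).
End = Tuple[str, str]
# A candidate adjacency joins two scaffold ends with a support score.
Adjacency = Tuple[End, End, float]


def chain_scaffolds(
    scaffolds: List[str],
    adjacencies: List[Adjacency],
    min_score: float = 0.5,  # hypothetical cutoff, not taken from the paper
) -> List[List[Tuple[str, str]]]:
    """Greedily join scaffold ends into ordered, oriented linear chains."""
    # Union-find over scaffolds, used to reject joins that would close a cycle.
    parent = {s: s for s in scaffolds}

    def find(s: str) -> str:
        while parent[s] != s:
            parent[s] = parent[parent[s]]  # path halving
            s = parent[s]
        return s

    partner: Dict[End, End] = {}
    for a, b, score in sorted(adjacencies, key=lambda adj: -adj[2]):
        if score < min_score:
            break  # remaining candidates are all weaker
        if a in partner or b in partner:
            continue  # each scaffold end can be joined only once
        root_a, root_b = find(a[0]), find(b[0])
        if root_a == root_b:
            continue  # both ends already lie in the same chain
        partner[a], partner[b] = b, a
        parent[root_a] = root_b

    # Walk every chain from a scaffold that still has a free end.
    chains: List[List[Tuple[str, str]]] = []
    seen = set()
    for s in scaffolds:
        if s in seen or ((s, "h") in partner and (s, "t") in partner):
            continue  # already placed, or interior to some chain
        entry = "h" if (s, "h") not in partner else "t"
        current: End = (s, entry)
        chain: List[Tuple[str, str]] = []
        while True:
            name, end_in = current
            seen.add(name)
            # Entering at the head means the scaffold is read forward ("+").
            chain.append((name, "+" if end_in == "h" else "-"))
            exit_end = (name, "t" if end_in == "h" else "h")
            if exit_end not in partner:
                break
            current = partner[exit_end]
        chains.append(chain)
    return chains


# Made-up scores for four scaffolds.
example = [
    (("s1", "t"), ("s2", "h"), 0.9),
    (("s2", "t"), ("s3", "t"), 0.8),
    (("s1", "t"), ("s3", "h"), 0.7),  # rejected: s1 tail is already used
    (("s3", "h"), ("s1", "h"), 0.6),  # rejected: would close a cycle
    (("s4", "h"), ("s2", "t"), 0.3),  # rejected: below min_score
]
result = chain_scaffolds(["s1", "s2", "s3", "s4"], example)
print(result)
```

In this toy input, `s3` is joined by its tail and therefore appears in reverse orientation, while `s4` has no sufficiently supported join and stays as a single-scaffold chain. A greedy pass is only one simple way to resolve conflicting joins; it is used here for brevity.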
