Rapid Low-Cost Assembly of the Drosophila melanogaster Reference Genome Using Low-Coverage, Long-Read Sequencing.

Author information

Solares Edwin A, Chakraborty Mahul, Miller Danny E, Kalsow Shannon, Hall Kate, Perera Anoja G, Emerson J J, Hawley R Scott

Affiliations

Department of Ecology and Evolutionary Biology, University of California Irvine, CA.

Stowers Institute for Medical Research, Kansas City, MO.

Publication information

G3 (Bethesda). 2018 Oct 3;8(10):3143-3154. doi: 10.1534/g3.118.200162.

DOI:10.1534/g3.118.200162

PMID:30018084

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC6169397/

Abstract

Accurate and comprehensive characterization of genetic variation is essential for deciphering the genetic basis of diseases and other phenotypes. A vast amount of genetic variation stems from large-scale sequence changes arising from the duplication, deletion, inversion, and translocation of sequences. In the past 10 years, high-throughput short reads have greatly expanded our ability to assay sequence variation due to single nucleotide polymorphisms. However, a recent assembly of a second reference genome has revealed that short read genotyping methods miss hundreds of structural variants, including those affecting phenotypes. While genomes assembled using high-coverage long reads can achieve high levels of contiguity and completeness, concerns about cost, errors, and low yield have limited widespread adoption of such sequencing approaches. Here we resequenced the reference strain of Drosophila melanogaster (ISO1) on a single Oxford Nanopore MinION flow cell run for 24 hr. Using only reads longer than 1 kb or with at least 30x coverage, we assembled a highly contiguous genome. The addition of inexpensive paired reads and subsequent scaffolding using an optical map technology achieved an assembly with completeness and contiguity comparable to the reference assembly. Comparison of our assembly to the reference assembly of ISO1 uncovered a number of structural variants (SVs), including novel LTR transposable element insertions and duplications affecting genes with developmental, behavioral, and metabolic functions. Collectively, these SVs provide a snapshot of the dynamics of genome evolution. Furthermore, our assembly and comparison to the reference genome demonstrate that high-quality assembly of reference genomes and comprehensive variant discovery using such assemblies are now possible by a single lab for under $1,000 (USD).
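
The read-length cutoff, coverage figure, and contiguity claim in the abstract rest on simple arithmetic, sketched below. This is an illustration only, not the authors' pipeline: the function names, the toy read lengths, and the genome size used in the usage note are all assumptions made for the example.

```python
from typing import List


def filter_reads(lengths: List[int], min_length: int = 1000) -> List[int]:
    """Keep only read lengths strictly longer than min_length (in bp)."""
    return [n for n in lengths if n > min_length]


def coverage(lengths: List[int], genome_size: int) -> float:
    """Average depth of coverage: total sequenced bases / genome size."""
    return sum(lengths) / genome_size


def n50(lengths: List[int]) -> int:
    """Length L such that sequences of length >= L hold at least half the bases.

    Works for read lengths or contig lengths; returns 0 for an empty list.
    """
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for n in ordered:
        running += n
        if running >= half:
            return n
    return 0


if __name__ == "__main__":
    # Toy read lengths in bp (hypothetical values, not data from the paper).
    reads = [500, 1000, 1500, 8000, 12000]
    kept = filter_reads(reads)
    # genome_size here is an arbitrary toy value chosen for readability.
    print(kept, coverage(kept, genome_size=1000), n50(kept))
```

With real data, the read lengths would come from the sequencer's FASTQ output and `genome_size` from an independent estimate for the organism. N50 computed over contig lengths is one common way to quantify the contiguity the abstract refers to.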
