Suppr超能文献

拟南芥Ler的染色体水平组装揭示了易位和倒位多态性的程度。

Chromosome-level assembly of Arabidopsis thaliana Ler reveals the extent of translocation and inversion polymorphisms.

作者信息

Zapata Luis, Ding Jia, Willing Eva-Maria, Hartwig Benjamin, Bezdan Daniela, Jiao Wen-Biao, Patel Vipul, Velikkakam James Geo, Koornneef Maarten, Ossowski Stephan, Schneeberger Korbinian

机构信息

Bioinformatics and Genomics Programme, Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, 08003 Barcelona, Spain; Universitat Pompeu Fabra, 08002 Barcelona, Spain;

Department of Plant Breeding and Genetics, Max Planck Institute for Plant Breeding Research, 50829 Cologne, Germany;

出版信息

Proc Natl Acad Sci U S A. 2016 Jul 12;113(28):E4052-60. doi: 10.1073/pnas.1607532113. Epub 2016 Jun 27.

Abstract

Resequencing or reference-based assemblies reveal large parts of the small-scale sequence variation. However, they typically fail to separate such local variation into colinear and rearranged variation, because they usually do not recover the complement of large-scale rearrangements, including transpositions and inversions. Besides the availability of hundreds of genomes of diverse Arabidopsis thaliana accessions, there is so far only one full-length assembled genome: the reference sequence. We have assembled 117 Mb of the A. thaliana Landsberg erecta (Ler) genome into five chromosome-equivalent sequences using a combination of short Illumina reads, long PacBio reads, and linkage information. Whole-genome comparison against the reference sequence revealed 564 transpositions and 47 inversions comprising ∼3.6 Mb, in addition to 4.1 Mb of nonreference sequence, mostly originating from duplications. Although rearranged regions are not different in local divergence from colinear regions, they are drastically depleted for meiotic recombination in heterozygotes. Using a 1.2-Mb inversion as an example, we show that such rearrangement-mediated reduction of meiotic recombination can lead to genetically isolated haplotypes in the worldwide population of A. thaliana Moreover, we found 105 single-copy genes, which were only present in the reference sequence or the Ler assembly, and 334 single-copy orthologs, which showed an additional copy in only one of the genomes. To our knowledge, this work gives first insights into the degree and type of variation, which will be revealed once complete assemblies will replace resequencing or other reference-dependent methods.

摘要

重测序或基于参考序列的组装揭示了小规模序列变异的大部分情况。然而,它们通常无法将这种局部变异区分为共线性变异和重排变异,因为它们通常无法恢复大规模重排的互补序列,包括转座和倒位。除了有数百个不同拟南芥种质的基因组外,到目前为止只有一个全长组装基因组:参考序列。我们使用短读长的Illumina测序数据、长读长的PacBio测序数据和连锁信息,将117 Mb的拟南芥直立型(Ler)基因组组装成了五个与染色体等效的序列。与参考序列进行全基因组比较,除了4.1 Mb的非参考序列(大多源自重复)外,还发现了564个转座和47个倒位,共约3.6 Mb。尽管重排区域在局部差异上与共线性区域并无不同,但在杂合子中它们的减数分裂重组却大幅减少。以一个1.2 Mb的倒位为例,我们表明这种由重排介导的减数分裂重组减少可导致拟南芥全球种群中出现遗传隔离的单倍型。此外,我们发现了105个单拷贝基因,它们仅存在于参考序列或Ler组装中,以及334个单拷贝直系同源基因,它们在仅一个基因组中出现了额外的拷贝。据我们所知,这项工作首次深入了解了变异的程度和类型,一旦完整的组装取代重测序或其他依赖参考序列的方法,这些变异将会被揭示出来。

相似文献

3
Reference-guided assembly of four diverse Arabidopsis thaliana genomes.基于参考基因组的四个拟南芥基因组的组装。
Proc Natl Acad Sci U S A. 2011 Jun 21;108(25):10249-54. doi: 10.1073/pnas.1107739108. Epub 2011 Jun 6.
8
Sequencing of natural strains of Arabidopsis thaliana with short reads.对拟南芥自然菌株进行短读长测序。
Genome Res. 2008 Dec;18(12):2024-33. doi: 10.1101/gr.080200.108. Epub 2008 Sep 25.

引用本文的文献

3
Mapping-based genome size estimation.基于图谱的基因组大小估计
BMC Genomics. 2025 May 14;26(1):482. doi: 10.1186/s12864-025-11640-8.
5
Complete mitogenome and phylogenetic analysis of L. var.  Plenck.L. var. Plenck的完整线粒体基因组及系统发育分析
Mitochondrial DNA B Resour. 2025 Apr 2;10(5):331-336. doi: 10.1080/23802359.2024.2423830. eCollection 2025.

本文引用的文献

2
Improving the Annotation of Arabidopsis lyrata Using RNA-Seq Data.利用RNA测序数据改进琴叶拟南芥的注释
PLoS One. 2015 Sep 18;10(9):e0137391. doi: 10.1371/journal.pone.0137391. eCollection 2015.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验