Department of Biology, University of Rochester; Rochester, New York, United States of America.
Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America.
PLoS Biol. 2019 May 14;17(5):e3000241. doi: 10.1371/journal.pbio.3000241. eCollection 2019 May.
Centromeres are essential chromosomal regions that mediate kinetochore assembly and spindle attachments during cell division. Despite their functional conservation, centromeres are among the most rapidly evolving genomic regions and can shape karyotype evolution and speciation across taxa. Although significant progress has been made in identifying centromere-associated proteins, the highly repetitive centromeres of metazoans have been refractory to DNA sequencing and assembly, leaving large gaps in our understanding of their functional organization and evolution. Here, we identify the sequence composition and organization of the centromeres of Drosophila melanogaster by combining long-read sequencing, chromatin immunoprecipitation for the centromeric histone CENP-A, and high-resolution chromatin fiber imaging. Contrary to previous models that heralded satellite repeats as the major functional components, we demonstrate that functional centromeres form on islands of complex DNA sequences enriched in retroelements that are flanked by large arrays of satellite repeats. Each centromere displays distinct size and arrangement of its DNA elements but is similar in composition overall. We discover that a specific retroelement, G2/Jockey-3, is the most highly enriched sequence in CENP-A chromatin and is the only element shared among all centromeres. G2/Jockey-3 is also associated with CENP-A in the sister species D. simulans, revealing an unexpected conservation despite the reported turnover of centromeric satellite DNA. Our work reveals the DNA sequence identity of the active centromeres of a premier model organism and implicates retroelements as conserved features of centromeric DNA.
着丝粒是染色体的重要区域,在细胞分裂过程中介导动粒组装和纺锤体附着。尽管它们具有功能保守性,但着丝粒是进化最快的基因组区域之一,可以塑造核型进化和分类群的物种形成。尽管在鉴定着丝粒相关蛋白方面已经取得了重大进展,但后生动物的高度重复着丝粒一直难以进行 DNA 测序和组装,这使得我们对其功能组织和进化的理解存在很大的空白。在这里,我们通过结合长读测序、着丝粒组蛋白 CENP-A 的染色质免疫沉淀和高分辨率染色质纤维成像,鉴定了黑腹果蝇的着丝粒的序列组成和组织。与以前的模型相反,这些模型认为卫星重复序列是主要的功能成分,我们证明功能着丝粒形成于富含反转录元件的复杂 DNA 序列岛上,这些序列被大量卫星重复序列包围。每个着丝粒的 DNA 元件大小和排列都不同,但总体组成相似。我们发现,一个特定的反转录元件 G2/Jockey-3,是 CENP-A 染色质中最丰富的序列,并且是所有着丝粒共有的唯一元件。G2/Jockey-3 也与姐妹种 D. simulans 中的 CENP-A 相关,尽管报道了着丝粒卫星 DNA 的周转,但仍揭示了出人意料的保守性。我们的工作揭示了一个主要模式生物的活性着丝粒的 DNA 序列同一性,并暗示反转录元件是着丝粒 DNA 的保守特征。