Nakandala Upuli, Masouleh Ardashir Kharabian, Smith Malcolm W, Furtado Agnelo, Mason Patrick, Constantin Lena, Henry Robert J
Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Brisbane 4072, Australia.
ARC Centre of Excellence for Plant Success in Nature and Agriculture, University of Queensland, Brisbane 4072, Australia.
Hortic Res. 2023 Apr 3;10(5):uhad058. doi: 10.1093/hr/uhad058. eCollection 2023 May.
Recent advances in genome sequencing and assembly techniques have made it possible to achieve chromosome level reference genomes for citrus. Relatively few genomes have been anchored at the chromosome level and/or are haplotype phased, with the available genomes of varying accuracy and completeness. We now report a phased high-quality chromosome level genome assembly for an Australian native citrus species; (round lime) using highly accurate PacBio HiFi long reads, complemented with Hi-C scaffolding. Hifiasm with Hi-C integrated assembly resulted in a 331 Mb genome of with two haplotypes of nine pseudochromosomes with an N50 of 36.3 Mb and 98.8% genome assembly completeness (BUSCO). Repeat analysis showed that more than 50% of the genome contained interspersed repeats. Among them, LTR elements were the predominant type (21.0%), of which LTR Gypsy (9.8%) and LTR copia (7.7%) elements were the most abundant repeats. A total of 29 464 genes and 32 009 transcripts were identified in the genome. Of these, 28 222 CDS (25 753 genes) had BLAST hits and 21 401 CDS (75.8%) were annotated with at least one GO term. Citrus specific genes for antimicrobial peptides, defense, volatile compounds and acidity regulation were identified. The synteny analysis showed conserved regions between the two haplotypes with some structural variations in Chromosomes 2, 4, 7 and 8. This chromosome scale, and haplotype resolved genome will facilitate the study of important genes for citrus breeding and will also allow the enhanced definition of the evolutionary relationships between wild and domesticated citrus species.
基因组测序和组装技术的最新进展使得获得柑橘的染色体水平参考基因组成为可能。相对较少的基因组已被定位到染色体水平和/或进行了单倍型定相,现有基因组的准确性和完整性各不相同。我们现在报告了一种澳大利亚本土柑橘品种(圆柠檬)的定相高质量染色体水平基因组组装,该组装使用了高度准确的PacBio HiFi长读长,并辅以Hi-C支架构建。结合Hi-C的Hifiasm组装产生了一个331 Mb的基因组,包含两个单倍型的九条假染色体,N50为36.3 Mb,基因组组装完整性为98.8%(BUSCO)。重复序列分析表明,超过50%的基因组包含散在重复序列。其中,LTR元件是主要类型(21.0%),其中LTR Gypsy(9.8%)和LTR copia(7.7%)元件是最丰富的重复序列。基因组中总共鉴定出29464个基因和32009个转录本。其中,28222个CDS(25753个基因)有BLAST比对结果,21401个CDS(75.8%)至少有一个GO术语注释。鉴定出了柑橘特异性的抗菌肽、防御、挥发性化合物和酸度调节基因。共线性分析显示两个单倍型之间存在保守区域,但在染色体2、4、7和8中存在一些结构变异。这个染色体规模且解析了单倍型的基因组将有助于柑橘育种重要基因的研究,也将有助于更清晰地界定野生和驯化柑橘物种之间的进化关系。