Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA.
Plant Genome and Systems Biology, Helmholtz Zentrum München, Munich 85764, Germany.
G3 (Bethesda). 2021 Dec 8;11(12). doi: 10.1093/g3journal/jkab325.
Aegilops tauschii is the donor of the D subgenome of hexaploid wheat and an important genetic resource. The reference-quality genome sequence Aet v4.0 for Ae. tauschii acc. AL8/78 was therefore an important milestone for wheat biology and breeding. Further advances in sequencing acc. AL8/78 and release of the Aet v5.0 sequence assembly are reported here. Two new optical maps were constructed and used in the revision of pseudomolecules. Gaps were closed with Pacific Biosciences long-read contigs, decreasing the gap number by 38,899. Transposable elements and protein-coding genes were reannotated. The number of annotated high-confidence genes was reduced from 39,635 in Aet v4.0 to 32,885 in Aet v5.0. A total of 2245 biologically important genes, including those affecting plant phenology, grain quality, and tolerance of abiotic stresses in wheat, was manually annotated and disease-resistance genes were annotated by a dedicated pipeline. Disease-resistance genes encoding nucleotide-binding site domains, receptor-like protein kinases, and receptor-like proteins were preferentially located in distal chromosome regions, whereas those encoding transmembrane coiled-coil proteins were dispersed more evenly along the chromosomes. Discovery, annotation, and expression analyses of microRNA (miRNA) precursors, mature miRNAs, and phasiRNAs are reported, including miRNA target genes. Other small RNAs, such as hc-siRNAs and tRFs, were characterized. These advances enhance the utility of the Ae. tauschii genome sequence for wheat genetics, biotechnology, and breeding.
节节麦是六倍体普通小麦 D 基因组的供体,也是一种重要的遗传资源。因此,Ae. tauschii acc. AL8/78 的参考质量基因组序列 Aet v4.0 是小麦生物学和育种的重要里程碑。本文报道了 Ae. tauschii acc. AL8/78 测序的进一步进展和 Aet v5.0 序列组装的发布。构建了两个新的光学图谱,并用于假基因的修订。通过 Pacific Biosciences 长读长 contigs 封闭缺口,使缺口数量减少了 38899 个。转座元件和蛋白质编码基因被重新注释。注释的高可信度基因数量从 Aet v4.0 的 39635 个减少到 Aet v5.0 的 32885 个。共手动注释了 2245 个具有重要生物学意义的基因,包括影响小麦物候、籽粒品质和非生物胁迫耐受性的基因,并通过专门的管道注释了抗病基因。编码核苷酸结合位点结构域、受体样蛋白激酶和受体样蛋白的抗病基因优先位于染色体的远端区域,而编码跨膜卷曲螺旋蛋白的抗病基因则更均匀地分散在染色体上。本文还报告了 microRNA (miRNA) 前体、成熟 miRNA 和 phasiRNA 的发现、注释和表达分析,包括 miRNA 靶基因。还对其他小 RNA,如 hc-siRNAs 和 tRFs 进行了表征。这些进展提高了 Ae. tauschii 基因组序列在小麦遗传学、生物技术和育种中的应用价值。