Nature. 2012 Nov 29;491(7426):711-6. doi: 10.1038/nature11543. Epub 2012 Oct 17.
Barley (Hordeum vulgare L.) is among the world's earliest domesticated and most important crop plants. It is diploid with a large haploid genome of 5.1 gigabases (Gb). Here we present an integrated and ordered physical, genetic and functional sequence resource that describes the barley gene-space in a structured whole-genome context. We developed a physical map of 4.98 Gb, with more than 3.90 Gb anchored to a high-resolution genetic map. Projecting a deep whole-genome shotgun assembly, complementary DNA and deep RNA sequence data onto this framework supports 79,379 transcript clusters, including 26,159 'high-confidence' genes with homology support from other plant genomes. Abundant alternative splicing, premature termination codons and novel transcriptionally active regions suggest that post-transcriptional processing forms an important regulatory layer. Survey sequences from diverse accessions reveal a landscape of extensive single-nucleotide variation. Our data provide a platform for both genome-assisted research and enabling contemporary crop improvement.
大麦(Hordeum vulgare L.)是世界上最早被驯化和最重要的作物之一。它是二倍体,具有 51 亿碱基对(Gb)的大型单倍体基因组。在这里,我们提供了一个集成的、有序的物理、遗传和功能序列资源,该资源以结构化的全基因组背景描述了大麦的基因空间。我们开发了一个 49.8 Gb 的物理图谱,其中超过 39.0 Gb 锚定在高分辨率的遗传图谱上。将深度全基因组鸟枪法组装、cDNA 和深度 RNA 序列数据投射到这个框架上,支持 79379 个转录物簇,包括 26159 个具有同源性支持的“高可信度”基因来自其他植物基因组。丰富的选择性剪接、过早终止密码子和新的转录活性区域表明,转录后加工形成了一个重要的调控层。来自不同品系的调查序列揭示了广泛的单核苷酸变异景观。我们的数据为基因组辅助研究和当代作物改良提供了一个平台。