Patel Sagar, Harris Zachary N, Londo Jason P, Miller Allison, Fennell Anne
Saint Louis University, Department of Biology, 3507 Laclede Ave, St. Louis, MO 63103, USA.
Donald Danforth Plant Science Center, 975 North Warson Road, St. Louis, MO 63132, USA.
GigaByte. 2023 Jul 3;2023:gigabyte84. doi: 10.46471/gigabyte.84. eCollection 2023.
'Chambourcin' is a French-American interspecific hybrid grape grown in the eastern and midwestern United States and used for making wine. Few genomic resources are available for hybrid grapevines like 'Chambourcin'. Here, we assembled the genome of 'Chambourcin' using PacBio HiFi long-read, Bionano optical map, and Illumina short-read sequencing technologies. We generated an assembly for 'Chambourcin' with 26 scaffolds, with an N50 length of 23.3 Mb and an estimated BUSCO completeness of 97.9%. We predicted 33,791 gene models and identified 16,056 common orthologs between 'Chambourcin', 'PN40024' 12X.v2, VCOST.v3, Shine Muscat and Gloire. We found 1,606 plant transcription factors from 58 gene families. Finally, we identified 304,571 simple sequence repeats (up to six base pairs long). Our work provides the genome assembly, annotation and the protein and coding sequences of 'Chambourcin'. Our genome assembly is a valuable resource for genome comparisons, functional genomic analyses and genome-assisted breeding research.
“尚博辛”是一种美法种间杂交葡萄,种植于美国东部和中西部,用于酿酒。像“尚博辛”这样的杂交葡萄可用的基因组资源很少。在这里,我们使用PacBio HiFi长读长测序、Bionano光学图谱和Illumina短读长测序技术组装了“尚博辛”的基因组。我们为“尚博辛”生成了一个由26个支架组成的组装体,N50长度为23.3 Mb,估计BUSCO完整性为97.9%。我们预测了33791个基因模型,并在“尚博辛”、“PN40024”12X.v2、VCOST.v3、阳光玫瑰和光辉等品种之间鉴定出16056个共同直系同源基因。我们从58个基因家族中发现了1606个植物转录因子。最后,我们鉴定出304571个简单序列重复序列(最长六个碱基对)。我们的工作提供了“尚博辛”的基因组组装、注释以及蛋白质和编码序列。我们的基因组组装是进行基因组比较、功能基因组分析和基因组辅助育种研究的宝贵资源。