Suppr超能文献

葡萄基因组的改良参考再次证实了 PN40024 高度纯合基因型的起源。

An improved reference of the grapevine genome reasserts the origin of the PN40024 highly homozygous genotype.

机构信息

SVQV, INRAE-University of Strasbourg, Colmar 68000, France.

Genetics and Genomics of Plants, CeBiTec & Faculty of Biology, Bielefeld University, Bielefeld 33615, Germany.

出版信息

G3 (Bethesda). 2023 May 2;13(5). doi: 10.1093/g3journal/jkad067.

Abstract

The genome sequence of the diploid and highly homozygous Vitis vinifera genotype PN40024 serves as the reference for many grapevine studies. Despite several improvements to the PN40024 genome assembly, its current version PN12X.v2 is quite fragmented and only represents the haploid state of the genome with mixed haplotypes. In fact, being nearly homozygous, this genome contains several heterozygous regions that are yet to be resolved. Taking the opportunity of improvements that long-read sequencing technologies offer to fully discriminate haplotype sequences, an improved version of the reference, called PN40024.v4, was generated. Through incorporating long genomic sequencing reads to the assembly, the continuity of the 12X.v2 scaffolds was highly increased with a total number decreasing from 2,059 to 640 and a reduction in N bases of 88%. Additionally, the full alternative haplotype sequence was built for the first time, the chromosome anchoring was improved and the number of unplaced scaffolds was reduced by half. To obtain a high-quality gene annotation that outperforms previous versions, a liftover approach was complemented with an optimized annotation workflow for Vitis. Integration of the gene reference catalogue and its manual curation have also assisted in improving the annotation, while defining the most reliable estimation of 35,230 genes to date. Finally, we demonstrated that PN40024 resulted from 9 selfings of cv. "Helfensteiner" (cross of cv. "Pinot noir" and "Schiava grossa") instead of a single "Pinot noir". These advances will help maintain the PN40024 genome as a gold-standard reference, also contributing toward the eventual elaboration of the grapevine pangenome.

摘要

二倍体且高度纯合的葡萄基因型 PN40024 的基因组序列是许多葡萄研究的参考基因组。尽管对 PN40024 基因组组装进行了多次改进,但目前的版本 PN12X.v2 仍然非常碎片化,仅代表基因组的单倍体状态,具有混合的单倍型。事实上,由于近乎纯合,该基因组包含几个有待解决的杂合区域。利用长读测序技术提供的机会,可以完全区分单倍型序列,从而生成了参考基因组的一个改进版本,称为 PN40024.v4。通过将长基因组测序reads 整合到组装中,12X.v2 支架的连续性得到了极大提高,总数从 2059 个减少到 640 个,N 碱基减少了 88%。此外,首次构建了完整的替代单倍型序列,染色体锚定得到了改善,未定位支架的数量减少了一半。为了获得优于以前版本的高质量基因注释,采用了基因参考目录的移码方法,并针对 Vitis 优化了注释工作流程。基因参考目录的整合及其手动注释也有助于提高注释质量,同时确定了迄今为止最可靠的 35230 个基因的估计数量。最后,我们证明 PN40024 是由“Helfensteiner”(“Pinot noir”和“Schiava grossa”的杂交品种)的 9 次自交产生的,而不是由单一的“Pinot noir”产生的。这些进展将有助于将 PN40024 基因组保持为黄金标准参考基因组,也有助于葡萄泛基因组的最终构建。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b9f/10151409/077c91e7f85d/jkad067f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验