Suppr超能文献

通过人工注释更新黄瓜(9930)基因组的基因预测

Updated Gene Prediction of the Cucumber (9930) Genome through Manual Annotation.

作者信息

Du Weixuan, Xia Lei, Li Rui, Zhao Xiaokun, Jin Danna, Wang Xiaoning, Pei Yun, Zhou Rong, Chen Jinfeng, Yu Xiaqing

机构信息

State Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, No. 1 Weigang, Nanjing 210095, China.

College of Agriculture, Guizhou University, Guiyang 550025, China.

出版信息

Plants (Basel). 2024 Jun 9;13(12):1604. doi: 10.3390/plants13121604.

Abstract

Thorough and precise gene structure annotations are essential for maximizing the benefits of genomic data and unveiling valuable genetic insights. The cucumber genome was first released in 2009 and updated in 2019. To increase the accuracy of the predicted gene models, 64 published RNA-seq data and 9 new strand-specific RNA-seq data from multiple tissues were used for manual comparison with the gene models. The updated annotation file (V3.1) contains an increased number (24,145) of predicted genes compared to the previous version (24,317 genes), with a higher BUSCO value of 96.9%. A total of 6231 and 1490 transcripts were adjusted and newly added, respectively, accounting for 31.99% of the overall gene tally. These newly added and adjusted genes were renamed (CsaV3.1_XGXXXXX), while genes remaining unaltered preserved their original designations. A random selection of 21 modified/added genes were validated using RT-PCR analyses. Additionally, tissue-specific patterns of gene expression were examined using the newly obtained transcriptome data with the revised gene prediction model. This improved annotation of the cucumber genome will provide essential and accurate resources for studies in cucumber.

摘要

全面而精确的基因结构注释对于最大化基因组数据的益处和揭示有价值的遗传见解至关重要。黄瓜基因组于2009年首次发布,并于2019年更新。为提高预测基因模型的准确性,使用了64个已发表的RNA测序数据和来自多个组织的9个新的链特异性RNA测序数据与基因模型进行人工比对。与上一版本(24317个基因)相比,更新后的注释文件(V3.1)预测基因数量增加(24145个),BUSCO值更高,为96.9%。分别调整和新添加了6231个和1490个转录本,占基因总数的31.99%。这些新添加和调整的基因被重新命名(CsaV3.1_XGXXXXX),而未改变的基因保留其原始名称。使用RT-PCR分析对随机选择的21个修改/添加的基因进行了验证。此外,利用新获得的转录组数据和修订后的基因预测模型检查了基因表达的组织特异性模式。黄瓜基因组的这一改进注释将为黄瓜研究提供重要且准确的资源。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47d0/11207753/d4e3895a666f/plants-13-01604-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验