Suppr超能文献

利用组合短读长读测序对油棕基部茎腐病病原菌波氏角菌进行全基因组测序。

Whole-genome sequencing of Ganoderma boninense, the causal agent of basal stem rot disease in oil palm, via combined short- and long-read sequencing.

机构信息

Department of Biotechnology, PT SMART Tbk, Bogor, 16810, Indonesia.

Section of Bioinformatics, PT SMART Tbk, Bogor, 16810, Indonesia.

出版信息

Sci Rep. 2024 May 8;14(1):10520. doi: 10.1038/s41598-024-60713-3.

Abstract

The hemibiotrophic Basidiomycete pathogen Ganoderma boninense (Gb) is the dominant causal agent of oil palm basal stem rot disease. Here, we report a complete chromosomal genome map of Gb using a combination of short-read Illumina and long-read Pacific Biosciences (PacBio) sequencing platforms combined with chromatin conformation capture data from the Chicago and Hi-C platforms. The genome was 55.87 Mb in length and assembled to a high contiguity (N50: 304.34 kb) of 12 chromosomes built from 112 scaffolds, with a total of only 4.34 Mb (~ 7.77%) remaining unplaced. The final assemblies were evaluated for completeness of the genome by using Benchmarking Universal Single Copy Orthologs (BUSCO) v4.1.4, and based on 4464 total BUSCO polyporales group searches, the assemblies yielded 4264 (95.52%) of the conserved orthologs as complete and only a few fragmented BUSCO of 42 (0.94%) as well as a missing BUSCO of 158 (3.53%). Genome annotation predicted a total of 21,074 coding genes, with a GC content ratio of 59.2%. The genome features were analyzed with different databases, which revealed 2471 Gene Ontology/GO (11.72%), 5418 KEGG (Kyoto Encyclopedia of Genes and Genomes) Orthologous/KO (25.71%), 13,913 Cluster of Orthologous Groups of proteins/COG (66.02%), 60 ABC transporter (0.28%), 1049 Carbohydrate-Active Enzymes/CAZy (4.98%), 4005 pathogen-host interactions/PHI (19%), and 515 fungal transcription factor/FTFD (2.44%) genes. The results obtained in this study provide deep insight for further studies in the future.

摘要

黄韧壳菌(Ganoderma boninense)是一种兼性半活体的担子菌病原体,是油棕基部腐烂病的主要致病因子。在这里,我们报告了黄韧壳菌的完整染色体基因组图谱,该图谱是使用短读长 Illumina 和长读长 Pacific Biosciences(PacBio)测序平台以及来自芝加哥和 Hi-C 平台的染色质构象捕获数据组合而成的。基因组长度为 55.87Mb,组装到了很高的连续性(N50:304.34kb),由 112 个支架构建的 12 条染色体组成,总共只有 4.34Mb(约 7.77%)未定位。最终的组装通过使用基准通用单拷贝同源物(BUSCO)v4.1.4 来评估基因组的完整性,基于 4464 个总 BUSCO 多孔菌目搜索,组装体产生了 4264 个(95.52%)完整的保守同源物,只有少数 42 个(0.94%)碎片化的 BUSCO 和 158 个(3.53%)缺失的 BUSCO。基因组注释预测了总共 21074 个编码基因,GC 含量比为 59.2%。通过不同的数据库对基因组特征进行了分析,揭示了 2471 个基因本体论/GO(11.72%)、5418 个京都基因与基因组百科全书/KO(25.71%)、13913 个蛋白质同源群/COG(66.02%)、60 个 ABC 转运蛋白(0.28%)、1049 个碳水化合物活性酶/CAZy(4.98%)、4005 个病原体-宿主相互作用/PHI(19%)和 515 个真菌转录因子/FTFD(2.44%)基因。本研究的结果为今后的进一步研究提供了深入的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc78/11076493/4e1d23d0a161/41598_2024_60713_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验