• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

来自南极洲的39种稀有放线菌的全基因组序列和注释数据集。

Whole genome sequence and annotation dataset of rare actinobacteria, 39 from Antarctica.

作者信息

Chong Sin Yee, Azmi Aida Azrina, Cheah Yoke Kqueen

机构信息

Unit of Molecular Biology and Bioinformatics, Department of Biomedical Science, Faculty of Medicine and Health Sciences, Universiti Putra Malaysia, 43400 UPM Serdang, Selangor Darul Ehsan, Malaysia.

Halal Science Research, Halal Products Research Institute, Universiti Putra Malaysia, 43400 UPM Serdang, Selangor Darul Ehsan, Malaysia.

出版信息

Data Brief. 2023 Oct 12;51:109657. doi: 10.1016/j.dib.2023.109657. eCollection 2023 Dec.

DOI:10.1016/j.dib.2023.109657
PMID:37876741
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10590835/
Abstract

39 is a rare actinobacteria strain isolated from the less explored extreme environment of the Antarctic soil. Here, we present the whole genome sequencing and annotation data from the high-quality draft genome of from Antarctica. The extracted genomic deoxyribonucleic acid (DNA) was sequenced using the PacBio Sequel sequencing platform, followed by the Illumina HiSeq sequencing system. Subsequently, the assembly data from Canu 1.7 and Pilon were subjected to bioinformatics analysis for genome annotation to analyze the entire genomic information of the sequences. Different bioinformatics analysis approaches were used to disclose a high-quality draft genome basis for and provided a better understanding of its biological and molecular functions. Note that 83,639 reads were predicted from its 3.6Mb genome size, with a guanine-cytosine content (GC) content of 72.39%. The genome was assembled into two contigs, where the larger contig represents the chromosome and the smaller contig represents the plasmid. It is composed of 3,381 coding genes, with about 95% of them being functionally annotated. It consists of 3,318 coding sequences, one tmRNA gene, 57 tRNA genes, and five repeated regions. was evident, sharing a close sequence similarity with the species and the family . Gene Ontology (GO) functional classification indicated cell and cell parts were highly represented among the cellular component category; catalytic activity and binding were the most enriched processes within the molecular function category; metabolic and cellular processes were the most represented in the biological process category. Clusters of Orthologous Group (COG) functional classification revealed metabolism-related genes were highly enriched and mostly mapped to amino acid transport metabolism, transcription, energy production, and conversion. Moreover, the Kyoto Encyclopedia of Genes and Genomes (KEGG) functional classification reported that the metabolism process was the most represented KEGG pathway. There were 52 biosynthetic gene clusters involved in secondary metabolites biosynthesis, indicating has antibacterial, antifungal, cytotoxic, and inhibitor bioactivities. The dataset of the whole-genome sequence of has been deposited in the European Nucleotide Archive (ENA) repository under the accession number PRJEB44986 / ERP129097. The dataset of the genome annotation of had been deposited in Zenodo. The reported genomic sequence data for contributes comprehensive data to the current molecular information of the species, serving as a significant approach that facilitates the advancement of medicine.

摘要

39是从探索较少的南极土壤极端环境中分离出的一种罕见放线菌菌株。在此,我们展示了来自南极洲该菌株高质量草图基因组的全基因组测序和注释数据。提取的基因组脱氧核糖核酸(DNA)使用PacBio Sequel测序平台进行测序,随后使用Illumina HiSeq测序系统。随后,对来自Canu 1.7和Pilon的组装数据进行生物信息学分析以进行基因组注释,从而分析序列的整个基因组信息。使用了不同的生物信息学分析方法来揭示该菌株高质量草图基因组的基础,并更好地了解其生物学和分子功能。请注意,从其3.6Mb的基因组大小预测出83,639条 reads,鸟嘌呤 - 胞嘧啶含量(GC)为72.39%。基因组被组装成两个重叠群,其中较大的重叠群代表染色体,较小的重叠群代表质粒。它由3381个编码基因组成,其中约95%在功能上已注释。它由3318个编码序列、一个tmRNA基因、57个tRNA基因和五个重复区域组成。该菌株很明显,与[具体物种]和[具体科]的物种具有密切的序列相似性。基因本体论(GO)功能分类表明,在细胞成分类别中,细胞和细胞部分的代表性很高;在分子功能类别中,催化活性和结合是最丰富的过程;在生物过程类别中,代谢和细胞过程的代表性最高。直系同源群(COG)功能分类显示,与代谢相关的基因高度富集,主要映射到氨基酸转运代谢、转录、能量产生和转换。此外,京都基因与基因组百科全书(KEGG)功能分类报告称,代谢过程是最具代表性的KEGG途径。有52个生物合成基因簇参与次生代谢物生物合成,表明该菌株具有抗菌、抗真菌、细胞毒性和抑制剂生物活性。该菌株全基因组序列数据集已存入欧洲核苷酸档案库(ENA),登录号为PRJEB44986 / ERP129097。该菌株基因组注释数据集已存入Zenodo。所报告的该菌株基因组序列数据为该物种当前的分子信息提供了全面的数据,是促进医学进步的重要途径。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40cb/10590835/8bd2fbab7d54/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40cb/10590835/af94276394c7/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40cb/10590835/7ab2040a59db/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40cb/10590835/8bd2fbab7d54/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40cb/10590835/af94276394c7/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40cb/10590835/7ab2040a59db/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40cb/10590835/8bd2fbab7d54/gr3.jpg

相似文献

1
Whole genome sequence and annotation dataset of rare actinobacteria, 39 from Antarctica.来自南极洲的39种稀有放线菌的全基因组序列和注释数据集。
Data Brief. 2023 Oct 12;51:109657. doi: 10.1016/j.dib.2023.109657. eCollection 2023 Dec.
2
Barrientosiimonas humi gen. nov., sp. nov., an actinobacterium of the family Dermacoccaceae.棒杆菌样海栖菌属,新属,动胶菌科的放线菌。
Int J Syst Evol Microbiol. 2013 Jan;63(Pt 1):241-248. doi: 10.1099/ijs.0.038232-0. Epub 2012 Mar 2.
3
Barrientosiimonas endolithica sp. nov., isolated from pebbles, reclassification of the only species of the genus Tamlicoccus, Tamlicoccus marinus Lee 2013, as Barrientosiimonas marina comb. nov. and emended description of the genus Barrientosiimonas.从卵石中分离出的新物种巴伦蒂西莫纳斯内石生菌,将塔姆利球菌属的唯一物种——2013年李发现的海生塔姆利球菌重新分类为滨海巴伦蒂西莫纳斯新组合,以及对巴伦蒂西莫纳斯属的修订描述。
Int J Syst Evol Microbiol. 2015 Sep;65(9):3031-3036. doi: 10.1099/ijs.0.000374. Epub 2015 Jun 8.
4
Allobranchiibius huperziae gen. nov., sp. nov., a member of Dermacoccaceae isolated from the root of a medicinal plant Huperzia serrata (Thunb.).蛇足石杉异枝球菌,新属,新种,是从药用植物蛇足石杉(Thunb.)根部分离得到的皮肤球菌科成员。
Int J Syst Evol Microbiol. 2017 Oct;67(10):4210-4215. doi: 10.1099/ijsem.0.002284. Epub 2017 Sep 18.
5
Draft genome sequence of type strain HBR26 and description of sp. nov.模式菌株HBR26的基因组草图序列及新种描述
Stand Genomic Sci. 2017 Jan 26;12:14. doi: 10.1186/s40793-017-0220-z. eCollection 2017.
6
Data on genome annotation and analysis of earthworm .蚯蚓的基因组注释与分析数据。
Data Brief. 2018 Aug 29;20:525-534. doi: 10.1016/j.dib.2018.08.067. eCollection 2018 Oct.
7
Genome sequencing of the winged midge, Parochlus steinenii, from the Antarctic Peninsula.对来自南极半岛的有翅蠓类昆虫——施氏南极蠓进行基因组测序。
Gigascience. 2017 Mar 1;6(3):1-8. doi: 10.1093/gigascience/giw009.
8
gen. nov., sp. nov., a novel bacterium of the family isolated from soil of a farming field.属名. 新种,一种从农田土壤中分离得到的新菌科细菌。
Int J Syst Evol Microbiol. 2020 Sep;70(9):5123-5130. doi: 10.1099/ijsem.0.004397.
9
Exploring drought stress-regulated genes in senna (Cassia angustifolia Vahl.): a transcriptomic approach.探索番泻叶(狭叶决明)中干旱胁迫调控基因:一种转录组学方法。
Funct Integr Genomics. 2017 Jan;17(1):1-25. doi: 10.1007/s10142-016-0523-y. Epub 2016 Oct 5.
10

本文引用的文献

1
A deep learning genome-mining strategy for biosynthetic gene cluster prediction.深度学习基因组挖掘策略用于生物合成基因簇预测。
Nucleic Acids Res. 2019 Oct 10;47(18):e110. doi: 10.1093/nar/gkz654.
2
antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline.antiSMASH 5.0:二次代谢产物基因组挖掘管道的更新。
Nucleic Acids Res. 2019 Jul 2;47(W1):W81-W87. doi: 10.1093/nar/gkz310.
3
eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses.
eggNOG 5.0:一个基于 5090 个生物体和 2502 种病毒的层次化、功能和系统发育注释的同源资源。
Nucleic Acids Res. 2019 Jan 8;47(D1):D309-D314. doi: 10.1093/nar/gky1085.
4
BAGEL4: a user-friendly web server to thoroughly mine RiPPs and bacteriocins.BAGEL4:一个用户友好的网络服务器,用于彻底挖掘 RiPPs 和细菌素。
Nucleic Acids Res. 2018 Jul 2;46(W1):W278-W281. doi: 10.1093/nar/gky383.
5
methods for linking genes and secondary metabolites: The way forward.连接基因与次生代谢产物的方法:前进的道路。
Synth Syst Biotechnol. 2016 Apr 1;1(2):80-88. doi: 10.1016/j.synbio.2016.03.001. eCollection 2016 Jun.
6
Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper.通过eggNOG-Mapper进行直系同源物分配实现全基因组快速功能注释
Mol Biol Evol. 2017 Aug 1;34(8):2115-2122. doi: 10.1093/molbev/msx148.
7
Canu: scalable and accurate long-read assembly via adaptive -mer weighting and repeat separation.Canu:通过自适应k-mer加权和重复序列分离实现可扩展且准确的长读长序列拼接
Genome Res. 2017 May;27(5):722-736. doi: 10.1101/gr.215087.116. Epub 2017 Mar 15.
8
PacBio Sequencing and Its Applications.PacBio测序技术及其应用。
Genomics Proteomics Bioinformatics. 2015 Oct;13(5):278-89. doi: 10.1016/j.gpb.2015.08.002. Epub 2015 Nov 2.
9
Barrientosiimonas endolithica sp. nov., isolated from pebbles, reclassification of the only species of the genus Tamlicoccus, Tamlicoccus marinus Lee 2013, as Barrientosiimonas marina comb. nov. and emended description of the genus Barrientosiimonas.从卵石中分离出的新物种巴伦蒂西莫纳斯内石生菌,将塔姆利球菌属的唯一物种——2013年李发现的海生塔姆利球菌重新分类为滨海巴伦蒂西莫纳斯新组合,以及对巴伦蒂西莫纳斯属的修订描述。
Int J Syst Evol Microbiol. 2015 Sep;65(9):3031-3036. doi: 10.1099/ijs.0.000374. Epub 2015 Jun 8.
10
BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs.BUSCO:利用单拷贝同源基因评估基因组组装和注释的完整性。
Bioinformatics. 2015 Oct 1;31(19):3210-2. doi: 10.1093/bioinformatics/btv351. Epub 2015 Jun 9.