• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

解析基因组岛的结构特征:一种机器学习方法。

Resolving the structural features of genomic islands: a machine learning approach.

作者信息

Vernikos Georgios S, Parkhill Julian

机构信息

The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom.

出版信息

Genome Res. 2008 Feb;18(2):331-42. doi: 10.1101/gr.7004508. Epub 2007 Dec 10.

DOI:10.1101/gr.7004508
PMID:18071028
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2203631/
Abstract

Large inserts of horizontally acquired DNA that contain functionally related genes with limited phylogenetic distribution are often referred to as genomic islands (GIs), and structural definitions of these islands, based on common features, have been proposed. Although a large number of mobile elements fall well within the GI definition, there are several concerns about the structural consensus for GIs: The current GI definition was put forward 10 yr ago when only 12 complete bacterial genomes were available, a large number of GIs deviate from that definition, and in silico predictions assuming a full/partial GI structural model bias the sampling of the GI structural space toward "well-structured" GIs. In this study, the structural features of genomic regions are sampled by a hypothesis-free, bottom-up search, and these are exploited in a machine learning approach with the aim of explicitly quantifying and modeling the contribution of each feature to the GI structure. Performing a whole-genome-based comparative analysis between 37 strains of three different genera and 12 outgroup genomes, 668 genomic regions were sampled and used to train structural GI models. The data show that, overall, GIs from the three different genera fall into distinct, genus-specific structural families. However, decreasing the taxa resolution, by studying GI structures across different genus boundaries, provides models that converge on a fairly similar GI structure, further suggesting that GIs can be seen as a superfamily of mobile elements, with core and variable structural features, rather than a well-defined family.

摘要

水平获得的大片段DNA插入序列,包含系统发育分布有限但功能相关的基因,通常被称为基因组岛(GIs),基于共同特征对这些岛进行了结构定义。尽管大量的移动元件完全符合基因组岛的定义,但对于基因组岛的结构共识仍存在一些问题:当前的基因组岛定义是在10年前提出的,当时仅有12个完整的细菌基因组可供使用,大量的基因组岛偏离了该定义,并且基于全基因组/部分基因组岛结构模型的计算机预测使基因组岛结构空间的采样偏向于“结构良好”的基因组岛。在本研究中,通过无假设的自下而上搜索对基因组区域的结构特征进行采样,并将其应用于机器学习方法,旨在明确量化和建模每个特征对基因组岛结构的贡献。对三个不同属的37个菌株和12个外群基因组进行基于全基因组的比较分析,采样了668个基因组区域并用于训练基因组岛结构模型。数据表明,总体而言,来自三个不同属的基因组岛属于不同的、属特异性结构家族。然而,通过跨不同属边界研究基因组岛结构来降低分类分辨率,得到的模型趋于相当相似的基因组岛结构,这进一步表明基因组岛可被视为具有核心和可变结构特征的移动元件超家族,而非定义明确的家族。

相似文献

1
Resolving the structural features of genomic islands: a machine learning approach.解析基因组岛的结构特征:一种机器学习方法。
Genome Res. 2008 Feb;18(2):331-42. doi: 10.1101/gr.7004508. Epub 2007 Dec 10.
2
GI-Cluster: Detecting genomic islands via consensus clustering on multiple features.GI-Cluster:通过对多个特征进行一致性聚类来检测基因组岛
J Bioinform Comput Biol. 2018 Jun;16(3):1840010. doi: 10.1142/S0219720018400103. Epub 2018 Feb 4.
3
DICEP: An integrative approach to augmenting genomic island detection.DICEP:一种增强基因组岛检测的综合方法。
J Biotechnol. 2024 Jun 10;388:49-58. doi: 10.1016/j.jbiotec.2024.04.011. Epub 2024 Apr 17.
4
Classification of genomic islands using decision trees and their ensemble algorithms.基于决策树及其集成算法的基因组岛分类。
BMC Genomics. 2010 Nov 2;11 Suppl 2(Suppl 2):S1. doi: 10.1186/1471-2164-11-S2-S1.
5
Patterns and architecture of genomic islands in marine bacteria.海洋细菌中基因组岛的模式和结构。
BMC Genomics. 2012 Jul 29;13:347. doi: 10.1186/1471-2164-13-347.
6
Identification of phage-induced genomic islands in the 13 Streptococcus pyogenes strains using genome barcodes.使用基因组条形码鉴定13株化脓性链球菌中的噬菌体诱导基因组岛。
Int J Data Min Bioinform. 2014;10(3):269-84. doi: 10.1504/ijdmb.2014.064524.
7
Enabling genomic island prediction and comparison in multiple genomes to investigate bacterial evolution and outbreaks.实现多个基因组中的基因组岛预测和比较,以研究细菌进化和爆发。
Microb Genom. 2022 May;8(5). doi: 10.1099/mgen.0.000818.
8
Co-evolution of genomic islands and their bacterial hosts revealed through phylogenetic analyses of 17 groups of homologous genomic islands.通过对17组同源基因组岛的系统发育分析揭示基因组岛与其细菌宿主的共同进化
Genet Mol Res. 2012 Oct 15;11(4):3735-43. doi: 10.4238/2012.October.15.5.
9
High-density transcriptional initiation signals underline genomic islands in bacteria.高密度转录起始信号突出了细菌中的基因组岛。
PLoS One. 2012;7(3):e33759. doi: 10.1371/journal.pone.0033759. Epub 2012 Mar 20.
10
A computational approach for identifying pathogenicity islands in prokaryotic genomes.一种用于识别原核生物基因组中致病岛的计算方法。
BMC Bioinformatics. 2005 Jul 21;6:184. doi: 10.1186/1471-2105-6-184.

引用本文的文献

1
Current state and future prospects of Horizontal Gene Transfer detection.水平基因转移检测的现状与未来展望
NAR Genom Bioinform. 2025 Feb 11;7(1):lqaf005. doi: 10.1093/nargab/lqaf005. eCollection 2025 Mar.
2
Evolution of the pheV-tRNA integrated genomic island in Escherichia coli.大肠杆菌中 pheV-tRNA 整合基因组岛的进化。
PLoS Genet. 2024 Oct 24;20(10):e1011459. doi: 10.1371/journal.pgen.1011459. eCollection 2024 Oct.
3
Investigating the nature of prokaryotic genomic island locations within a genome.研究原核基因组岛在基因组中的位置的性质。
PLoS One. 2024 May 2;19(5):e0301172. doi: 10.1371/journal.pone.0301172. eCollection 2024.
4
Current Methods for Recombination Detection in Bacteria.当前细菌重组检测方法。
Int J Mol Sci. 2022 Jun 2;23(11):6257. doi: 10.3390/ijms23116257.
5
Genomic Features and Construction of Streamlined Genome Chassis of Nisin Z Producer N8.乳酸链球菌素Z产生菌N8的基因组特征及简化基因组底盘的构建
Microorganisms. 2021 Dec 27;10(1):47. doi: 10.3390/microorganisms10010047.
6
Genomic Island Prediction via Chi-Square Test and Random Forest Algorithm.基于卡方检验和随机森林算法的基因组岛预测。
Comput Math Methods Med. 2021 May 24;2021:9969751. doi: 10.1155/2021/9969751. eCollection 2021.
7
Identifying genomic islands with deep neural networks.利用深度神经网络识别基因组岛。
BMC Genomics. 2021 Jun 2;22(Suppl 3):281. doi: 10.1186/s12864-021-07575-5.
8
Engineering of a genome-reduced strain Bacillus amyloliquefaciens for enhancing surfactin production.工程菌枯草芽孢杆菌的基因组减少菌株增强表面活性剂生产。
Microb Cell Fact. 2020 Dec 7;19(1):223. doi: 10.1186/s12934-020-01485-z.
9
Genomics: Macro and Micro Evolution.基因组学:宏观与微观进化。
Int J Mol Sci. 2020 Oct 20;21(20):7749. doi: 10.3390/ijms21207749.
10
Experimental approaches to tracking mobile genetic elements in microbial communities.追踪微生物群落中移动遗传元件的实验方法。
FEMS Microbiol Rev. 2020 Sep 1;44(5):606-630. doi: 10.1093/femsre/fuaa025.

本文引用的文献

1
Molecular correlates of host specialization in Staphylococcus aureus.金黄色葡萄球菌宿主特异性的分子关联
PLoS One. 2007 Oct 31;2(10):e1120. doi: 10.1371/journal.pone.0001120.
2
The SPI-2 type III secretion system restricts motility of Salmonella-containing vacuoles.SPI-2 III型分泌系统限制含沙门氏菌液泡的运动性。
Cell Microbiol. 2007 Oct;9(10):2517-29. doi: 10.1111/j.1462-5822.2007.00977.x. Epub 2007 Jun 7.
3
Genetic flux over time in the Salmonella lineage.沙门氏菌谱系随时间的基因通量。
Genome Biol. 2007;8(6):R100. doi: 10.1186/gb-2007-8-6-r100.
4
In vivo acquisition of prophage in Streptococcus pyogenes.化脓性链球菌中噬菌体的体内获得
Trends Microbiol. 2007 Jul;15(7):297-300. doi: 10.1016/j.tim.2007.05.003. Epub 2007 May 31.
5
The SaPIs: mobile pathogenicity islands of Staphylococcus.葡萄球菌辅助噬菌体整合位点(SaPIs):葡萄球菌的移动致病岛
Chem Immunol Allergy. 2007;93:42-57. doi: 10.1159/000100857.
6
A machine learning strategy to identify candidate binding sites in human protein-coding sequence.一种用于识别人类蛋白质编码序列中候选结合位点的机器学习策略。
BMC Bioinformatics. 2006 Sep 26;7:419. doi: 10.1186/1471-2105-7-419.
7
Vertebrate gene finding from multiple-species alignments using a two-level strategy.使用两级策略从多物种比对中寻找脊椎动物基因。
Genome Biol. 2006;7 Suppl 1(Suppl 1):S6.1-12. doi: 10.1186/gb-2006-7-s1-s6. Epub 2006 Aug 7.
8
Interpolated variable order motifs for identification of horizontally acquired DNA: revisiting the Salmonella pathogenicity islands.用于鉴定水平获得性DNA的内插可变顺序基序:重新审视沙门氏菌致病岛
Bioinformatics. 2006 Sep 15;22(18):2196-203. doi: 10.1093/bioinformatics/btl369. Epub 2006 Jul 12.
9
Identification of novel genomic islands coding for antigenic pilus-like structures in Streptococcus agalactiae.无乳链球菌中编码抗原性菌毛样结构的新型基因组岛的鉴定。
Mol Microbiol. 2006 Jul;61(1):126-41. doi: 10.1111/j.1365-2958.2006.05225.x.
10
Dispensable genes and foreign DNA in Streptococcus mutans.变形链球菌中的非必需基因和外源DNA。
Microbiology (Reading). 2006 Jun;152(Pt 6):1777-1788. doi: 10.1099/mic.0.28647-0.