• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用Gecko 3寻找近似基因簇。

Finding approximate gene clusters with Gecko 3.

作者信息

Winter Sascha, Jahn Katharina, Wehner Stefanie, Kuchenbecker Leon, Marz Manja, Stoye Jens, Böcker Sebastian

机构信息

Chair for Bioinformatics, Institute for Computer Science, Friedrich-Schiller-University Jena, Jena, Germany.

Genome Informatics, Faculty of Technology and Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, Germany.

出版信息

Nucleic Acids Res. 2016 Nov 16;44(20):9600-9610. doi: 10.1093/nar/gkw843. Epub 2016 Sep 26.

DOI:10.1093/nar/gkw843
PMID:27679480
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5175365/
Abstract

Gene-order-based comparison of multiple genomes provides signals for functional analysis of genes and the evolutionary process of genome organization. Gene clusters are regions of co-localized genes on genomes of different species. The rapid increase in sequenced genomes necessitates bioinformatics tools for finding gene clusters in hundreds of genomes. Existing tools are often restricted to few (in many cases, only two) genomes, and often make restrictive assumptions such as short perfect conservation, conserved gene order or monophyletic gene clusters. We present Gecko 3, an open-source software for finding gene clusters in hundreds of bacterial genomes, that comes with an easy-to-use graphical user interface. The underlying gene cluster model is intuitive, can cope with low degrees of conservation as well as misannotations and is complemented by a sound statistical evaluation. To evaluate the biological benefit of Gecko 3 and to exemplify our method, we search for gene clusters in a dataset of 678 bacterial genomes using Synechocystis sp. PCC 6803 as a reference. We confirm detected gene clusters reviewing the literature and comparing them to a database of operons; we detect two novel clusters, which were confirmed by publicly available experimental RNA-Seq data. The computational analysis is carried out on a laptop computer in <40 min.

摘要

基于基因顺序的多个基因组比较为基因功能分析和基因组组织的进化过程提供了线索。基因簇是不同物种基因组上共定位基因的区域。测序基因组的快速增加需要生物信息学工具来在数百个基因组中寻找基因簇。现有工具通常局限于少数(在许多情况下,仅两个)基因组,并且常常做出诸如短完美保守、保守基因顺序或单系基因簇等限制性假设。我们展示了Gecko 3,这是一款用于在数百个细菌基因组中寻找基因簇的开源软件,它带有易于使用的图形用户界面。其底层的基因簇模型直观,能够应对低保守度以及错误注释,并辅以合理的统计评估。为了评估Gecko 3的生物学益处并举例说明我们的方法,我们以集胞藻属PCC 6803作为参考,在一个包含678个细菌基因组的数据集里寻找基因簇。我们通过查阅文献并将检测到的基因簇与操纵子数据库进行比较来确认它们;我们检测到两个新的簇,它们被公开可用的实验RNA测序数据所证实。在一台笔记本电脑上进行计算分析用时不到40分钟。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8d4/5175365/1d9c19a0dacc/gkw843fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8d4/5175365/e34854ef0fa8/gkw843fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8d4/5175365/c4e96ed2ee2c/gkw843fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8d4/5175365/bc041d3d3e27/gkw843fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8d4/5175365/1d9c19a0dacc/gkw843fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8d4/5175365/e34854ef0fa8/gkw843fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8d4/5175365/c4e96ed2ee2c/gkw843fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8d4/5175365/bc041d3d3e27/gkw843fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8d4/5175365/1d9c19a0dacc/gkw843fig5.jpg

相似文献

1
Finding approximate gene clusters with Gecko 3.使用Gecko 3寻找近似基因簇。
Nucleic Acids Res. 2016 Nov 16;44(20):9600-9610. doi: 10.1093/nar/gkw843. Epub 2016 Sep 26.
2
Detecting gene clusters under evolutionary constraint in a large number of genomes.在大量基因组中检测处于进化约束下的基因簇。
Bioinformatics. 2009 Mar 1;25(5):571-7. doi: 10.1093/bioinformatics/btp027. Epub 2009 Jan 21.
3
Automatic detection of conserved gene clusters in multiple genomes by graph comparison and P-quasi grouping.通过图形比较和P-拟分组自动检测多个基因组中的保守基因簇。
Nucleic Acids Res. 2000 Oct 15;28(20):4029-36. doi: 10.1093/nar/28.20.4029.
4
Computational workflow for analysis of gain and loss of genes in distantly related genomes.计算工作流程,用于分析远缘基因组中基因的增益和缺失。
BMC Bioinformatics. 2012;13 Suppl 15(Suppl 15):S5. doi: 10.1186/1471-2105-13-S15-S5. Epub 2012 Sep 11.
5
GRAST: a new way of genome reduction analysis using comparative genomics.GRAST:一种利用比较基因组学进行基因组简化分析的新方法。
Bioinformatics. 2006 Jul 1;22(13):1551-61. doi: 10.1093/bioinformatics/btl139. Epub 2006 Apr 6.
6
panX: pan-genome analysis and exploration.panX:泛基因组分析与探索。
Nucleic Acids Res. 2018 Jan 9;46(1):e5. doi: 10.1093/nar/gkx977.
7
Genome alignment, evolution of prokaryotic genome organization, and prediction of gene function using genomic context.基因组比对、原核生物基因组组织的进化以及利用基因组背景预测基因功能。
Genome Res. 2001 Mar;11(3):356-72. doi: 10.1101/gr.gr-1619r.
8
Identification of Protein Secretion Systems in Bacterial Genomes Using MacSyFinder.使用MacSyFinder鉴定细菌基因组中的蛋白质分泌系统
Methods Mol Biol. 2017;1615:1-21. doi: 10.1007/978-1-4939-7033-9_1.
9
FLAGdb: A Bioinformatic Environment to Study and Compare Plant Genomes.FLAGdb:一个用于研究和比较植物基因组的生物信息学环境。
Methods Mol Biol. 2017;1533:79-101. doi: 10.1007/978-1-4939-6658-5_4.
10
The ExAC browser: displaying reference data information from over 60 000 exomes.ExAC浏览器:展示来自6万多个外显子组的参考数据信息。
Nucleic Acids Res. 2017 Jan 4;45(D1):D840-D845. doi: 10.1093/nar/gkw971. Epub 2016 Nov 28.

引用本文的文献

1
CLOCI: unveiling cryptic fungal gene clusters with generalized detection.CLOCI:利用广义检测揭示隐匿真菌基因簇。
Nucleic Acids Res. 2024 Sep 9;52(16):e75. doi: 10.1093/nar/gkae625.
2
New algorithms for structure informed genome rearrangement.用于结构信息基因组重排的新算法。
Algorithms Mol Biol. 2023 Dec 1;18(1):17. doi: 10.1186/s13015-023-00239-x.
3
SYNPHONI: scale-free and phylogeny-aware reconstruction of synteny conservation and transformation across animal genomes.SYNPHONI:一种跨动物基因组进行同线性保守和转化的无尺度和系统发生感知重建方法。

本文引用的文献

1
Carboxysome genomics: a status report.羧酶体基因组学:现状报告。
Funct Plant Biol. 2002 Apr;29(3):175-182. doi: 10.1071/PP01200.
2
Identifying gene clusters by discovering common intervals in indeterminate strings.通过在不确定字符串中发现公共区间来识别基因簇。
BMC Genomics. 2014;15 Suppl 6(Suppl 6):S2. doi: 10.1186/1471-2164-15-S6-S2. Epub 2014 Oct 17.
3
Gene expansion shapes genome architecture in the human pathogen Lichtheimia corymbifera: an evolutionary genomics analysis in the ancient terrestrial mucorales (Mucoromycotina).
Bioinformatics. 2022 Dec 13;38(24):5434-5436. doi: 10.1093/bioinformatics/btac695.
4
Core circadian clock and light signaling genes brought into genetic linkage across the green lineage.核心生物钟和光信号基因在绿色谱系中发生遗传连锁。
Plant Physiol. 2022 Sep 28;190(2):1037-1056. doi: 10.1093/plphys/kiac276.
5
Approximate search for known gene clusters in new genomes using PQ-trees.使用PQ树在新基因组中近似搜索已知基因簇。
Algorithms Mol Biol. 2021 Jul 9;16(1):16. doi: 10.1186/s13015-021-00190-9.
6
Discovery of multi-operon colinear syntenic blocks in microbial genomes.微生物基因组中多操纵子共线性协同模块的发现。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i21-i29. doi: 10.1093/bioinformatics/btaa503.
7
Analysis of local genome rearrangement improves resolution of ancestral genomic maps in plants.分析局部基因组重排可提高植物祖先基因组图谱的分辨率。
BMC Genomics. 2020 Apr 16;21(Suppl 2):273. doi: 10.1186/s12864-020-6609-x.
8
Comparative genome characterization of the periodontal pathogen Tannerella forsythia.福赛坦纳氏菌的比较基因组特征分析。
BMC Genomics. 2020 Feb 11;21(1):150. doi: 10.1186/s12864-020-6535-y.
9
EvolClust: automated inference of evolutionary conserved gene clusters in eukaryotes.EvolClust:真核生物中进化保守基因簇的自动推断。
Bioinformatics. 2020 Feb 15;36(4):1265-1266. doi: 10.1093/bioinformatics/btz706.
10
Evolutionary and functional patterns of shared gene neighbourhood in fungi.真菌中共享基因邻域的进化和功能模式。
Nat Microbiol. 2019 Dec;4(12):2383-2392. doi: 10.1038/s41564-019-0552-0. Epub 2019 Sep 16.
基因扩增塑造了人类病原体伞枝犁头霉的基因组结构:对古老陆生毛霉目(毛霉亚门)的进化基因组学分析
PLoS Genet. 2014 Aug 14;10(8):e1004496. doi: 10.1371/journal.pgen.1004496. eCollection 2014 Aug.
4
Comparative analysis of the primary transcriptome of Synechocystis sp. PCC 6803.聚球藻属PCC 6803初级转录组的比较分析。
DNA Res. 2014 Oct;21(5):527-39. doi: 10.1093/dnares/dsu018. Epub 2014 Jun 16.
5
Transcript mapping based on dRNA-seq data.基于 dRNA-seq 数据的转录本映射。
BMC Bioinformatics. 2014 Apr 29;15:122. doi: 10.1186/1471-2105-15-122.
6
Statistics for approximate gene clusters.近似基因簇的统计
BMC Bioinformatics. 2013;14 Suppl 15(Suppl 15):S14. doi: 10.1186/1471-2105-14-S15-S14. Epub 2013 Dec 13.
7
Comparative genome analysis of the closely related Synechocystis strains PCC 6714 and PCC 6803.近缘集胞藻菌株PCC 6714和PCC 6803的比较基因组分析。
DNA Res. 2014 Jun;21(3):255-66. doi: 10.1093/dnares/dst055. Epub 2014 Jan 9.
8
RefSeq: an update on mammalian reference sequences.RefSeq:哺乳动物参考序列的更新。
Nucleic Acids Res. 2014 Jan;42(Database issue):D756-63. doi: 10.1093/nar/gkt1114. Epub 2013 Nov 19.
9
DOOR 2.0: presenting operons and their functions through dynamic and integrated views.DOOR 2.0:通过动态和集成的视图呈现操纵子及其功能。
Nucleic Acids Res. 2014 Jan;42(Database issue):D654-9. doi: 10.1093/nar/gkt1048. Epub 2013 Nov 7.
10
Transcription regulation of plastid genes involved in sulfate transport in Viridiplantae.参与硫酸盐转运的质体基因在光合植物中的转录调控。
Biomed Res Int. 2013;2013:413450. doi: 10.1155/2013/413450. Epub 2013 Aug 29.