Gene Tags Assessment by Comparative Genomics (GTACG): A User-Friendly Framework for Bacterial Comparative Genomics.

Authors

Santiago Caio Rafael do Nascimento, Assis Renata de Almeida Barbosa, Moreira Leandro Marcio, Digiampietri Luciano Antonio

Affiliations

Bioinformatics Graduate Program, University of Sao Paulo, Sao Paulo, Brazil.

Adventist University of Sao Paulo, Sao Paulo, Brazil.

Publication information

Front Genet. 2019 Aug 26;10:725. doi: 10.3389/fgene.2019.00725. eCollection 2019.

DOI:10.3389/fgene.2019.00725
PMID:31507629
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6718126/

Abstract

Genomics research has produced an exponentially growing amount of data. However, genetic knowledge pertaining to certain phenotypic characteristics is still lacking. In addition, a considerable share of the coding sequences (CDSs) in these genomes have unknown functions, posing additional challenges to researchers. Phylogenetically close microorganisms share many of their CDSs, and certain phenotypes unique to a set of microorganisms may result from genes found exclusively in those microorganisms. This study presents the GTACG framework, an easy-to-use tool that identifies, within subgroups of bacterial genomes whose microorganisms share phenotypic characteristics, the data that differentiate them from other associated genomes in a simple and fast way. The GTACG analysis is based on forming clusters of homologous CDSs from local alignments. The front-end is easy to use, and installation packages have been developed so that users without knowledge of programming languages or bioinformatics can analyze high-throughput data with the tool. The GTACG framework was validated in a case study involving 161 genomes from the Xanthomonadaceae family, in which 19 families of orthologous proteins were found in 90% of the plant-associated genomes, allowing the identification of proteins potentially associated with adaptation and virulence in plant tissue. The results show the potential of GTACG in the search for new targets for molecular studies, and GTACG can be used as a research tool by biologists who lack advanced knowledge of computational tools for bacterial comparative genomics.
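
The two ideas in the abstract, clustering homologous CDSs from local-alignment hits and then looking for families tied to a phenotype group, can be illustrated with a minimal Python sketch. This is not the GTACG implementation: the single-linkage (connected-components) clustering, the requirement that a family be absent from genomes outside the group, and all names (`cluster_cds`, `group_specific_families`, `genome_of`, `min_fraction`) are assumptions made for illustration. The input `hits` is assumed to be a list of CDS pairs that already passed some alignment threshold.

```python
from collections import defaultdict


def cluster_cds(hits):
    """Group CDS ids into families as connected components of the hit graph.

    Assumption: single-linkage clustering via union-find; the clustering
    used by GTACG itself may differ.
    """
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in hits:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    families = defaultdict(set)
    for cds in list(parent):
        families[find(cds)].add(cds)
    return list(families.values())


def group_specific_families(families, genome_of, group, min_fraction=0.9):
    """Keep families found in at least `min_fraction` of the genomes in
    `group` and in no genome outside it (exclusivity is an assumption)."""
    group = set(group)
    selected = []
    for family in families:
        genomes = {genome_of[cds] for cds in family}
        inside = genomes & group
        outside = genomes - group
        if not outside and len(inside) >= min_fraction * len(group):
            selected.append(family)
    return selected


# Toy data: CDS ids are hypothetical; g1-g3 play the role of the phenotype group.
hits = [("g1_a", "g2_a"), ("g2_a", "g3_a"), ("g1_b", "g4_b")]
genome_of = {
    "g1_a": "g1", "g2_a": "g2", "g3_a": "g3",
    "g1_b": "g1", "g4_b": "g4",
}
families = cluster_cds(hits)
specific = group_specific_families(families, genome_of, group={"g1", "g2", "g3"})
```

In this toy input, the family linking `g1_a`, `g2_a`, and `g3_a` covers every genome in the group and none outside it, so it is the only one retained. CDSs with no hits never enter `hits` and therefore form no family in this sketch.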


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b784/6718126/c6d0ff1100de/fgene-10-00725-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b784/6718126/05750546ceec/fgene-10-00725-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b784/6718126/bb7d5ebdd68d/fgene-10-00725-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b784/6718126/895f169dfcdf/fgene-10-00725-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b784/6718126/3fd492d46270/fgene-10-00725-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b784/6718126/1bd34fb69d7c/fgene-10-00725-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b784/6718126/14b5d822ce80/fgene-10-00725-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b784/6718126/c4ca3f1e7e7c/fgene-10-00725-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b784/6718126/2f2cde8e582b/fgene-10-00725-g009.jpg
