• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

快速搜索所有已存入的细菌和病毒基因组数据。

Ultrafast search of all deposited bacterial and viral genomic data.

机构信息

Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK.

EMBL-EBI, Hinxton, UK.

出版信息

Nat Biotechnol. 2019 Feb;37(2):152-159. doi: 10.1038/s41587-018-0010-1. Epub 2019 Feb 4.

DOI:10.1038/s41587-018-0010-1
PMID:30718882
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6420049/
Abstract

Exponentially increasing amounts of unprocessed bacterial and viral genomic sequence data are stored in the global archives. The ability to query these data for sequence search terms would facilitate both basic research and applications such as real-time genomic epidemiology and surveillance. However, this is not possible with current methods. To solve this problem, we combine knowledge of microbial population genomics with computational methods devised for web search to produce a searchable data structure named BItsliced Genomic Signature Index (BIGSI). We indexed the entire global corpus of 447,833 bacterial and viral whole-genome sequence datasets using four orders of magnitude less storage than previous methods. We applied our BIGSI search function to rapidly find resistance genes MCR-1, MCR-2, and MCR-3, determine the host-range of 2,827 plasmids, and quantify antibiotic resistance in archived datasets. Our index can grow incrementally as new (unprocessed or assembled) sequence datasets are deposited and can scale to millions of datasets.

摘要

大量未经处理的细菌和病毒基因组序列数据不断增加,并存储在全球档案库中。能够针对这些数据进行序列搜索词查询,将有助于促进基础研究和实时基因组流行病学及监测等应用。但目前的方法无法实现这一点。为了解决这个问题,我们将微生物群体基因组学知识与专为网络搜索设计的计算方法相结合,生成了一种可搜索的数据结构,名为 Bitsliced Genomic Signature Index(BIGSI)。我们使用比以前的方法少四个数量级的存储空间,对包含 447,833 个细菌和病毒全基因组序列数据集的全球语料库进行了索引。我们应用 BIGSI 搜索功能快速查找了耐药基因 MCR-1、MCR-2 和 MCR-3,确定了 2,827 个质粒的宿主范围,并量化了存档数据集中的抗生素耐药性。我们的索引可以随着新的(未处理或组装的)序列数据集的不断增加而逐步增长,并且可以扩展到数百万个数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfc6/6420049/ece9c23cd4ad/emss-80982-f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfc6/6420049/e7f6e15258d4/emss-80982-f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfc6/6420049/f3baa3f1ea4b/emss-80982-f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfc6/6420049/474c026a70bf/emss-80982-f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfc6/6420049/f02512ac4984/emss-80982-f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfc6/6420049/a14ea441cdd2/emss-80982-f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfc6/6420049/ece9c23cd4ad/emss-80982-f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfc6/6420049/e7f6e15258d4/emss-80982-f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfc6/6420049/f3baa3f1ea4b/emss-80982-f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfc6/6420049/474c026a70bf/emss-80982-f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfc6/6420049/f02512ac4984/emss-80982-f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfc6/6420049/a14ea441cdd2/emss-80982-f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfc6/6420049/ece9c23cd4ad/emss-80982-f006.jpg

相似文献

1
Ultrafast search of all deposited bacterial and viral genomic data.快速搜索所有已存入的细菌和病毒基因组数据。
Nat Biotechnol. 2019 Feb;37(2):152-159. doi: 10.1038/s41587-018-0010-1. Epub 2019 Feb 4.
2
The co-transfer of plasmid-borne colistin-resistant genes mcr-1 and mcr-3.5, the carbapenemase gene bla and the 16S methylase gene rmtB from Escherichia coli.质粒介导的黏菌素耐药基因 mcr-1 和 mcr-3.5、碳青霉烯酶基因 bla 和 16S 甲基化酶基因 rmtB 从大肠杆菌中的共同转移。
Sci Rep. 2019 Jan 24;9(1):696. doi: 10.1038/s41598-018-37125-1.
3
Genetic Analysis of p17S-208 Plasmid Encoding the Colistin Resistance mcr-3 Gene in Escherichia coli Isolated from Swine in South Korea.对从韩国猪中分离出的携带耐黏菌素mcr-3基因的p17S - 208质粒进行的遗传分析。
Microb Drug Resist. 2019 Apr;25(3):457-461. doi: 10.1089/mdr.2018.0132. Epub 2018 Oct 31.
4
Various conjugative plasmids carrying the mcr-5 gene in Escherichia coli isolates from healthy chickens in Paraguay.从巴拉圭健康鸡中分离出的大肠杆菌中携带mcr-5基因的各种接合质粒。
J Antimicrob Chemother. 2019 Nov 1;74(11):3394-3397. doi: 10.1093/jac/dkz317.
5
Co-occurrence of mcr-1 and mcr-3 genes in a single Escherichia coli in New Zealand.新西兰一株大肠杆菌中mcr-1和mcr-3基因的共现。
J Antimicrob Chemother. 2019 Oct 1;74(10):3113-3116. doi: 10.1093/jac/dkz311.
6
Genetic environment of colistin resistance genes mcr-1 and mcr-3 in Escherichia coli from one pig farm in China.中国某猪场大肠杆菌中黏菌素耐药基因 mcr-1 和 mcr-3 的遗传环境。
Vet Microbiol. 2019 Mar;230:56-61. doi: 10.1016/j.vetmic.2019.01.011. Epub 2019 Jan 12.
7
Heterogeneous and Flexible Transmission of in Hospital-Associated Escherichia coli.医院相关大肠杆菌中 的异质和灵活传播。
mBio. 2018 Jul 3;9(4):e00943-18. doi: 10.1128/mBio.00943-18.
8
Complete genome sequencing and genomic characterization of two Escherichia coli strains co-producing MCR-1 and NDM-1 from bloodstream infection.从血流感染中同时产生 MCR-1 和 NDM-1 的两株大肠杆菌的全基因组测序和基因组特征分析。
Sci Rep. 2017 Dec 20;7(1):17885. doi: 10.1038/s41598-017-18273-2.
9
Emergence and Comparative Genomics Analysis of Extended-Spectrum-β-Lactamase-Producing Escherichia coli Carrying in Fennec Fox Imported from Sudan to China.从苏丹进口到中国的雪貂中携带的产extended-spectrum-β-lactamase 大肠杆菌的出现及比较基因组学分析。
mSphere. 2019 Nov 20;4(6):e00732-19. doi: 10.1128/mSphere.00732-19.
10
Molecular Insights into Functional Differences between - and -Mediated Colistin Resistance.解析:原文中“-”和“-”之间的空格并非分隔符,而是为了使“Mediated”一词与后文“Colistin Resistance”隔开而添加的。在翻译时,应将其还原为“-mediated”。 因此,译文为: 深入解析 - 与 - 介导的多黏菌素耐药性的功能差异。
Antimicrob Agents Chemother. 2018 Aug 27;62(9). doi: 10.1128/AAC.00366-18. Print 2018 Sep.

引用本文的文献

1
A prevalent huge phage clade in human and animal gut microbiomes.在人类和动物肠道微生物群中普遍存在的一个巨大噬菌体分支。
Res Sq. 2025 Aug 19:rs.3.rs-7356405. doi: 10.21203/rs.3.rs-7356405/v1.
2
A prevalent huge phage clade in human and animal gut microbiomes.人类和动物肠道微生物群中普遍存在的一个巨大噬菌体分支。
bioRxiv. 2025 Aug 11:2025.08.10.669567. doi: 10.1101/2025.08.10.669567.
3
Kaminari: a resource-frugal index for approximate colored -mer queries.电雷:一种用于近似彩色k-mer查询的资源节约型索引。

本文引用的文献

1
Mantis: A Fast, Small, and Exact Large-Scale Sequence-Search Index.螳螂:一种快速、小巧、精确的大规模序列搜索索引。
Cell Syst. 2018 Aug 22;7(2):201-207.e4. doi: 10.1016/j.cels.2018.05.021. Epub 2018 Jun 20.
2
Integrating long-range connectivity information into de Bruijn graphs.将长程连接信息整合到 de Bruijn 图中。
Bioinformatics. 2018 Aug 1;34(15):2556-2565. doi: 10.1093/bioinformatics/bty157.
3
On the (im)possibility of reconstructing plasmids from whole-genome short-read sequencing data.从全基因组短读测序数据重建质粒的(不)可能性。
bioRxiv. 2025 May 21:2025.05.16.654317. doi: 10.1101/2025.05.16.654317.
4
Efficient and robust search of microbial genomes via phylogenetic compression.通过系统发育压缩对微生物基因组进行高效且稳健的搜索。
Nat Methods. 2025 Apr;22(4):692-697. doi: 10.1038/s41592-025-02625-2. Epub 2025 Apr 9.
5
The backpack quotient filter: A dynamic and space-efficient data structure for querying -mers with abundance.背包商数过滤器:一种用于查询具有丰度的k-mers的动态且节省空间的数据结构。
iScience. 2024 Nov 23;27(12):111435. doi: 10.1016/j.isci.2024.111435. eCollection 2024 Dec 20.
6
MOBFinder: a tool for mobilization typing of plasmid metagenomic fragments based on a language model.MOBFinder:一种基于语言模型的质粒宏基因组片段移动分型工具。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae047.
7
GSearch: ultra-fast and scalable genome search by combining K-mer hashing with hierarchical navigable small world graphs.GSearch:通过组合 K -mer 哈希和分层可导航小世界图实现超快速和可扩展的基因组搜索。
Nucleic Acids Res. 2024 Sep 9;52(16):e74. doi: 10.1093/nar/gkae609.
8
MiPRIME: an integrated and intelligent platform for mining primer and probe sequences of microbial species.MiPRIME:用于挖掘微生物物种引物和探针序列的集成智能平台。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae429.
9
The origins of haplotype 58 (H58) Salmonella enterica serovar Typhi.H58 型(H58)伤寒沙门氏菌的起源。
Commun Biol. 2024 Jun 28;7(1):775. doi: 10.1038/s42003-024-06451-8.
10
Fast and space-efficient taxonomic classification of long reads with hierarchical interleaved XOR filters.基于分层交错异或过滤器的长读快速且节省空间的分类学分类。
Genome Res. 2024 Jul 23;34(6):914-924. doi: 10.1101/gr.278623.123.
Microb Genom. 2017 Aug 18;3(10):e000128. doi: 10.1099/mgen.0.000128. eCollection 2017 Oct.
4
Towards a genomics-informed, real-time, global pathogen surveillance system.迈向一个基于基因组学的、实时的全球病原体监测系统。
Nat Rev Genet. 2018 Jan;19(1):9-20. doi: 10.1038/nrg.2017.88. Epub 2017 Nov 13.
5
Global phylogenetic analysis of Escherichia coli and plasmids carrying the mcr-1 gene indicates bacterial diversity but plasmid restriction.全球携带 mcr-1 基因的大肠杆菌和质粒的系统发育分析表明细菌多样性但质粒受限。
Sci Rep. 2017 Nov 10;7(1):15364. doi: 10.1038/s41598-017-15539-7.
6
Novel Plasmid-Mediated Colistin Resistance Gene in .新型质粒介导的黏菌素耐药基因在……中
mBio. 2017 Jun 27;8(3):e00543-17. doi: 10.1128/mBio.00543-17.
7
Comprehensive Whole-Genome Sequencing and Reporting of Drug Resistance Profiles on Clinical Cases of Mycobacterium tuberculosis in New York State.纽约州结核分枝杆菌临床病例的全基因组测序及耐药谱综合报告
J Clin Microbiol. 2017 Jun;55(6):1871-1882. doi: 10.1128/JCM.00298-17. Epub 2017 Apr 5.
8
WGS to predict antibiotic MICs for Neisseria gonorrhoeae.全基因组测序用于预测淋病奈瑟菌的抗生素最低抑菌浓度。
J Antimicrob Chemother. 2017 Jul 1;72(7):1937-1947. doi: 10.1093/jac/dkx067.
9
Same-Day Diagnostic and Surveillance Data for Tuberculosis via Whole-Genome Sequencing of Direct Respiratory Samples.通过直接呼吸道样本的全基因组测序获得的结核病当日诊断和监测数据。
J Clin Microbiol. 2017 May;55(5):1285-1298. doi: 10.1128/JCM.02483-16. Epub 2017 Mar 8.
10
MCR-1.6, a New MCR Variant Carried by an IncP Plasmid in a Colistin-Resistant Salmonella enterica Serovar Typhimurium Isolate from a Healthy Individual.MCR-1.6,一种由IncP质粒携带的新型MCR变体,存在于从一名健康个体分离出的耐黏菌素肠炎沙门氏菌鼠伤寒血清型菌株中。
Antimicrob Agents Chemother. 2017 Apr 24;61(5). doi: 10.1128/AAC.02632-16. Print 2017 May.