• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用基于基因簇中心的 -mers 减少细菌全基因组关联的歧义性并提高可解释性。

Reduced ambiguity and improved interpretability of bacterial genome-wide associations using gene-cluster-centric -mers.

机构信息

Institute for Molecular Bacteriology, TWINCORE Centre for Experimental and Clinical Infection Research, a joint venture between the Hannover Medical School (MHH) and the Helmholtz Centre for Infection Research (HZI), Hannover, Germany.

Cluster of Excellence RESIST (EXC 2155), Hannover Medical School (MHH), Hannover, Germany.

出版信息

Microb Genom. 2023 Nov;9(11). doi: 10.1099/mgen.0.001129.

DOI:10.1099/mgen.0.001129
PMID:37934071
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10711318/
Abstract

The wide adoption of bacterial genome sequencing and encoding both core and accessory genome variation using -mers has allowed bacterial genome-wide association studies (GWAS) to identify genetic variants associated with relevant phenotypes such as those linked to infection. Significant limitations still remain because of -mers being duplicated across gene clusters and as far as the interpretation of association results is concerned, which affects the wider adoption of GWAS methods on microbial data sets. We have developed a simple computational method (panfeed) that explicitly links each -mer to their gene cluster at base-resolution level, which allows us to avoid biases introduced by a global de Bruijn graph as well as more easily map and annotate associated variants. We tested panfeed on two independent data sets, correctly identifying previously characterized causal variants, which demonstrates the precision of the method, as well as its scalable performance. panfeed is a command line tool written in the python programming language and is available at https://github.com/microbial-pangenomes-lab/panfeed.

摘要

广泛采用细菌基因组测序,并使用 -mers 对核心和辅助基因组变异进行编码,使得细菌全基因组关联研究(GWAS)能够识别与相关表型相关的遗传变异,例如与感染相关的表型。由于 -mers 在基因簇之间重复,以及关联结果的解释仍然存在重大限制,这影响了 GWAS 方法在微生物数据集上的更广泛采用。我们开发了一种简单的计算方法(panfeed),可以在碱基分辨率水平上明确将每个 -mer 与其基因簇联系起来,这使我们能够避免全局 de Bruijn 图引入的偏差,并更轻松地映射和注释相关变体。我们在两个独立的数据集中测试了 panfeed,正确识别了先前表征的因果变异,这证明了该方法的精确性,以及其可扩展的性能。panfeed 是一个用 Python 编程语言编写的命令行工具,可在 https://github.com/microbial-pangenomes-lab/panfeed 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b880/10711318/4f9e6f0d9795/mgen-9-1129-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b880/10711318/db4dbbc14caf/mgen-9-1129-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b880/10711318/ba7eb4cbeb5b/mgen-9-1129-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b880/10711318/5410b3393127/mgen-9-1129-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b880/10711318/a78c4f45ad3c/mgen-9-1129-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b880/10711318/4f9e6f0d9795/mgen-9-1129-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b880/10711318/db4dbbc14caf/mgen-9-1129-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b880/10711318/ba7eb4cbeb5b/mgen-9-1129-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b880/10711318/5410b3393127/mgen-9-1129-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b880/10711318/a78c4f45ad3c/mgen-9-1129-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b880/10711318/4f9e6f0d9795/mgen-9-1129-g005.jpg

相似文献

1
Reduced ambiguity and improved interpretability of bacterial genome-wide associations using gene-cluster-centric -mers.使用基于基因簇中心的 -mers 减少细菌全基因组关联的歧义性并提高可解释性。
Microb Genom. 2023 Nov;9(11). doi: 10.1099/mgen.0.001129.
2
A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events.一种快速且无偏倚的细菌全基因组关联研究方法:弥合 k- mers 与遗传事件之间的差距。
PLoS Genet. 2018 Nov 12;14(11):e1007758. doi: 10.1371/journal.pgen.1007758. eCollection 2018 Nov.
3
CALDERA: finding all significant de Bruijn subgraphs for bacterial GWAS.CALDERA:用于细菌 GWAS 的所有显著 de Bruijn 子图的发现。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i36-i44. doi: 10.1093/bioinformatics/btac238.
4
kGWASflow: a modular, flexible, and reproducible Snakemake workflow for k-mers-based GWAS.kGWASflow:一种基于 k-mer 的 GWAS 的模块化、灵活和可重复的 Snakemake 工作流程。
G3 (Bethesda). 2023 Dec 29;14(1). doi: 10.1093/g3journal/jkad246.
5
: a pre- and post- genome-wide association studies pipeline for detecting phenotype-associated genome rearrangement events.一个用于检测表型相关基因组重排事件的全基因组关联研究前和后管道。
Microb Genom. 2024 Jul;10(7). doi: 10.1099/mgen.0.001268.
6
Benchmarking bacterial genome-wide association study methods using simulated genomes and phenotypes.基于模拟基因组和表型对细菌全基因组关联研究方法进行基准测试。
Microb Genom. 2020 Mar;6(3). doi: 10.1099/mgen.0.000337.
7
GWAS for genetics of complex quantitative traits: Genome to pangenome and SNPs to SVs and k-mers.全基因组关联分析(GWAS)在复杂数量性状的遗传学研究中的应用:从基因组到泛基因组,从单核苷酸多态性(SNPs)到结构变异(SVs)和 k-mers。
Bioessays. 2021 Nov;43(11):e2100109. doi: 10.1002/bies.202100109. Epub 2021 Sep 6.
8
Quantifying the mapping precision of genome-wide association studies using whole-genome sequencing data.利用全基因组测序数据量化全基因组关联研究的定位精度
Genome Biol. 2017 May 16;18(1):86. doi: 10.1186/s13059-017-1216-0.
9
A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination.一种在考虑种群结构和重组的情况下,在微生物中进行全基因组关联研究的系统发育方法。
PLoS Comput Biol. 2018 Feb 5;14(2):e1005958. doi: 10.1371/journal.pcbi.1005958. eCollection 2018 Feb.
10
MegaGTA: a sensitive and accurate metagenomic gene-targeted assembler using iterative de Bruijn graphs.MegaGTA:一种使用迭代德布鲁因图的灵敏且准确的宏基因组基因靶向组装器。
BMC Bioinformatics. 2017 Oct 16;18(Suppl 12):408. doi: 10.1186/s12859-017-1825-3.

引用本文的文献

1
Bacterial genome-wide association studies: exploring the genetic variation underlying bacterial phenotypes.细菌全基因组关联研究:探索细菌表型背后的遗传变异
Appl Environ Microbiol. 2025 Jun 18;91(6):e0251224. doi: 10.1128/aem.02512-24. Epub 2025 May 16.
2
: a pre- and post- genome-wide association studies pipeline for detecting phenotype-associated genome rearrangement events.一个用于检测表型相关基因组重排事件的全基因组关联研究前和后管道。
Microb Genom. 2024 Jul;10(7). doi: 10.1099/mgen.0.001268.

本文引用的文献

1
Accurate and fast graph-based pangenome annotation and clustering with ggCaller.使用 ggCaller 实现基于图的精确快速泛基因组注释和聚类。
Genome Res. 2023 Sep;33(9):1622-1637. doi: 10.1101/gr.277733.123. Epub 2023 Aug 24.
2
CALDERA: finding all significant de Bruijn subgraphs for bacterial GWAS.CALDERA:用于细菌 GWAS 的所有显著 de Bruijn 子图的发现。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i36-i44. doi: 10.1093/bioinformatics/btac238.
3
Genome wide association study of Escherichia coli bloodstream infection isolates identifies genetic determinants for the portal of entry but not fatal outcome.
全基因组关联研究鉴定了大肠埃希菌血流感染分离株的肠外感染源相关遗传决定因素,但与致死结局无关。
PLoS Genet. 2022 Mar 24;18(3):e1010112. doi: 10.1371/journal.pgen.1010112. eCollection 2022 Mar.
4
Genome-wide association studies reveal the role of polymorphisms affecting factor H binding protein expression in host invasion by Neisseria meningitidis.全基因组关联研究揭示了影响因子 H 结合蛋白表达的多态性在脑膜炎奈瑟菌宿主侵袭中的作用。
PLoS Pathog. 2021 Oct 18;17(10):e1009992. doi: 10.1371/journal.ppat.1009992. eCollection 2021 Oct.
5
Pandora: nucleotide-resolution bacterial pan-genomics with reference graphs.潘多拉:基于参考图谱的核苷酸分辨率细菌泛基因组学。
Genome Biol. 2021 Sep 14;22(1):267. doi: 10.1186/s13059-021-02473-1.
6
Horizontally acquired papGII-containing pathogenicity islands underlie the emergence of invasive uropathogenic Escherichia coli lineages.水平获得的含有 papGII 的致病性岛是侵袭性泌尿道致病性大肠杆菌谱系出现的基础。
Nat Commun. 2020 Nov 24;11(1):5968. doi: 10.1038/s41467-020-19714-9.
7
Major role of iron uptake systems in the intrinsic extra-intestinal virulence of the genus Escherichia revealed by a genome-wide association study.通过全基因组关联研究揭示铁摄取系统在大肠杆菌属固有肠外毒力中的主要作用。
PLoS Genet. 2020 Oct 28;16(10):e1009065. doi: 10.1371/journal.pgen.1009065. eCollection 2020 Oct.
8
Array programming with NumPy.使用 NumPy 进行数组编程。
Nature. 2020 Sep;585(7825):357-362. doi: 10.1038/s41586-020-2649-2. Epub 2020 Sep 16.
9
Producing polished prokaryotic pangenomes with the Panaroo pipeline.使用 Panaroo 管道生成精修的原核泛基因组。
Genome Biol. 2020 Jul 22;21(1):180. doi: 10.1186/s13059-020-02090-4.
10
Improved Prediction of Bacterial Genotype-Phenotype Associations Using Interpretable Pangenome-Spanning Regressions.利用可解释的泛基因组跨越回归提高细菌基因型-表型关联的预测能力。
mBio. 2020 Jul 7;11(4):e01344-20. doi: 10.1128/mBio.01344-20.