• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无需基因组比对,从数百个微生物基因组中进行单核苷酸多态性关联分析和表型预测。

SNP-associations and phenotype predictions from hundreds of microbial genomes without genome alignments.

作者信息

Hall Barry G

机构信息

Bellingham Research Institute, Bellingham, Washington, United States of America.

出版信息

PLoS One. 2014 Feb 28;9(2):e90490. doi: 10.1371/journal.pone.0090490. eCollection 2014.

DOI:10.1371/journal.pone.0090490
PMID:24587377
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3938750/
Abstract

SNP-association studies are a starting point for identifying genes that may be responsible for specific phenotypes, such as disease traits. The vast bulk of tools for SNP-association studies are directed toward SNPs in the human genome, and I am unaware of any tools designed specifically for such studies in bacterial or viral genomes. The PPFS (Predict Phenotypes From SNPs) package described here is an add-on to kSNP , a program that can identify SNPs in a data set of hundreds of microbial genomes. PPFS identifies those SNPs that are non-randomly associated with a phenotype based on the χ² probability, then uses those diagnostic SNPs for two distinct, but related, purposes: (1) to predict the phenotypes of strains whose phenotypes are unknown, and (2) to identify those diagnostic SNPs that are most likely to be causally related to the phenotype. In the example illustrated here, from a set of 68 E. coli genomes, for 67 of which the pathogenicity phenotype was known, there were 418,500 SNPs. Using the phenotypes of 36 of those strains, PPFS identified 207 diagnostic SNPs. The diagnostic SNPs predicted the phenotypes of all of the genomes with 97% accuracy. It then identified 97 SNPs whose probability of being causally related to the pathogenic phenotype was >0.999. In a second example, from a set of 116 E. coli genome sequences, using the phenotypes of 65 strains PPFS identified 101 SNPs that predicted the source host (human or non-human) with 90% accuracy.

摘要

单核苷酸多态性(SNP)关联研究是识别可能导致特定表型(如疾病特征)的基因的起点。绝大多数用于SNP关联研究的工具都针对人类基因组中的SNP,而我并不知晓有任何专门为细菌或病毒基因组的此类研究设计的工具。这里描述的PPFS(从SNP预测表型)软件包是kSNP的一个附加组件,kSNP是一个能够在数百个微生物基因组的数据集中识别SNP的程序。PPFS基于χ²概率识别那些与表型非随机关联的SNP,然后将这些诊断性SNP用于两个不同但相关的目的:(1)预测表型未知的菌株的表型,以及(2)识别那些最有可能与表型存在因果关系的诊断性SNP。在本文所示的示例中,从一组68个大肠杆菌基因组(其中67个的致病性表型已知)中,共有418,500个SNP。利用其中36个菌株的表型,PPFS识别出207个诊断性SNP。这些诊断性SNP以97%的准确率预测了所有基因组的表型。然后它识别出97个与致病表型存在因果关系的概率>0.999的SNP。在第二个示例中,从一组116个大肠杆菌基因组序列中,利用65个菌株的表型,PPFS识别出101个SNP,这些SNP以90%的准确率预测了源宿主(人类或非人类)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ae/3938750/895de14e1437/pone.0090490.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ae/3938750/895de14e1437/pone.0090490.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ae/3938750/895de14e1437/pone.0090490.g001.jpg

相似文献

1
SNP-associations and phenotype predictions from hundreds of microbial genomes without genome alignments.无需基因组比对,从数百个微生物基因组中进行单核苷酸多态性关联分析和表型预测。
PLoS One. 2014 Feb 28;9(2):e90490. doi: 10.1371/journal.pone.0090490. eCollection 2014.
2
When whole-genome alignments just won't work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes.当全基因组比对无法奏效时:用于数百个微生物基因组的无比对单核苷酸多态性(SNP)发现及系统发育分析的kSNP v2软件
PLoS One. 2013 Dec 9;8(12):e81760. doi: 10.1371/journal.pone.0081760. eCollection 2013.
3
A phylogenomic analysis of Escherichia coli / Shigella group: implications of genomic features associated with pathogenicity and ecological adaptation.大肠杆菌/志贺氏菌群的系统基因组分析:与致病性和生态适应性相关的基因组特征的意义。
BMC Evol Biol. 2012 Sep 7;12:174. doi: 10.1186/1471-2148-12-174.
4
kSNP3.0: SNP detection and phylogenetic analysis of genomes without genome alignment or reference genome.kSNP3.0:无需基因组比对或参考基因组即可进行基因组的单核苷酸多态性检测和系统发育分析。
Bioinformatics. 2015 Sep 1;31(17):2877-8. doi: 10.1093/bioinformatics/btv271. Epub 2015 Apr 25.
5
Investigating the Relatedness of Enteroinvasive Escherichia coli to Other E. coli and Shigella Isolates by Using Comparative Genomics.通过比较基因组学研究侵袭性大肠杆菌与其他大肠杆菌和志贺氏菌分离株的相关性。
Infect Immun. 2016 Jul 21;84(8):2362-2371. doi: 10.1128/IAI.00350-16. Print 2016 Aug.
6
Using complete genome comparisons to identify sequences whose presence accurately predicts clinically important phenotypes.利用全基因组比较来识别那些存在可准确预测临床重要表型的序列。
PLoS One. 2013 Jul 23;8(7):e68901. doi: 10.1371/journal.pone.0068901. Print 2013.
7
Mash-based analyses of Escherichia coli genomes reveal 14 distinct phylogroups.基于 mash 的大肠杆菌基因组分析揭示了 14 个不同的系统发育群。
Commun Biol. 2021 Jan 26;4(1):117. doi: 10.1038/s42003-020-01626-5.
8
Identification of genetic variants of Brucella spp. through genome-wide association studies.通过全基因组关联研究鉴定布鲁氏菌属的遗传变异。
Infect Genet Evol. 2017 Dec;56:92-98. doi: 10.1016/j.meegid.2017.11.016. Epub 2017 Nov 15.
9
Identifying structural variation in haploid microbial genomes from short-read resequencing data using breseq.使用breseq从短读重测序数据中识别单倍体微生物基因组中的结构变异。
BMC Genomics. 2014 Nov 29;15(1):1039. doi: 10.1186/1471-2164-15-1039.
10
Single nucleotide polymorphism (SNP)-based differentiation of Shigella isolates by pyrosequencing.基于焦磷酸测序的志贺氏菌分离株单核苷酸多态性(SNP)分型。
Infect Genet Evol. 2011 Oct;11(7):1761-8. doi: 10.1016/j.meegid.2011.07.015. Epub 2011 Aug 4.

引用本文的文献

1
Evaluating the Potential of Bacillus Isolates for Chlorpyrifos Degradation and Their Role in Tea Growth Promotion and Suppression of Pathogens.评价芽孢杆菌分离物对毒死蜱的降解潜力及其对茶树生长促进和抑制病原菌的作用。
Curr Microbiol. 2024 Aug 28;81(10):332. doi: 10.1007/s00284-024-03859-7.
2
Building Phylogenetic Trees From Genome Sequences With kSNP4.基于 kSNP4 从基因组序列构建系统发育树。
Mol Biol Evol. 2023 Nov 3;40(11). doi: 10.1093/molbev/msad235.
3
Detecting patterns of accessory genome coevolution in Staphylococcus aureus using data from thousands of genomes.

本文引用的文献

1
When whole-genome alignments just won't work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes.当全基因组比对无法奏效时:用于数百个微生物基因组的无比对单核苷酸多态性(SNP)发现及系统发育分析的kSNP v2软件
PLoS One. 2013 Dec 9;8(12):e81760. doi: 10.1371/journal.pone.0081760. eCollection 2013.
2
Building phylogenetic trees from molecular data with MEGA.利用 MEGA 从分子数据构建系统发育树。
Mol Biol Evol. 2013 May;30(5):1229-35. doi: 10.1093/molbev/mst012. Epub 2013 Mar 13.
3
Criteria for selection of surrogates used to study the fate and control of pathogens in the environment.
利用来自数千个基因组的数据检测金黄色葡萄球菌附属基因组协同进化的模式。
BMC Bioinformatics. 2023 Jun 9;24(1):243. doi: 10.1186/s12859-023-05363-4.
4
Unravel the regulatory mechanism of Yrr1p phosphorylation in response to vanillin stress in Saccharomyces cerevisiae.解析酿酒酵母中 Yrr1p 磷酸化对应香草醛胁迫的调控机制。
Microb Cell Fact. 2023 Mar 11;22(1):48. doi: 10.1186/s12934-023-02056-8.
5
Development of a Single-nucleotide Polymorphism Genotyping Assay for the Rapid Detection of Vancomycin-intermediate Resistance in Epidemic Lineage ST5.开发一种单核苷酸多态性基因分型检测方法,用于快速检测流行谱系 ST5 的万古霉素中介耐药性。
Ann Lab Med. 2023 Jul 1;43(4):355-363. doi: 10.3343/alm.2023.43.4.355. Epub 2023 Feb 24.
6
Genome-wide association and dissociation studies in reveal potential virulence factors affecting and   ×  hybrid.全基因组关联和解离研究揭示了影响×杂交种的潜在毒力因子。 (注:原文中“in reveal potential virulence factors affecting and   ×  hybrid”部分缺失关键信息,翻译可能不太准确,完整准确的翻译需补充完整原文内容。)
Front Microbiol. 2023 Feb 2;13:1094155. doi: 10.3389/fmicb.2022.1094155. eCollection 2022.
7
Interaction analyses based on growth parameters of GWAS between Escherichia coli and Staphylococcus aureus.基于大肠杆菌和金黄色葡萄球菌全基因组关联研究生长参数的相互作用分析。
AMB Express. 2021 Mar 1;11(1):34. doi: 10.1186/s13568-021-01192-x.
8
Using affinity propagation clustering for identifying bacterial clades and subclades with whole-genome sequences of Francisella tularensis.使用亲和传播聚类分析鉴定土拉弗朗西斯菌全基因组序列中的细菌进化枝和亚进化枝。
PLoS Negl Trop Dis. 2020 Sep 29;14(9):e0008018. doi: 10.1371/journal.pntd.0008018. eCollection 2020 Sep.
9
Genes significantly associated with lineage II food isolates of Listeria monocytogenes.与单核细胞增生李斯特菌谱系 II 型食物分离株显著相关的基因。
BMC Genomics. 2018 Sep 25;19(1):708. doi: 10.1186/s12864-018-5074-2.
10
Population Dynamics of Staphylococcus aureus in Cystic Fibrosis Patients To Determine Transmission Events by Use of Whole-Genome Sequencing.囊性纤维化患者金黄色葡萄球菌的种群动态:通过全基因组测序确定传播事件
J Clin Microbiol. 2017 Jul;55(7):2143-2152. doi: 10.1128/JCM.00164-17. Epub 2017 Apr 26.
用于研究环境中病原体的命运和控制的替代物选择标准。
Appl Environ Microbiol. 2012 Mar;78(6):1969-77. doi: 10.1128/AEM.06582-11. Epub 2012 Jan 13.
4
MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.MEGA5:用于最大似然法、进化距离法和最大简约法的分子进化遗传学分析。
Mol Biol Evol. 2011 Oct;28(10):2731-9. doi: 10.1093/molbev/msr121. Epub 2011 May 4.
5
Genome sequencing of environmental Escherichia coli expands understanding of the ecology and speciation of the model bacterial species.环境大肠杆菌基因组测序拓展了对模式细菌物种生态和物种形成的理解。
Proc Natl Acad Sci U S A. 2011 Apr 26;108(17):7200-5. doi: 10.1073/pnas.1015622108. Epub 2011 Apr 11.
6
A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility.一种灵活的计算框架,用于在人类疾病易感性的遗传研究中检测、表征和解释上位性的统计模式。
J Theor Biol. 2006 Jul 21;241(2):252-61. doi: 10.1016/j.jtbi.2005.11.036. Epub 2006 Feb 2.
7
A novel algorithm and web-based tool for comparing two alternative phylogenetic trees.一种用于比较两棵备选系统发育树的新型算法及基于网络的工具。
Bioinformatics. 2006 Jan 1;22(1):117-9. doi: 10.1093/bioinformatics/bti720. Epub 2005 Oct 18.