• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PStrain:一种用于鸟枪法宏基因组测序数据的迭代微生物菌株分析算法。

PStrain: an iterative microbial strains profiling algorithm for shotgun metagenomic sequencing data.

作者信息

Wang Shuai, Jiang Yiqi, Li Shuaicheng

机构信息

Department of Computer Science, City University of Hong Kong, Kowloon Tong, Hong Kong.

出版信息

Bioinformatics. 2021 Apr 1;36(22-23):5499-5506. doi: 10.1093/bioinformatics/btaa1056.

DOI:10.1093/bioinformatics/btaa1056
PMID:33346799
Abstract

MOTIVATION

The microbial community plays an essential role in human diseases and physiological activities. The functions of microbes can differ due to strain-level differences in the genome sequences. Shotgun metagenomic sequencing allows us to profile the strains in microbial communities practically. However, current methods are underdeveloped due to the highly similar sequences among strains. We observe that strains genotypes at the same single nucleotide variant (SNV) locus can be speculated by the genotype frequencies. Also, the variants in different loci covered by the same reads can provide evidence that they reside on the same strain.

RESULTS

These insights inspire us to design PStrain, an optimization method that utilizes genotype frequencies and the reads which cover multiple SNV loci to profile strains iteratively based on SNVs in a set of MetaPhlAn2 marker genes. Compared to the state-of-art methods, PStrain, on average, improved the performance of inferring strains abundances and genotypes by 87.75% and 59.45%, respectively. We have applied the PStrain package to the dataset with two cohorts of colorectal cancer (CRC) and found that the sequences of Bacteroides coprocola strains are significantly different between CRC and control samples, which is the first time to report the potential role of B.coprocola in the gut microbiota of CRC.

AVAILABILITYAND IMPLEMENTATION

https://github.com/wshuai294/PStrain.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

微生物群落在人类疾病和生理活动中起着至关重要的作用。由于基因组序列中菌株水平的差异,微生物的功能可能会有所不同。鸟枪法宏基因组测序使我们能够实际分析微生物群落中的菌株。然而,由于菌株之间的序列高度相似,目前的方法尚不完善。我们观察到,可以通过基因型频率推测同一单核苷酸变异(SNV)位点的菌株基因型。此外,同一 reads 覆盖的不同位点的变异可以提供证据表明它们存在于同一菌株上。

结果

这些见解启发我们设计了 PStrain,这是一种优化方法,它利用基因型频率和覆盖多个 SNV 位点的 reads,基于一组 MetaPhlAn2 标记基因中的 SNV 对菌株进行迭代分析。与现有方法相比,PStrain 平均将推断菌株丰度和基因型的性能分别提高了 87.75%和 59.45%。我们将 PStrain 软件包应用于两个结直肠癌(CRC)队列的数据集,发现 CRC 样本和对照样本中粪拟杆菌菌株的序列存在显著差异,这是首次报道粪拟杆菌在 CRC 肠道微生物群中的潜在作用。

可用性和实现方式

https://github.com/wshuai294/PStrain。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
PStrain: an iterative microbial strains profiling algorithm for shotgun metagenomic sequencing data.PStrain:一种用于鸟枪法宏基因组测序数据的迭代微生物菌株分析算法。
Bioinformatics. 2021 Apr 1;36(22-23):5499-5506. doi: 10.1093/bioinformatics/btaa1056.
2
TIPP2: metagenomic taxonomic profiling using phylogenetic markers.TIPP2:使用系统发育标记进行宏基因组分类分析
Bioinformatics. 2021 Jul 27;37(13):1839-1845. doi: 10.1093/bioinformatics/btab023.
3
An integrated strain-level analytic pipeline utilizing longitudinal metagenomic data.利用纵向宏基因组数据的综合菌株水平分析管道。
Microbiol Spectr. 2024 Nov 5;12(11):e0143124. doi: 10.1128/spectrum.01431-24. Epub 2024 Sep 23.
4
MSPminer: abundance-based reconstitution of microbial pan-genomes from shotgun metagenomic data.MSPminer:基于丰度的宏基因组数据中微生物泛基因组重建。
Bioinformatics. 2019 May 1;35(9):1544-1552. doi: 10.1093/bioinformatics/bty830.
5
StrainR2 accurately deconvolutes strain-level abundances in synthetic microbial communities.StrainR2能够准确地反卷积合成微生物群落中菌株水平的丰度。
bioRxiv. 2024 Aug 9:2024.08.08.607172. doi: 10.1101/2024.08.08.607172.
6
Establishing a novel colorectal cancer predictive model based on unique gut microbial single nucleotide variant markers.基于独特肠道微生物单核苷酸变异标记物建立新型结直肠癌预测模型。
Gut Microbes. 2021 Jan-Dec;13(1):1-6. doi: 10.1080/19490976.2020.1869505.
7
metaSNV v2: detection of SNVs and subspecies in prokaryotic metagenomes.metaSNV v2:原核生物宏基因组中单个核苷酸变异和亚种的检测
Bioinformatics. 2022 Jan 27;38(4):1162-1164. doi: 10.1093/bioinformatics/btab789.
8
OGRE: Overlap Graph-based metagenomic Read clustEring.OGRE:基于重叠图的宏基因组读聚类。
Bioinformatics. 2021 May 17;37(7):905-912. doi: 10.1093/bioinformatics/btaa760.
9
MultiGeMS: detection of SNVs from multiple samples using model selection on high-throughput sequencing data.MultiGeMS:利用高通量测序数据的模型选择从多个样本中检测单核苷酸变异
Bioinformatics. 2016 May 15;32(10):1486-92. doi: 10.1093/bioinformatics/btv753. Epub 2016 Jan 18.
10
The impact of sequencing depth on the inferred taxonomic composition and AMR gene content of metagenomic samples.测序深度对宏基因组样本推断的分类组成和抗菌药物耐药基因含量的影响。
Environ Microbiome. 2019 Oct 24;14(1):7. doi: 10.1186/s40793-019-0347-1.

引用本文的文献

1
StrainFacts accurately quantifies both endogenous and live biotherapeutic product strain abundances in simulated and clinical vaginal microbiota samples.StrainFacts能够准确量化模拟和临床阴道微生物群样本中内源性和活生物治疗产品菌株的丰度。
bioRxiv. 2025 Aug 21:2025.08.15.670563. doi: 10.1101/2025.08.15.670563.
2
StrainPanDA: Linked reconstruction of strain composition and gene content profiles via pangenome-based decomposition of metagenomic data.StrainPanDA:通过基于泛基因组的宏基因组数据分解对菌株组成和基因含量谱进行关联重建
Imeta. 2022 Aug 1;1(3):e41. doi: 10.1002/imt2.41. eCollection 2022 Sep.
3
Inspecting mother-to-infant microbiota transmission: disturbance of strain inheritance by cesarian section.
检查母婴微生物群传播:剖宫产对菌株遗传的干扰。
Front Microbiol. 2024 Feb 29;15:1292377. doi: 10.3389/fmicb.2024.1292377. eCollection 2024.
4
CyanoStrainChip: A Novel DNA Microarray Tool for High-Throughput Detection of Environmental Cyanobacteria at the Strain Level.氰基藻芯片:一种高通量检测环境蓝藻的新型 DNA 微阵列工具,可在菌株水平上进行。
Environ Sci Technol. 2024 Mar 19;58(11):5024-5034. doi: 10.1021/acs.est.3c11096. Epub 2024 Mar 7.
5
SpecHLA enables full-resolution HLA typing from sequencing data.SpecHLA 能够从测序数据中进行全分辨率 HLA 分型。
Cell Rep Methods. 2023 Sep 25;3(9):100589. doi: 10.1016/j.crmeth.2023.100589. Epub 2023 Sep 14.
6
Orally administrated TM13 and LG55 can restore the vaginal health of patients recovering from bacterial vaginosis.口服 TM13 和 LG55 可以恢复细菌性阴道病患者的阴道健康。
Front Immunol. 2023 Jul 27;14:1125239. doi: 10.3389/fimmu.2023.1125239. eCollection 2023.
7
AD-Syn-Net: systematic identification of Alzheimer's disease-associated mutation and co-mutation vulnerabilities via deep learning.AD-Syn-Net:通过深度学习系统地识别阿尔茨海默病相关突变和共突变脆弱性。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad030.