• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Pseudoalignment for metagenomic read assignment.用于宏基因组读分配的伪比对。
Bioinformatics. 2017 Jul 15;33(14):2082-2088. doi: 10.1093/bioinformatics/btx106.
2
Mora: abundance aware metagenomic read re-assignment for disentangling similar strains.莫拉:用于区分相似菌株的丰度感知宏基因组读数重新分配法
BMC Bioinformatics. 2024 Apr 23;25(1):161. doi: 10.1186/s12859-024-05768-9.
3
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.环境宏基因组的MinION™纳米孔测序:一种合成方法。
Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.
4
MetaWRAP-a flexible pipeline for genome-resolved metagenomic data analysis.MetaWRAP-一个用于基因组解析宏基因组数据分析的灵活管道。
Microbiome. 2018 Sep 15;6(1):158. doi: 10.1186/s40168-018-0541-1.
5
COCACOLA: binning metagenomic contigs using sequence COmposition, read CoverAge, CO-alignment and paired-end read LinkAge.可口可乐:利用序列组成、读段覆盖度、共比对和双端读段连接对宏基因组重叠群进行分箱。
Bioinformatics. 2017 Mar 15;33(6):791-798. doi: 10.1093/bioinformatics/btw290.
6
ViraPipe: scalable parallel pipeline for viral metagenome analysis from next generation sequencing reads.ViraPipe:用于从下一代测序读取中进行病毒宏基因组分析的可扩展并行管道。
Bioinformatics. 2018 Mar 15;34(6):928-935. doi: 10.1093/bioinformatics/btx702.
7
A novel data structure to support ultra-fast taxonomic classification of metagenomic sequences with k-mer signatures.一种新的数据结构,用于支持基于 k-mer 特征的宏基因组序列的超快速分类学分类。
Bioinformatics. 2018 Jan 1;34(1):171-178. doi: 10.1093/bioinformatics/btx432.
8
Fast and space-efficient taxonomic classification of long reads with hierarchical interleaved XOR filters.基于分层交错异或过滤器的长读快速且节省空间的分类学分类。
Genome Res. 2024 Jul 23;34(6):914-924. doi: 10.1101/gr.278623.123.
9
Flexible taxonomic assignment of ambiguous sequencing reads.灵活的分类学分配模糊测序读段。
BMC Bioinformatics. 2011 Jan 7;12:8. doi: 10.1186/1471-2105-12-8.
10
Sigma: strain-level inference of genomes from metagenomic analysis for biosurveillance.西格玛:用于生物监测的宏基因组分析中基因组的菌株水平推断。
Bioinformatics. 2015 Jan 15;31(2):170-7. doi: 10.1093/bioinformatics/btu641. Epub 2014 Sep 29.

引用本文的文献

1
Alevin-fry-atac enables rapid and memory frugal mapping of single-cell ATAC-seq data using virtual colors for accurate genomic pseudoalignment.Alevin-fry-atac可使用虚拟颜色实现单细胞ATAC-seq数据的快速且节省内存的映射,以进行准确的基因组伪比对。
Bioinformatics. 2025 Jul 1;41(Supplement_1):i237-i245. doi: 10.1093/bioinformatics/btaf234.
2
Scaling laws of bacterial and archaeal plasmids.细菌和古菌质粒的标度律。
Nat Commun. 2025 Jul 2;16(1):6023. doi: 10.1038/s41467-025-61205-2.
3
Gut bacterial metabolite imidazole propionate potentiates Alzheimer's disease pathology.肠道细菌代谢物咪唑丙酸会加剧阿尔茨海默病的病理变化。
bioRxiv. 2025 Jun 8:2025.06.08.657719. doi: 10.1101/2025.06.08.657719.
4
Longitudinal profiling of low-abundance strains in microbiomes with ChronoStrain.使用ChronoStrain对微生物群落中低丰度菌株进行纵向分析。
Nat Microbiol. 2025 May;10(5):1184-1197. doi: 10.1038/s41564-025-01983-z. Epub 2025 May 6.
5
Bracken: estimating species abundance in metagenomics data.蕨类植物:宏基因组学数据中物种丰度的估计
PeerJ Comput Sci. 2017;3. doi: 10.7717/peerj-cs.104. Epub 2017 Jan 2.
6
Addressing the dynamic nature of reference data: a new nucleotide database for robust metagenomic classification.应对参考数据的动态特性:一个用于可靠宏基因组分类的新核苷酸数据库。
mSystems. 2025 Apr 22;10(4):e0123924. doi: 10.1128/msystems.01239-24. Epub 2025 Mar 20.
7
Identification of novel fructo-oligosaccharide bacterial consumers by pulse metatranscriptomics in a human stool sample.通过脉冲宏转录组学在一份人类粪便样本中鉴定新型低聚果糖细菌消费者。
mSphere. 2025 Jan 28;10(1):e0066824. doi: 10.1128/msphere.00668-24. Epub 2024 Dec 19.
8
Alevin-fry-atac enables rapid and memory frugal mapping of single-cell ATAC-seq data using virtual colors for accurate genomic pseudoalignment.Alevin-fry-atac可通过使用虚拟颜色进行准确的基因组伪比对,实现对单细胞ATAC-seq数据的快速且节省内存的图谱绘制。
bioRxiv. 2025 Mar 12:2024.11.27.625771. doi: 10.1101/2024.11.27.625771.
9
Where the Patterns Are: Repetition-Aware Compression for Colored de Bruijn Graphs.模式所在:带重复感知的彩色 de Bruijn 图压缩。
J Comput Biol. 2024 Oct;31(10):1022-1044. doi: 10.1089/cmb.2024.0714. Epub 2024 Oct 9.
10
A microbiome-directed therapeutic food for children recovering from severe acute malnutrition.一种针对严重急性营养不良康复期儿童的微生物群定向治疗性食品。
Sci Transl Med. 2024 Oct 2;16(767):eadn2366. doi: 10.1126/scitranslmed.adn2366.

本文引用的文献

1
Mash: fast genome and metagenome distance estimation using MinHash.Mash:使用MinHash进行快速的基因组和宏基因组距离估计。
Genome Biol. 2016 Jun 20;17(1):132. doi: 10.1186/s13059-016-0997-x.
2
Near-optimal probabilistic RNA-seq quantification.近乎最优的概率 RNA-seq 定量。
Nat Biotechnol. 2016 May;34(5):525-7. doi: 10.1038/nbt.3519. Epub 2016 Apr 4.
3
An evaluation of the accuracy and speed of metagenome analysis tools.宏基因组分析工具的准确性和速度评估。
Sci Rep. 2016 Jan 18;6:19233. doi: 10.1038/srep19233.
4
Rapid antibiotic-resistance predictions from genome sequence data for Staphylococcus aureus and Mycobacterium tuberculosis.通过金黄色葡萄球菌和结核分枝杆菌的基因组序列数据进行快速抗生素耐药性预测。
Nat Commun. 2015 Dec 21;6:10063. doi: 10.1038/ncomms10063.
5
Ensembl Genomes 2016: more genomes, more complexity.《Ensembl基因组2016:更多基因组,更多复杂性》
Nucleic Acids Res. 2016 Jan 4;44(D1):D574-80. doi: 10.1093/nar/gkv1209. Epub 2015 Nov 17.
6
Insights from 20 years of bacterial genome sequencing.20 年细菌基因组测序的启示。
Funct Integr Genomics. 2015 Mar;15(2):141-61. doi: 10.1007/s10142-015-0433-4. Epub 2015 Feb 27.
7
Trimmomatic: a flexible trimmer for Illumina sequence data.Trimmomatic:一款适用于 Illumina 测序数据的灵活修剪工具。
Bioinformatics. 2014 Aug 1;30(15):2114-20. doi: 10.1093/bioinformatics/btu170. Epub 2014 Apr 1.
8
Kraken: ultrafast metagenomic sequence classification using exact alignments.克拉肯:使用精确比对的超快速宏基因组序列分类
Genome Biol. 2014 Mar 3;15(3):R46. doi: 10.1186/gb-2014-15-3-r46.
9
Differential abundance analysis for microbial marker-gene surveys.微生物标记基因调查的差异丰度分析。
Nat Methods. 2013 Dec;10(12):1200-2. doi: 10.1038/nmeth.2658. Epub 2013 Sep 29.
10
Shigella strains are not clones of Escherichia coli but sister species in the genus Escherichia.志贺氏菌菌株不是大肠杆菌的克隆体,而是大肠杆菌属中的姐妹种。
Genomics Proteomics Bioinformatics. 2013 Feb;11(1):61-5. doi: 10.1016/j.gpb.2012.11.002. Epub 2012 Dec 29.

用于宏基因组读分配的伪比对。

Pseudoalignment for metagenomic read assignment.

机构信息

Department of Molecular and Cell Biology, UC Berkeley, Berkeley, CA, USA.

Department of Genetics, Stanford University, Stanford, CA, USA.

出版信息

Bioinformatics. 2017 Jul 15;33(14):2082-2088. doi: 10.1093/bioinformatics/btx106.

DOI:10.1093/bioinformatics/btx106
PMID:28334086
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5870846/
Abstract

MOTIVATION

Read assignment is an important first step in many metagenomic analysis workflows, providing the basis for identification and quantification of species. However ambiguity among the sequences of many strains makes it difficult to assign reads at the lowest level of taxonomy, and reads are typically assigned to taxonomic levels where they are unambiguous. We explore connections between metagenomic read assignment and the quantification of transcripts from RNA-Seq data in order to develop novel methods for rapid and accurate quantification of metagenomic strains.

RESULTS

We find that the recent idea of pseudoalignment introduced in the RNA-Seq context is highly applicable in the metagenomics setting. When coupled with the Expectation-Maximization (EM) algorithm, reads can be assigned far more accurately and quickly than is currently possible with state of the art software, making it possible and practical for the first time to analyze abundances of individual genomes in metagenomics projects.

AVAILABILITY AND IMPLEMENTATION

Pipeline and analysis code can be downloaded from http://github.com/pachterlab/metakallisto.

CONTACT

lpachter@math.berkeley.edu.

摘要

动机

读分配是许多宏基因组分析工作流程中的重要第一步,为物种的鉴定和定量提供了基础。然而,许多菌株的序列之间存在歧义,使得在最低分类学水平上难以分配读取,并且读取通常被分配到没有歧义的分类学水平。我们探索了宏基因组读取分配与 RNA-Seq 数据中转录物定量之间的联系,以便开发用于快速准确定量宏基因组菌株的新方法。

结果

我们发现,在 RNA-Seq 上下文中引入的伪比对的最新思想在宏基因组学环境中具有高度的适用性。当与期望最大化 (EM) 算法结合使用时,与当前最先进的软件相比,读取可以更准确和快速地分配,从而首次有可能在宏基因组学项目中分析单个基因组的丰度。

可用性和实现

管道和分析代码可从 http://github.com/pachterlab/metakallisto 下载。

联系

lpachter@math.berkeley.edu。