• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

mKmer:微生物单微生物RNA测序数据的无偏K-mer嵌入

mKmer: an unbiased K-mer embedding of microbiomic single-microbe RNA sequencing data.

作者信息

Mo Fangyu, Qian Qinghong, Lu Xiaolin, Zheng Dihuai, Cai Wenjie, Yao Jie, Chen Hongyu, Huang Yujie, Zhang Xiang, Wu Sanling, Shen Yifei, Bai Yinqi, Wang Yongcheng, Jiang Weiqin, Fan Longjiang

机构信息

Hainan Institute, Zhejiang University, Zhenzhou Road, Yazhou Bay Science and Technology City, Yazhou District, Sanya 572025, Hainan Province, China.

Institute of Crop Science, Zhejiang University, 866 Yuhangtang Road, Xihu District, Hangzhou 310058, Zhejiang Province, China.

出版信息

Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf227.

DOI:10.1093/bib/bbaf227
PMID:40407385
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12100620/
Abstract

The advanced single-microbe RNA sequencing (smRNA-seq) technique addresses the pressing need to understand the complexity and diversity of microbial communities, as well as the distinct microbial states defined by different gene expression profiles. Current analyses of smRNA-seq data heavily rely on the integrity of reference genomes within the queried microbiota. However, establishing a comprehensive collection of microbial reference genomes or gene sets remains a significant challenge for most real-world microbial ecosystems. Here, we developed an unbiased embedding algorithm utilizing K-mer signatures, named mKmer, which bypasses gene or genome alignment to enable species identification for individual microbes and downstream functional enrichment analysis. By substituting gene features in the canonical cell-by-gene matrix with highly conserved K-mers, we demonstrate that mKmer outperforms gene-based methods in clustering and motif inference tasks using benchmark datasets from crop soil and human gut microbiomes. Our method provides a reference genome-free analytical framework for advancing smRNA-seq studies.

摘要

先进的单微生物RNA测序(smRNA-seq)技术满足了人们迫切需要,即了解微生物群落的复杂性和多样性,以及由不同基因表达谱定义的独特微生物状态。目前对smRNA-seq数据的分析严重依赖于所查询微生物群中参考基因组的完整性。然而,对于大多数实际的微生物生态系统而言,建立全面的微生物参考基因组或基因集仍然是一项重大挑战。在此,我们开发了一种利用K-mer特征的无偏嵌入算法,名为mKmer,它绕过基因或基因组比对,能够对单个微生物进行物种鉴定并进行下游功能富集分析。通过用高度保守的K-mer替代标准基因对细胞矩阵中的基因特征,我们证明在使用来自作物土壤和人类肠道微生物群的基准数据集进行聚类和基序推断任务时,mKmer优于基于基因的方法。我们的方法为推进smRNA-seq研究提供了一个无需参考基因组的分析框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c926/12100620/4482b7f8a805/bbaf227f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c926/12100620/d87fdba21f04/bbaf227f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c926/12100620/e9827847b115/bbaf227f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c926/12100620/f6ef307f266c/bbaf227f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c926/12100620/2cd3124ccc6a/bbaf227f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c926/12100620/91c0d1d76c73/bbaf227f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c926/12100620/4482b7f8a805/bbaf227f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c926/12100620/d87fdba21f04/bbaf227f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c926/12100620/e9827847b115/bbaf227f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c926/12100620/f6ef307f266c/bbaf227f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c926/12100620/2cd3124ccc6a/bbaf227f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c926/12100620/91c0d1d76c73/bbaf227f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c926/12100620/4482b7f8a805/bbaf227f6.jpg

相似文献

1
mKmer: an unbiased K-mer embedding of microbiomic single-microbe RNA sequencing data.mKmer:微生物单微生物RNA测序数据的无偏K-mer嵌入
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf227.
2
High-throughput single-microbe RNA sequencing reveals adaptive state heterogeneity and host-phage activity associations in human gut microbiome.高通量单微生物RNA测序揭示了人类肠道微生物群中的适应性状态异质性和宿主-噬菌体活性关联。
Protein Cell. 2025 Mar 8;16(3):211-226. doi: 10.1093/procel/pwae027.
3
16S rRNA sequence embeddings: Meaningful numeric feature representations of nucleotide sequences that are convenient for downstream analyses.16S rRNA 序列嵌入:核苷酸序列有意义的数值特征表示形式,方便下游分析。
PLoS Comput Biol. 2019 Feb 26;15(2):e1006721. doi: 10.1371/journal.pcbi.1006721. eCollection 2019 Feb.
4
Assessment of k-mer spectrum applicability for metagenomic dissimilarity analysis.用于宏基因组差异分析的k-mer谱适用性评估。
BMC Bioinformatics. 2016 Jan 16;17:38. doi: 10.1186/s12859-015-0875-7.
5
Integrating sequence composition information into microbial diversity analyses with k-mer frequency counting.通过k-mer频率计数将序列组成信息整合到微生物多样性分析中。
mSystems. 2025 Mar 18;10(3):e0155024. doi: 10.1128/msystems.01550-24. Epub 2025 Feb 20.
6
Estimating the total genome length of a metagenomic sample using k-mers.利用 k- -mer 估算宏基因组样本的总基因组长度。
BMC Genomics. 2019 Apr 4;20(Suppl 2):183. doi: 10.1186/s12864-019-5467-x.
7
Metatranscriptomic analysis of diverse microbial communities reveals core metabolic pathways and microbiome-specific functionality.宏转录组分析揭示了不同微生物群落的核心代谢途径和微生物组特有的功能。
Microbiome. 2016 Jan 12;4:2. doi: 10.1186/s40168-015-0146-x.
8
K-mer-Based Motif Analysis in Insect Species across , , and Genera and Its Application to Species Classification.基于 K- -mer 的昆虫种、属和科的基序分析及其在物种分类中的应用。
Comput Math Methods Med. 2019 Nov 15;2019:4259479. doi: 10.1155/2019/4259479. eCollection 2019.
9
TSEE: an elastic embedding method to visualize the dynamic gene expression patterns of time series single-cell RNA sequencing data.TSEE:一种弹性嵌入方法,用于可视化时间序列单细胞 RNA 测序数据的动态基因表达模式。
BMC Genomics. 2019 Apr 4;20(Suppl 2):224. doi: 10.1186/s12864-019-5477-8.
10
PLEK: a tool for predicting long non-coding RNAs and messenger RNAs based on an improved k-mer scheme.PLEK:一种基于改进的k-mer方案预测长链非编码RNA和信使RNA的工具。
BMC Bioinformatics. 2014 Sep 19;15(1):311. doi: 10.1186/1471-2105-15-311.

本文引用的文献

1
Bracken: estimating species abundance in metagenomics data.蕨类植物:宏基因组学数据中物种丰度的估计
PeerJ Comput Sci. 2017;3. doi: 10.7717/peerj-cs.104. Epub 2017 Jan 2.
2
Complete genome sequence of strain RI1, isolated from human feces.从人类粪便中分离出的RI1菌株的全基因组序列。
Microbiol Resour Announc. 2024 Nov 12;13(11):e0086324. doi: 10.1128/mra.00863-24. Epub 2024 Sep 30.
3
Bioprospecting marine microbial genomes to improve biotechnology.生物勘探海洋微生物基因组以改进生物技术。
Nature. 2024 Sep;633(8029):287-288. doi: 10.1038/d41586-024-02661-6.
4
SeqKit2: A Swiss army knife for sequence and alignment processing.SeqKit2:一款用于序列和比对处理的瑞士军刀式工具。
Imeta. 2024 Apr 5;3(3):e191. doi: 10.1002/imt2.191. eCollection 2024 Jun.
5
Metabolite profiling of human-originated Lachnospiraceae at the strain level.人类源毛螺菌科菌株水平的代谢物谱分析。
Imeta. 2022 Oct 13;1(4):e58. doi: 10.1002/imt2.58. eCollection 2022 Dec.
6
High-throughput single-microbe RNA sequencing reveals adaptive state heterogeneity and host-phage activity associations in human gut microbiome.高通量单微生物RNA测序揭示了人类肠道微生物群中的适应性状态异质性和宿主-噬菌体活性关联。
Protein Cell. 2025 Mar 8;16(3):211-226. doi: 10.1093/procel/pwae027.
7
Metabuli: sensitive and specific metagenomic classification via joint analysis of amino acid and DNA.Metabuli:通过联合分析氨基酸和 DNA 进行敏感和特异的宏基因组分类。
Nat Methods. 2024 Jun;21(6):971-973. doi: 10.1038/s41592-024-02273-y. Epub 2024 May 20.
8
Scalable and unbiased sequence-informed embedding of single-cell ATAC-seq data with CellSpace.使用 CellSpace 对单细胞 ATAC-seq 数据进行可扩展且无偏的基于序列的嵌入。
Nat Methods. 2024 Jun;21(6):1014-1022. doi: 10.1038/s41592-024-02274-x. Epub 2024 May 9.
9
Toward universal cell embeddings: integrating single-cell RNA-seq datasets across species with SATURN.迈向通用细胞嵌入:使用SATURN整合跨物种的单细胞RNA测序数据集。
Nat Methods. 2024 Aug;21(8):1492-1500. doi: 10.1038/s41592-024-02191-z. Epub 2024 Feb 16.
10
Multi-omics of the gut microbial ecosystem in patients with microsatellite-instability-high gastrointestinal cancer resistant to immunotherapy.免疫治疗耐药的微卫星不稳定高胃肠道癌患者肠道微生物生态系统的多组学研究。
Cell Rep Med. 2024 Jan 16;5(1):101355. doi: 10.1016/j.xcrm.2023.101355. Epub 2024 Jan 8.