• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

zUMIs - 一个快速灵活的带有 UMIs 的 RNA 测序数据处理流水线。

zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs.

机构信息

Anthropology and Human Genomics, Department of Biology II, Ludwig-Maximilians University, Grosshaderner Str. 2, 82152 Martinsried, Germany.

出版信息

Gigascience. 2018 Jun 1;7(6). doi: 10.1093/gigascience/giy059.

DOI:10.1093/gigascience/giy059
PMID:29846586
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6007394/
Abstract

BACKGROUND

Single-cell RNA-sequencing (scRNA-seq) experiments typically analyze hundreds or thousands of cells after amplification of the cDNA. The high throughput is made possible by the early introduction of sample-specific bar codes (BCs), and the amplification bias is alleviated by unique molecular identifiers (UMIs). Thus, the ideal analysis pipeline for scRNA-seq data needs to efficiently tabulate reads according to both BC and UMI.

FINDINGS

zUMIs is a pipeline that can handle both known and random BCs and also efficiently collapse UMIs, either just for exon mapping reads or for both exon and intron mapping reads. If BC annotation is missing, zUMIs can accurately detect intact cells from the distribution of sequencing reads. Another unique feature of zUMIs is the adaptive downsampling function that facilitates dealing with hugely varying library sizes but also allows the user to evaluate whether the library has been sequenced to saturation. To illustrate the utility of zUMIs, we analyzed a single-nucleus RNA-seq dataset and show that more than 35% of all reads map to introns. Also, we show that these intronic reads are informative about expression levels, significantly increasing the number of detected genes and improving the cluster resolution.

CONCLUSIONS

zUMIs flexibility makes if possible to accommodate data generated with any of the major scRNA-seq protocols that use BCs and UMIs and is the most feature-rich, fast, and user-friendly pipeline to process such scRNA-seq data.

摘要

背景

单细胞 RNA 测序 (scRNA-seq) 实验通常在 cDNA 扩增后分析数百或数千个细胞。通过早期引入样本特异性条形码 (BC) 实现高通量,通过独特的分子标识符 (UMI) 缓解扩增偏差。因此,scRNA-seq 数据的理想分析流程需要根据 BC 和 UMI 有效地对读取进行制表。

发现

zUMIs 是一个能够处理已知和随机 BC 的管道,并且还可以有效地合并 UMI,无论是用于外显子映射读取还是用于外显子和内含子映射读取。如果缺少 BC 注释,zUMIs 可以根据测序读取的分布准确检测完整的细胞。zUMIs 的另一个独特功能是自适应下采样功能,它便于处理差异极大的文库大小,同时允许用户评估文库是否已测序饱和。为了说明 zUMIs 的实用性,我们分析了一个单核 RNA-seq 数据集,结果表明超过 35%的所有读取都映射到内含子上。此外,我们还表明这些内含子读取可提供有关表达水平的信息,显著增加了检测到的基因数量,并提高了聚类分辨率。

结论

zUMIs 的灵活性使其能够适应使用 BC 和 UMI 的任何主要 scRNA-seq 协议生成的数据,并且是处理此类 scRNA-seq 数据最具特色、最快、最用户友好的管道。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6c2/6007394/43a5ae34250a/giy059fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6c2/6007394/4d66d9128b37/giy059fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6c2/6007394/67f278499810/giy059fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6c2/6007394/9073aca8caa5/giy059fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6c2/6007394/43a5ae34250a/giy059fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6c2/6007394/4d66d9128b37/giy059fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6c2/6007394/67f278499810/giy059fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6c2/6007394/9073aca8caa5/giy059fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6c2/6007394/43a5ae34250a/giy059fig4.jpg

相似文献

1
zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs.zUMIs - 一个快速灵活的带有 UMIs 的 RNA 测序数据处理流水线。
Gigascience. 2018 Jun 1;7(6). doi: 10.1093/gigascience/giy059.
2
Quantile normalization of single-cell RNA-seq read counts without unique molecular identifiers.无唯一分子标识符的单细胞 RNA-seq 读数的分位数归一化。
Genome Biol. 2020 Jul 3;21(1):160. doi: 10.1186/s13059-020-02078-0.
3
Gene length and detection bias in single cell RNA sequencing protocols.单细胞RNA测序方案中的基因长度与检测偏差
F1000Res. 2017 Apr 28;6:595. doi: 10.12688/f1000research.11290.1. eCollection 2017.
4
Full-Length Single-Cell RNA-Sequencing with FLASH-seq.使用FLASH-seq进行全长单细胞RNA测序。
Methods Mol Biol. 2023;2584:123-164. doi: 10.1007/978-1-0716-2756-3_5.
5
Detecting differential alternative splicing events in scRNA-seq with or without Unique Molecular Identifiers.在有或没有独特分子标识符的 scRNA-seq 中检测差异剪接事件。
PLoS Comput Biol. 2020 Jun 5;16(6):e1007925. doi: 10.1371/journal.pcbi.1007925. eCollection 2020 Jun.
6
Insufficiently complex unique-molecular identifiers (UMIs) distort small RNA sequencing.非充分复杂的唯一分子标识符(UMIs)会扭曲小 RNA 测序结果。
Sci Rep. 2020 Sep 3;10(1):14593. doi: 10.1038/s41598-020-71323-0.
7
Estimating the Allele-Specific Expression of SNVs From 10× Genomics Single-Cell RNA-Sequencing Data.从 10× 基因组单细胞 RNA 测序数据估计单核苷酸变异的等位基因特异性表达。
Genes (Basel). 2020 Feb 25;11(3):240. doi: 10.3390/genes11030240.
8
Je, a versatile suite to handle multiplexed NGS libraries with unique molecular identifiers.Je是一个多功能套件,用于处理带有独特分子标识符的多重NGS文库。
BMC Bioinformatics. 2016 Oct 8;17(1):419. doi: 10.1186/s12859-016-1284-2.
9
Alevin efficiently estimates accurate gene abundances from dscRNA-seq data.Alevin 能够有效地从 dscRNA-seq 数据中估计准确的基因丰度。
Genome Biol. 2019 Mar 27;20(1):65. doi: 10.1186/s13059-019-1670-y.
10
Elimination of PCR duplicates in RNA-seq and small RNA-seq using unique molecular identifiers.利用独特分子标识符消除 RNA-seq 和 small RNA-seq 中的 PCR 重复。
BMC Genomics. 2018 Jul 13;19(1):531. doi: 10.1186/s12864-018-4933-1.

引用本文的文献

1
smartSim: simulation of splice aware single cell smart-seq3 data.smartSim:剪接感知单细胞smart-seq3数据的模拟
Bioinform Adv. 2025 Jul 30;5(1):vbaf183. doi: 10.1093/bioadv/vbaf183. eCollection 2025.
2
Establishment of chromatin architecture interplays with embryo hypertranscription.染色质结构的建立与胚胎超转录相互作用。
Nature. 2025 Aug 13. doi: 10.1038/s41586-025-09400-5.
3
Protocol update to: High-throughput scNMT protocol for multiomics profiling of single cells from mouse brain and pancreatic organoids.方案更新为:用于小鼠脑和胰腺类器官单细胞多组学分析的高通量scNMT方案。

本文引用的文献

1
scPipe: A flexible R/Bioconductor preprocessing pipeline for single-cell RNA-sequencing data.scPipe:用于单细胞 RNA 测序数据的灵活 R/Bioconductor 预处理流水线。
PLoS Comput Biol. 2018 Aug 10;14(8):e1006361. doi: 10.1371/journal.pcbi.1006361. eCollection 2018 Aug.
2
Sensitive and powerful single-cell RNA sequencing using mcSCRB-seq.使用 mcSCRB-seq 进行灵敏且强大的单细胞 RNA 测序。
Nat Commun. 2018 Jul 26;9(1):2937. doi: 10.1038/s41467-018-05347-6.
3
dropEst: pipeline for accurate estimation of molecular counts in droplet-based single-cell RNA-seq experiments.
STAR Protoc. 2025 Jul 24;6(3):103980. doi: 10.1016/j.xpro.2025.103980.
4
Deciphering direct transcriptional effects of epigenetic compounds through large-scale new RNA profiling.通过大规模新RNA分析解读表观遗传化合物的直接转录效应。
Nat Commun. 2025 Jul 18;16(1):6629. doi: 10.1038/s41467-025-61769-z.
5
Synthbar: A Lightweight Tool for Adding Synthetic Barcodes to Sequencing Reads.合成条形码生成器:一种用于向测序读数添加合成条形码的轻量级工具。
bioRxiv. 2025 Jun 2:2025.05.30.657070. doi: 10.1101/2025.05.30.657070.
6
Loss of UBE3A impacts both neuronal and non-neuronal cells in human cerebral organoids.UBE3A的缺失会影响人类大脑类器官中的神经元细胞和非神经元细胞。
Commun Biol. 2025 May 30;8(1):838. doi: 10.1038/s42003-025-08262-x.
7
nf-core/marsseq: systematic preprocessing pipeline for MARS-seq experiments.nf-core/marsseq:用于MARS-seq实验的系统预处理流程。
Bioinform Adv. 2025 May 23;5(1):vbaf089. doi: 10.1093/bioadv/vbaf089. eCollection 2025.
8
TIRE-seq simplifies transcriptomics via integrated RNA capture and library preparation.TIRE-seq通过整合RNA捕获和文库制备简化了转录组学。
Sci Rep. 2025 May 2;15(1):15385. doi: 10.1038/s41598-025-98282-8.
9
Construction of edit-distance graphs for large sets of short reads through minimizer-bucketing.通过最小化器分桶为大量短读段构建编辑距离图。
Bioinform Adv. 2025 Apr 10;5(1):vbaf081. doi: 10.1093/bioadv/vbaf081. eCollection 2025.
10
Targeting CD38 immunometabolic checkpoint improves metabolic fitness and cognition in a mouse model of Alzheimer's disease.靶向CD38免疫代谢检查点可改善阿尔茨海默病小鼠模型的代谢适应性和认知能力。
Nat Commun. 2025 Apr 20;16(1):3736. doi: 10.1038/s41467-025-58494-y.
dropEst:基于液滴的单细胞 RNA-seq 实验中分子计数的精确估计的流水线。
Genome Biol. 2018 Jun 19;19(1):78. doi: 10.1186/s13059-018-1449-6.
4
Integrating single-cell transcriptomic data across different conditions, technologies, and species.整合不同条件、技术和物种的单细胞转录组数据。
Nat Biotechnol. 2018 Jun;36(5):411-420. doi: 10.1038/nbt.4096. Epub 2018 Apr 2.
5
Quantitative single-cell transcriptomics.定量单细胞转录组学。
Brief Funct Genomics. 2018 Jul 1;17(4):220-232. doi: 10.1093/bfgp/ely009.
6
Single-cell profiling of the developing mouse brain and spinal cord with split-pool barcoding.单细胞分析发育中的鼠脑和脊髓的分裂池条形码技术。
Science. 2018 Apr 13;360(6385):176-182. doi: 10.1126/science.aam8999. Epub 2018 Mar 15.
7
Bias, robustness and scalability in single-cell differential expression analysis.单细胞差异表达分析中的偏差、稳健性和可扩展性。
Nat Methods. 2018 Apr;15(4):255-261. doi: 10.1038/nmeth.4612. Epub 2018 Feb 26.
8
Mapping the Mouse Cell Atlas by Microwell-Seq.通过微室测序绘制小鼠细胞图谱。
Cell. 2018 Feb 22;172(5):1091-1107.e17. doi: 10.1016/j.cell.2018.02.001.
9
The Human Cell Atlas.人类细胞图谱
Elife. 2017 Dec 5;6:e27041. doi: 10.7554/eLife.27041.
10
STRT-seq-2i: dual-index 5' single cell and nucleus RNA-seq on an addressable microwell array.STRT-seq-2i:基于寻址微井阵列的双索引 5' 单细胞和核 RNA-seq。
Sci Rep. 2017 Nov 27;7(1):16327. doi: 10.1038/s41598-017-16546-4.