smartSim: simulation of splice aware single cell smart-seq3 data.

Authors

Van Hecke Marie, Marchal Kathleen

Affiliations

IDLab, Department of Information Technology, Ghent University-imec, 9052 Ghent, Belgium.

Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.

Publication information

Bioinform Adv. 2025 Jul 30;5(1):vbaf183. doi: 10.1093/bioadv/vbaf183. eCollection 2025.

DOI:10.1093/bioadv/vbaf183
PMID:40861396
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12373632/
Abstract

MOTIVATION

Smart-seq3 is a powerful full-length single-cell RNA sequencing protocol that enables transcript-level quantification and splicing analysis by preserving unique molecular identifier (UMI) information. However, benchmarking computational tools for isoform reconstruction and splicing quantification remains challenging due to the lack of ground truth datasets. Herein, we present smartSim, a Smart-seq3 read simulator designed to generate realistic sequencing data that accurately reflects the complexities of single-cell transcriptomics.

RESULTS

smartSim simulates known and novel splicing events, generates both UMI-containing and internal reads, and mimics protocol-specific biases by leveraging empirical data distributions. Our results show that smartSim-generated data closely resembles real Smart-seq3 datasets in terms of fragment length distributions, internal read counts, and read quality scores. It generates raw sequencing reads in FASTQ format, making it compatible with both genome- and transcriptome-based alignment tools. By extending simulation beyond gene-level quantification, smartSim provides a crucial resource for evaluating and improving computational methods for alternative splicing detection and isoform reconstruction in single-cell RNA sequencing.
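To make the distinction between UMI-containing and internal reads concrete, the following is a minimal toy sketch in Python. It is not smartSim's implementation and does not use its interface. All function names are hypothetical, the 11-bp tag, 8-bp UMI and GGG spacer layout is an assumption about the Smart-seq3 read structure that the abstract does not state, and the constant quality string is a placeholder where a real simulator would draw qualities from empirical data.

```python
import random

BASES = "ACGT"
TAG = "ATTGCGCAATG"   # assumed Smart-seq3 tag preceding the UMI (not given in the abstract)
UMI_LEN = 8           # assumed UMI length


def simulate_read(transcript, observed_frag_lengths, read_len, is_umi_read, rng):
    """Return (sequence, quality) for one simulated single-end read."""
    # Draw a fragment length from observed values, i.e. an empirical distribution.
    frag_len = min(rng.choice(observed_frag_lengths), len(transcript))
    if is_umi_read:
        # UMI reads start at the 5' end of the transcript and carry tag + UMI + GGG.
        start = 0
        umi = "".join(rng.choice(BASES) for _ in range(UMI_LEN))
        prefix = TAG + umi + "GGG"
    else:
        # Internal reads start anywhere along the transcript and carry no UMI.
        start = rng.randint(0, len(transcript) - frag_len)
        prefix = ""
    fragment = transcript[start:start + frag_len]
    seq = (prefix + fragment)[:read_len]
    qual = "I" * len(seq)  # placeholder: constant Phred 40, not an empirical profile
    return seq, qual


def to_fastq(name, seq, qual):
    """Format one read as a four-line FASTQ record."""
    return f"@{name}\n{seq}\n+\n{qual}\n"


def simulate_cell(transcript, observed_frag_lengths, n_reads, umi_fraction,
                  read_len=50, seed=0):
    """Simulate n_reads reads from one transcript as a list of FASTQ records."""
    rng = random.Random(seed)
    records = []
    for i in range(n_reads):
        is_umi = rng.random() < umi_fraction
        seq, qual = simulate_read(transcript, observed_frag_lengths,
                                  read_len, is_umi, rng)
        kind = "umi" if is_umi else "internal"
        records.append(to_fastq(f"read{i}_{kind}", seq, qual))
    return records


if __name__ == "__main__":
    toy_transcript = "ACGT" * 50
    for record in simulate_cell(toy_transcript, [60, 80, 120], n_reads=4,
                                umi_fraction=0.5):
        print(record, end="")
```

The sketch covers only the read-type split and the FASTQ output. Splicing events, paired-end reads, sequencing errors and protocol-specific biases are left out.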

AVAILABILITY AND IMPLEMENTATION

smartSim is available at https://github.com/MarchalLab/smartSim.
