• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SCSsim:一种用于模拟单细胞基因组测序数据的集成工具。

SCSsim: an integrated tool for simulating single-cell genome sequencing data.

机构信息

Department of Software Engineering, Ningxia University, Yinchuan 750021, China.

Department of Electronic Science and Technology, University of Science and Technology of China, Hefei 230027, China.

出版信息

Bioinformatics. 2020 Feb 15;36(4):1281-1282. doi: 10.1093/bioinformatics/btz713.

DOI:10.1093/bioinformatics/btz713
PMID:31584615
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7703785/
Abstract

MOTIVATION

Allele dropout (ADO) and unbalanced amplification of alleles are main technical issues of single-cell sequencing (SCS), and effectively emulating these issues is necessary for reliably benchmarking SCS-based bioinformatics tools. Unfortunately, currently available sequencing simulators are free of whole-genome amplification involved in SCS technique and therefore not suited for generating SCS datasets. We develop a new software package (SCSsim) that can efficiently simulate SCS datasets in a parallel fashion with minimal user intervention. SCSsim first constructs the genome sequence of single cell by mimicking a complement of genomic variations under user-controlled manner, and then amplifies the genome according to MALBAC technique and finally yields sequencing reads from the amplified products based on inferred sequencing profiles. Comprehensive evaluation in simulating different ADO rates, variation detection efficiency and genome coverage demonstrates that SCSsim is a very useful tool in mimicking single-cell sequencing data with high efficiency.

AVAILABILITY AND IMPLEMENTATION

SCSsim is freely available at https://github.com/qasimyu/scssim.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

等位基因丢失 (ADO) 和等位基因不平衡扩增是单细胞测序 (SCS) 的主要技术问题,有效地模拟这些问题对于可靠地对基于 SCS 的生物信息学工具进行基准测试是必要的。不幸的是,目前可用的测序模拟器不涉及 SCS 技术中的全基因组扩增,因此不适合生成 SCS 数据集。我们开发了一个新的软件包 (SCSsim),可以以最小的用户干预以并行方式有效地模拟 SCS 数据集。SCSsim 首先通过模拟用户控制下的基因组变异来构建单细胞的基因组序列,然后根据 MALBAC 技术对基因组进行扩增,最后根据推断的测序谱从扩增产物中生成测序reads。在模拟不同的 ADO 率、变异检测效率和基因组覆盖度方面的综合评估表明,SCSsim 是一种非常有用的工具,可高效模拟单细胞测序数据。

可用性和实现

SCSsim 可在 https://github.com/qasimyu/scssim 上免费获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/158d/7703785/b8bace978a8d/btz713f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/158d/7703785/b8bace978a8d/btz713f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/158d/7703785/b8bace978a8d/btz713f1.jpg

相似文献

1
SCSsim: an integrated tool for simulating single-cell genome sequencing data.SCSsim:一种用于模拟单细胞基因组测序数据的集成工具。
Bioinformatics. 2020 Feb 15;36(4):1281-1282. doi: 10.1093/bioinformatics/btz713.
2
Haplotype phasing in single-cell DNA-sequencing data.单细胞 DNA 测序数据中的单倍型相位。
Bioinformatics. 2018 Jul 1;34(13):i211-i217. doi: 10.1093/bioinformatics/bty286.
3
Simulating Illumina metagenomic data with InSilicoSeq.用 InSilicoSeq 模拟 Illumina 宏基因组数据。
Bioinformatics. 2019 Feb 1;35(3):521-522. doi: 10.1093/bioinformatics/bty630.
4
Lep-MAP3: robust linkage mapping even for low-coverage whole genome sequencing data.Lep-MAP3:即使对于低覆盖度的全基因组测序数据,也能实现稳健的连锁图谱构建。
Bioinformatics. 2017 Dec 1;33(23):3726-3732. doi: 10.1093/bioinformatics/btx494.
5
Novel algorithms for efficient subsequence searching and mapping in nanopore raw signals towards targeted sequencing.针对靶向测序的新型算法,用于纳米孔原始信号中的高效子序列搜索和映射。
Bioinformatics. 2020 Mar 1;36(5):1333-1343. doi: 10.1093/bioinformatics/btz742.
6
Nubeam-dedup: a fast and RAM-efficient tool to de-duplicate sequencing reads without mapping.Nubeam-dedup:一款快速且节省内存的去重工具,无需进行测序读取映射。
Bioinformatics. 2020 May 1;36(10):3254-3256. doi: 10.1093/bioinformatics/btaa112.
7
Structural variant analysis for linked-read sequencing data with gemtools.使用 gemtools 进行链接读取测序数据的结构变异分析。
Bioinformatics. 2019 Nov 1;35(21):4397-4399. doi: 10.1093/bioinformatics/btz239.
8
rMETL: sensitive mobile element insertion detection with long read realignment.rMETL:使用长读重-align 进行敏感的移动元件插入检测。
Bioinformatics. 2019 Sep 15;35(18):3484-3486. doi: 10.1093/bioinformatics/btz106.
9
lordFAST: sensitive and Fast Alignment Search Tool for LOng noisy Read sequencing Data.lordFAST:用于长噪声测序数据的敏感快速比对搜索工具。
Bioinformatics. 2019 Jan 1;35(1):20-27. doi: 10.1093/bioinformatics/bty544.
10
Simulating the dynamics of targeted capture sequencing with CapSim.使用 CapSim 模拟靶向捕获测序的动力学。
Bioinformatics. 2018 Mar 1;34(5):873-874. doi: 10.1093/bioinformatics/btx691.

引用本文的文献

1
scTCA: a hybrid Transformer-CNN architecture for imputation and denoising of scDNA-seq data.scTCA:一种用于 scDNA-seq 数据插补和去噪的混合 Transformer-CNN 架构。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae577.
2
CoT: a transformer-based method for inferring tumor clonal copy number substructure from scDNA-seq data.CoT:一种基于 Transformer 的方法,用于从 scDNA-seq 数据推断肿瘤克隆拷贝数亚结构。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae187.
3
Inferring single-cell copy number profiles through cross-cell segmentation of read counts.

本文引用的文献

1
Single-cell mutation identification via phylogenetic inference.通过系统发育推断进行单细胞突变鉴定。
Nat Commun. 2018 Dec 4;9(1):5144. doi: 10.1038/s41467-018-07627-7.
2
Computational enhancement of single-cell sequences for inferring tumor evolution.计算增强单细胞序列以推断肿瘤进化。
Bioinformatics. 2018 Sep 1;34(17):i917-i926. doi: 10.1093/bioinformatics/bty571.
3
Intelligent image-based in situ single-cell isolation.基于智能图像的原位单细胞分离。
通过读取计数的跨细胞分割推断单细胞拷贝数谱。
BMC Genomics. 2024 Jan 2;25(1):25. doi: 10.1186/s12864-023-09901-5.
4
CNAsim: improved simulation of single-cell copy number profiles and DNA-seq data from tumors.CNAsim:改进了单细胞拷贝数谱和肿瘤 DNA-seq 数据的模拟。
Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad434.
5
SCSilicon: a tool for synthetic single-cell DNA sequencing data generation.SCSilicon:一种用于合成单细胞 DNA 测序数据生成的工具。
BMC Genomics. 2022 May 11;23(Suppl 4):359. doi: 10.1186/s12864-022-08566-w.
6
Editorial: Unsupervised Learning Models for Unlabeled Genomic, Transcriptomic & Proteomic Data.社论:用于未标记基因组、转录组和蛋白质组数据的无监督学习模型
Front Genet. 2021 Nov 11;12:781698. doi: 10.3389/fgene.2021.781698. eCollection 2021.
7
SCYN: single cell CNV profiling method using dynamic programming.SCYN:一种使用动态规划进行单细胞 CNV 分析的方法。
BMC Genomics. 2021 Nov 16;22(Suppl 5):651. doi: 10.1186/s12864-021-07941-3.
8
SCSIM: Jointly simulating correlated single-cell and bulk next-generation DNA sequencing data.SCSIM:联合模拟相关单细胞和批量下一代 DNA 测序数据。
BMC Bioinformatics. 2020 May 26;21(1):215. doi: 10.1186/s12859-020-03550-1.
Nat Commun. 2018 Jan 15;9(1):226. doi: 10.1038/s41467-017-02628-4.
4
SiFit: inferring tumor trees from single-cell sequencing data under finite-sites models.SiFit:在有限位点模型下从单细胞测序数据中推断肿瘤树。
Genome Biol. 2017 Sep 19;18(1):178. doi: 10.1186/s13059-017-1311-2.
5
A comparison of tools for the simulation of genomic next-generation sequencing data.用于模拟基因组下一代测序数据的工具比较。
Nat Rev Genet. 2016 Aug;17(8):459-69. doi: 10.1038/nrg.2016.57. Epub 2016 Jun 20.
6
IntSIM: An Integrated Simulator of Next-Generation Sequencing Data.IntSIM:下一代测序数据集成模拟器
IEEE Trans Biomed Eng. 2017 Feb;64(2):441-451. doi: 10.1109/TBME.2016.2560939. Epub 2016 Apr 29.
7
Monovar: single-nucleotide variant detection in single cells.Monovar:单细胞中的单核苷酸变异检测
Nat Methods. 2016 Jun;13(6):505-7. doi: 10.1038/nmeth.3835. Epub 2016 Apr 18.
8
OncoNEM: inferring tumor evolution from single-cell sequencing data.OncoNEM:从单细胞测序数据推断肿瘤进化
Genome Biol. 2016 Apr 15;17:69. doi: 10.1186/s13059-016-0929-9.
9
Genome-wide detection of single-nucleotide and copy-number variations of a single human cell.单个人类细胞中单核苷酸和拷贝数变异的全基因组检测。
Science. 2012 Dec 21;338(6114):1622-6. doi: 10.1126/science.1229164.
10
The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.基因组分析工具包:一种用于分析下一代 DNA 测序数据的 MapReduce 框架。
Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19.