
Flexible parsing, interpretation, and editing of technical sequences with splitcode.

Authors

Sullivan Delaney K, Pachter Lior

Affiliations

UCLA-Caltech Medical Scientist Training Program, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA.

Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA.

Publication information

bioRxiv. 2023 Dec 9:2023.03.20.533521. doi: 10.1101/2023.03.20.533521.

DOI:10.1101/2023.03.20.533521
PMID:36993532
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10055216/
Abstract

Next-generation sequencing libraries are constructed with numerous synthetic constructs such as sequencing adapters, barcodes, and unique molecular identifiers. Such sequences can be essential for interpreting results of sequencing assays, and when they contain information pertinent to an experiment, they must be processed and analyzed. We present a tool called splitcode, that enables flexible and efficient parsing, interpreting, and editing of sequencing reads. This versatile tool facilitates simple, reproducible preprocessing of reads from libraries constructed for a large array of single-cell and bulk sequencing assays.
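
To make the kind of preprocessing described above concrete, here is a minimal Python sketch of pulling a barcode and a unique molecular identifier (UMI) out of a read by locating a known synthetic sequence. It is not splitcode's implementation or interface. The read layout ([barcode][linker][UMI][insert]), the linker sequence, the segment lengths, and every function name are hypothetical, chosen only for illustration.

```python
from typing import NamedTuple, Optional


class ParsedRead(NamedTuple):
    barcode: str
    umi: str
    insert: str


def hamming(a: str, b: str) -> int:
    """Count mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def find_tag(read: str, tag: str, max_mismatches: int = 1) -> Optional[int]:
    """Return the leftmost 0-based position where `tag` matches `read`
    with at most `max_mismatches` substitutions, or None if absent."""
    for i in range(len(read) - len(tag) + 1):
        if hamming(read[i:i + len(tag)], tag) <= max_mismatches:
            return i
    return None


def parse_read(read: str, linker: str, barcode_len: int = 8,
               umi_len: int = 6, max_mismatches: int = 1) -> Optional[ParsedRead]:
    """Split a read assumed to be laid out as [barcode][linker][UMI][insert].

    The layout and default lengths are illustrative assumptions,
    not taken from any particular assay.
    """
    pos = find_tag(read, linker, max_mismatches)
    if pos is None or pos < barcode_len:
        return None  # linker missing, or not enough bases before it
    umi_start = pos + len(linker)
    if len(read) < umi_start + umi_len:
        return None  # read too short to contain a full UMI
    return ParsedRead(
        barcode=read[pos - barcode_len:pos],
        umi=read[umi_start:umi_start + umi_len],
        insert=read[umi_start + umi_len:],  # trimmed biological sequence
    )


if __name__ == "__main__":
    example = "ACGTACGT" + "GGATCC" + "TTTAAA" + "CCCCGGGG"
    print(parse_read(example, linker="GGATCC"))
```

Hard-coding a layout like this is what becomes difficult to maintain across many assays, which is the problem a general-purpose, configurable tool is meant to address.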

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/283f/10713027/8054c2a9462c/nihpp-2023.03.20.533521v3-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/283f/10713027/f6d539ff6891/nihpp-2023.03.20.533521v3-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/283f/10713027/cf2c393b99d8/nihpp-2023.03.20.533521v3-f0002.jpg
