• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞基因组学的条形码识别。

Barcode identification for single cell genomics.

机构信息

Division of Biology and Biological Engineering, California Institute of Technology, 116 Kerckhoff Laboratory, Pasadena, CA, 91125, USA.

Departments of Biology and Computing & Mathematical Sciences, California Institute of Technology, 116 Kerckhoff Laboratory, Pasadena, CA, 91125, USA.

出版信息

BMC Bioinformatics. 2019 Jan 17;20(1):32. doi: 10.1186/s12859-019-2612-0.

DOI:10.1186/s12859-019-2612-0
PMID:30654736
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6337828/
Abstract

BACKGROUND

Single-cell sequencing experiments use short DNA barcode 'tags' to identify reads that originate from the same cell. In order to recover single-cell information from such experiments, reads must be grouped based on their barcode tag, a crucial processing step that precedes other computations. However, this step can be difficult due to high rates of mismatch and deletion errors that can afflict barcodes.

RESULTS

Here we present an approach to identify and error-correct barcodes by traversing the de Bruijn graph of circularized barcode k-mers. Our approach is based on the observation that circularizing a barcode sequence can yield error-free k-mers even when the size of k is large relative to the length of the barcode sequence, a regime which is typical single-cell barcoding applications. This allows for assignment of reads to consensus fingerprints constructed from k-mers.

CONCLUSION

We show that for single-cell RNA-Seq circularization improves the recovery of accurate single-cell transcriptome estimates, especially when there are a high number of errors per read. This approach is robust to the type of error (mismatch, insertion, deletion), as well as to the relative abundances of the cells. Sircel, a software package that implements this approach is described and publically available.

摘要

背景

单细胞测序实验使用短 DNA 条码“标签”来识别来自同一细胞的读取。为了从这类实验中恢复单细胞信息,必须根据其条码标签对读取进行分组,这是在其他计算之前的关键处理步骤。然而,由于条码可能会出现高错配和删除错误,因此此步骤可能会很困难。

结果

我们在此提出了一种通过遍历圆形化条码 k-mer 的 de Bruijn 图来识别和纠正条码的方法。我们的方法基于这样的观察结果:即使当 k 的大小相对于条码序列的长度较大时,圆形化条码序列也可以产生无错误的 k-mer,这种情况在典型的单细胞条码应用中很常见。这允许将读取分配给由 k-mer 构建的共识指纹。

结论

我们表明,对于单细胞 RNA-Seq,圆形化可以提高准确的单细胞转录组估计的恢复,特别是在每个读取有大量错误的情况下。这种方法对错误类型(错配、插入、删除)以及细胞的相对丰度都具有鲁棒性。描述并公开了一个实现该方法的软件包 Sircel。

相似文献

1
Barcode identification for single cell genomics.单细胞基因组学的条形码识别。
BMC Bioinformatics. 2019 Jan 17;20(1):32. doi: 10.1186/s12859-019-2612-0.
2
TraRECo: a greedy approach based de novo transcriptome assembler with read error correction using consensus matrix.TraRECo:一种基于贪心策略的从头转录组组装方法,使用一致矩阵进行读错误校正。
BMC Genomics. 2018 Sep 4;19(1):653. doi: 10.1186/s12864-018-5034-x.
3
Pheniqs 2.0: accurate, high-performance Bayesian decoding and confidence estimation for combinatorial barcode indexing.Pheniqs 2.0:用于组合条码索引的准确、高性能贝叶斯解码和置信度估计。
BMC Bioinformatics. 2021 Jul 2;22(1):359. doi: 10.1186/s12859-021-04267-5.
4
Insertion and deletion correcting DNA barcodes based on watermarks.基于水印的插入和缺失校正DNA条形码
BMC Bioinformatics. 2015 Feb 18;16:50. doi: 10.1186/s12859-015-0482-7.
5
Assembly of long error-prone reads using de Bruijn graphs.使用德布鲁因图组装长易错读段。
Proc Natl Acad Sci U S A. 2016 Dec 27;113(52):E8396-E8405. doi: 10.1073/pnas.1604560113. Epub 2016 Dec 12.
6
Levenshtein error-correcting barcodes for multiplexed DNA sequencing.莱文斯坦纠错条码在多重 DNA 测序中的应用。
BMC Bioinformatics. 2013 Sep 11;14:272. doi: 10.1186/1471-2105-14-272.
7
Compact representation of k-mer de Bruijn graphs for genome read assembly.用于基因组读取组装的 k-mer de Bruijn 图的紧凑表示。
BMC Bioinformatics. 2013 Oct 23;14:313. doi: 10.1186/1471-2105-14-313.
8
INC-Seq: accurate single molecule reads using nanopore sequencing.INC-Seq:使用纳米孔测序进行准确的单分子读取。
Gigascience. 2016 Aug 2;5(1):34. doi: 10.1186/s13742-016-0140-7.
9
Integrating long-range connectivity information into de Bruijn graphs.将长程连接信息整合到 de Bruijn 图中。
Bioinformatics. 2018 Aug 1;34(15):2556-2565. doi: 10.1093/bioinformatics/bty157.
10
Sequencing barcode construction and identification methods based on block error-correction codes.基于块纠错码的测序条码构建和识别方法。
Sci China Life Sci. 2020 Oct;63(10):1580-1592. doi: 10.1007/s11427-019-1651-3. Epub 2020 Apr 14.

引用本文的文献

1
Opportunities to advance cervical cancer prevention and care.推进宫颈癌预防与护理的机遇。
Tumour Virus Res. 2024 Dec;18:200292. doi: 10.1016/j.tvr.2024.200292. Epub 2024 Oct 25.
2
A survey of k-mer methods and applications in bioinformatics.生物信息学中k-mer方法及其应用综述。
Comput Struct Biotechnol J. 2024 May 21;23:2289-2303. doi: 10.1016/j.csbj.2024.05.025. eCollection 2024 Dec.
3
Single cell RNA-seq: a novel tool to unravel virus-host interplay.单细胞RNA测序:揭示病毒与宿主相互作用的新型工具。

本文引用的文献

1
Evolution of pallium, hippocampus, and cortical cell types revealed by single-cell transcriptomics in reptiles.单细胞转录组学揭示爬行动物脑皮层、边缘皮层和海马的演化
Science. 2018 May 25;360(6391):881-888. doi: 10.1126/science.aar4237. Epub 2018 May 3.
2
Cell type atlas and lineage tree of a whole complex animal by single-cell transcriptomics.单细胞转录组学绘制完整复杂动物的细胞类型图谱和谱系树。
Science. 2018 May 25;360(6391). doi: 10.1126/science.aaq1723. Epub 2018 Apr 19.
3
Cell type transcriptome atlas for the planarian .涡虫细胞类型转录组图谱
Virusdisease. 2024 Mar;35(1):41-54. doi: 10.1007/s13337-024-00859-w. Epub 2024 Mar 9.
4
Sequencing the origins of life.探寻生命的起源。
BBA Adv. 2022 Mar 5;2:100049. doi: 10.1016/j.bbadva.2022.100049. eCollection 2022.
5
Single-Cell RNA Sequencing of Bone Marrow Mesenchymal Stem Cells from the Elderly People.老年人骨髓间充质干细胞的单细胞RNA测序
Int J Stem Cells. 2022 May 30;15(2):173-182. doi: 10.15283/ijsc21042.
6
Single-Cell Transcriptome Profiling Simulation Reveals the Impact of Sequencing Parameters and Algorithms on Clustering.单细胞转录组分析模拟揭示测序参数和算法对聚类的影响。
Life (Basel). 2021 Jul 19;11(7):716. doi: 10.3390/life11070716.
7
Single-Cell Transcriptome Analysis as a Promising Tool to Study Pluripotent Stem Cell Reprogramming.单细胞转录组分析作为研究多能干细胞重编程的有前途的工具。
Int J Mol Sci. 2021 Jun 1;22(11):5988. doi: 10.3390/ijms22115988.
8
Efficient CRISPR/Cas9 mediated Pooled-sgRNAs assembly accelerates targeting multiple genes related to male sterility in cotton.高效的CRISPR/Cas9介导的混合sgRNA组装加速了对棉花中多个与雄性不育相关基因的靶向
Plant Methods. 2021 Feb 8;17(1):16. doi: 10.1186/s13007-021-00712-x.
9
Low-complexity and highly robust barcodes for error-rich single molecular sequencing.用于富含错误的单分子测序的低复杂度且高度稳健的条形码。
3 Biotech. 2021 Feb;11(2):78. doi: 10.1007/s13205-020-02607-5. Epub 2021 Jan 16.
10
Mapping regulators of cell fate determination: Approaches and challenges.绘制细胞命运决定的调控因子:方法与挑战。
APL Bioeng. 2020 Jul 1;4(3):031501. doi: 10.1063/5.0004611. eCollection 2020 Sep.
Science. 2018 May 25;360(6391). doi: 10.1126/science.aaq1736. Epub 2018 Apr 19.
4
Single-cell RNA-seq of rheumatoid arthritis synovial tissue using low-cost microfluidic instrumentation.利用低成本微流控仪器对类风湿关节炎滑膜组织进行单细胞 RNA 测序。
Nat Commun. 2018 Feb 23;9(1):791. doi: 10.1038/s41467-017-02659-x.
5
The embryo at single-cell transcriptome resolution.单细胞转录组分辨率下的胚胎。
Science. 2017 Oct 13;358(6360):194-199. doi: 10.1126/science.aan3235. Epub 2017 Aug 31.
6
Pseudoalignment for metagenomic read assignment.用于宏基因组读分配的伪比对。
Bioinformatics. 2017 Jul 15;33(14):2082-2088. doi: 10.1093/bioinformatics/btx106.
7
Power analysis of single-cell RNA-sequencing experiments.单细胞 RNA 测序实验的功效分析。
Nat Methods. 2017 Apr;14(4):381-387. doi: 10.1038/nmeth.4220. Epub 2017 Mar 6.
8
Seq-Well: portable, low-cost RNA sequencing of single cells at high throughput.Seq-Well:高通量、便携式、低成本的单细胞 RNA 测序。
Nat Methods. 2017 Apr;14(4):395-398. doi: 10.1038/nmeth.4179. Epub 2017 Feb 13.
9
Fast and accurate single-cell RNA-seq analysis by clustering of transcript-compatibility counts.通过转录本兼容性计数聚类实现快速准确的单细胞RNA测序分析
Genome Biol. 2016 May 26;17(1):112. doi: 10.1186/s13059-016-0970-8.
10
Near-optimal probabilistic RNA-seq quantification.近乎最优的概率 RNA-seq 定量。
Nat Biotechnol. 2016 May;34(5):525-7. doi: 10.1038/nbt.3519. Epub 2016 Apr 4.