• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Low-complexity and highly robust barcodes for error-rich single molecular sequencing.用于富含错误的单分子测序的低复杂度且高度稳健的条形码。
3 Biotech. 2021 Feb;11(2):78. doi: 10.1007/s13205-020-02607-5. Epub 2021 Jan 16.
2
Sequencing barcode construction and identification methods based on block error-correction codes.基于块纠错码的测序条码构建和识别方法。
Sci China Life Sci. 2020 Oct;63(10):1580-1592. doi: 10.1007/s11427-019-1651-3. Epub 2020 Apr 14.
3
Indel-correcting DNA barcodes for high-throughput sequencing.高通量测序的无错切 DNA 条形码。
Proc Natl Acad Sci U S A. 2018 Jul 3;115(27):E6217-E6226. doi: 10.1073/pnas.1802640115. Epub 2018 Jun 20.
4
Insertion and deletion correcting DNA barcodes based on watermarks.基于水印的插入和缺失校正DNA条形码
BMC Bioinformatics. 2015 Feb 18;16:50. doi: 10.1186/s12859-015-0482-7.
5
Levenshtein error-correcting barcodes for multiplexed DNA sequencing.莱文斯坦纠错条码在多重 DNA 测序中的应用。
BMC Bioinformatics. 2013 Sep 11;14:272. doi: 10.1186/1471-2105-14-272.
6
Pheniqs 2.0: accurate, high-performance Bayesian decoding and confidence estimation for combinatorial barcode indexing.Pheniqs 2.0:用于组合条码索引的准确、高性能贝叶斯解码和置信度估计。
BMC Bioinformatics. 2021 Jul 2;22(1):359. doi: 10.1186/s12859-021-04267-5.
7
Mutation Analysis of Synthetic DNA Barcodes in a Fission Yeast Gene Deletion Library by Sanger Sequencing.利用桑格测序法对裂殖酵母基因缺失文库中的合成DNA条形码进行突变分析
Genomics Inform. 2018 Jun;16(2):22-29. doi: 10.5808/GI.2018.16.2.22. Epub 2018 Jun 30.
8
Designing robust watermark barcodes for multiplex long-read sequencing.为多重长读长测序设计稳健的水印条形码。
Bioinformatics. 2017 Mar 15;33(6):807-813. doi: 10.1093/bioinformatics/btw322.
9
Fast trimer statistics facilitate accurate decoding of large random DNA barcode sets even at large sequencing error rates.快速三聚体统计即使在测序错误率很高的情况下也有助于对大型随机DNA条形码集进行准确解码。
PNAS Nexus. 2022 Nov 4;1(5):pgac252. doi: 10.1093/pnasnexus/pgac252. eCollection 2022 Nov.
10
BARCOSEL: a tool for selecting an optimal barcode set for high-throughput sequencing.BARCOSEL:一种用于为高通量测序选择最佳条码集的工具。
BMC Bioinformatics. 2018 Jul 5;19(1):257. doi: 10.1186/s12859-018-2262-7.

引用本文的文献

1
Large-Scale Rice Mutant Establishment and High-Throughput Mutant Manipulation Help Advance Rice Functional Genomics.大规模水稻突变体构建及高通量突变体操作助力水稻功能基因组学发展
Plants (Basel). 2025 May 16;14(10):1492. doi: 10.3390/plants14101492.

本文引用的文献

1
Sequencing barcode construction and identification methods based on block error-correction codes.基于块纠错码的测序条码构建和识别方法。
Sci China Life Sci. 2020 Oct;63(10):1580-1592. doi: 10.1007/s11427-019-1651-3. Epub 2020 Apr 14.
2
Playing a long game.进行一场持久战。
Nat Methods. 2019 Aug;16(8):683-686. doi: 10.1038/s41592-019-0507-7.
3
DNA barcodes for rapid, whole genome, single-molecule analyses.用于快速、全基因组、单分子分析的 DNA 条形码。
Nucleic Acids Res. 2019 Jul 9;47(12):e68. doi: 10.1093/nar/gkz212.
4
Barcode identification for single cell genomics.单细胞基因组学的条形码识别。
BMC Bioinformatics. 2019 Jan 17;20(1):32. doi: 10.1186/s12859-019-2612-0.
5
Detection and removal of barcode swapping in single-cell RNA-seq data.单细胞 RNA-seq 数据中条形码交换的检测与去除。
Nat Commun. 2018 Jul 10;9(1):2667. doi: 10.1038/s41467-018-05083-x.
6
BARCOSEL: a tool for selecting an optimal barcode set for high-throughput sequencing.BARCOSEL:一种用于为高通量测序选择最佳条码集的工具。
BMC Bioinformatics. 2018 Jul 5;19(1):257. doi: 10.1186/s12859-018-2262-7.
7
Indel-correcting DNA barcodes for high-throughput sequencing.高通量测序的无错切 DNA 条形码。
Proc Natl Acad Sci U S A. 2018 Jul 3;115(27):E6217-E6226. doi: 10.1073/pnas.1802640115. Epub 2018 Jun 20.
8
Characterization and remediation of sample index swaps by non-redundant dual indexing on massively parallel sequencing platforms.基于大规模平行测序平台的非冗余双索引对样本索引交换的特征描述和修复。
BMC Genomics. 2018 May 8;19(1):332. doi: 10.1186/s12864-018-4703-0.
9
Computational correction of index switching in multiplexed sequencing libraries.多重测序文库中索引切换的计算校正
Nat Methods. 2018 Apr 27;15(5):305-307. doi: 10.1038/nmeth.4666.
10
Sample-Index Misassignment Impacts Tumour Exome Sequencing.样本索引错配影响肿瘤外显子组测序。
Sci Rep. 2018 Mar 28;8(1):5307. doi: 10.1038/s41598-018-23563-4.

用于富含错误的单分子测序的低复杂度且高度稳健的条形码。

Low-complexity and highly robust barcodes for error-rich single molecular sequencing.

作者信息

Chen Weigang, Wang Panpan, Wang Lixia, Zhang Dalu, Han Mingzhe, Han Mingyong, Song Lifu

机构信息

School of Microelectronics, Tianjin University, Tianjin, 300072 People's Republic of China.

Frontier Science Center for Synthetic Biology (Ministry of Education), Tianjin University, Tianjin, 300072 People's Republic of China.

出版信息

3 Biotech. 2021 Feb;11(2):78. doi: 10.1007/s13205-020-02607-5. Epub 2021 Jan 16.

DOI:10.1007/s13205-020-02607-5
PMID:33505833
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7811498/
Abstract

DNA barcodes are frequently corrupted due to insertion, deletion, and substitution errors during DNA synthesis, amplification and sequencing, resulting in index hopping. In this paper, we propose a new DNA barcode construction scheme that combines a cyclic block code with a predetermined pseudo-random sequence bit by bit to form bit pairs, and then converts the bit pairs to bases, i.e., the DNA barcodes. Then, we present a barcode identification scheme for noisy sequencing reads, which uses a combination of cyclic shifting and traditional dynamic programming to mark the insertion and deletion positions, and then performs erasure-and-error-correction decoding on the corrupted codewords. Furthermore, we verify the identification error rate of barcodes for multiple errors and evaluate the reliability of the barcodes in DNA context. This method can be easily generalized for constructing long barcodes, which may be used in scenarios with serious errors. Simulation results show that the bit error rate after identifying insertions/deletions is greatly reduced using the combination of cyclic shift and dynamic programming compared to using dynamic programming only. It indicates that the proposed method can effectively improve the accuracy for estimating insertion/deletion errors. And the overall identification error rate of the proposed method is lower than when the probability of each base mutation is less than 0.1, which is the typical scenario in third-generation sequencing.

摘要

DNA条形码在DNA合成、扩增和测序过程中经常因插入、缺失和替换错误而损坏,导致索引跳跃。在本文中,我们提出了一种新的DNA条形码构建方案,该方案将循环分组码与预定的伪随机序列逐位组合形成位对,然后将位对转换为碱基,即DNA条形码。然后,我们提出了一种针对噪声测序读数的条形码识别方案,该方案使用循环移位和传统动态规划的组合来标记插入和缺失位置,然后对损坏的码字进行擦除和纠错解码。此外,我们验证了条形码对多个错误的识别错误率,并评估了DNA环境中条形码的可靠性。该方法可以很容易地推广用于构建长条形码,可用于错误严重的场景。仿真结果表明,与仅使用动态规划相比,使用循环移位和动态规划的组合在识别插入/缺失后的误码率大大降低。这表明所提出的方法可以有效地提高估计插入/缺失错误的准确性。并且当每个碱基突变的概率小于0.1时,所提出方法的总体识别错误率低于 ,这是第三代测序中的典型情况。