消除瓶颈：引入cMatch——一种用于合成生物学中构建体匹配的轻量级工具。

Removing the Bottleneck: Introducing cMatch - A Lightweight Tool for Construct-Matching in Synthetic Biology.

作者信息

Casas Alexis, Bultelle Matthieu, Motraghi Charles, Kitney Richard

机构信息

Department of Bioengineering, Imperial College London, London, United Kingdom.

出版信息

Front Bioeng Biotechnol. 2022 Jan 10;9:785131. doi: 10.3389/fbioe.2021.785131. eCollection 2021.

DOI:10.3389/fbioe.2021.785131

PMID:35083201

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8784771/

Abstract

We present a software tool, called cMatch, to reconstruct and identify synthetic genetic constructs from their sequences, or a set of sub-sequences-based on two practical pieces of information: their modular structure, and libraries of components. Although developed for combinatorial pathway engineering problems and addressing their quality control (QC) bottleneck, cMatch is not restricted to these applications. QC takes place post assembly, transformation and growth. It has a simple goal, to verify that the genetic material contained in a cell matches what was intended to be built - and when it is not the case, to locate the discrepancies and estimate their severity. In terms of reproducibility/reliability, the QC step is crucial. Failure at this step requires repetition of the construction and/or sequencing steps. When performed manually or semi-manually QC is an extremely time-consuming, error prone process, which scales very poorly with the number of constructs and their complexity. To make QC frictionless and more reliable, cMatch performs an operation we have called "construct-matching" and automates it. Construct-matching is more thorough than simple sequence-matching, as it matches at the functional level-and quantifies the matching at the individual component level and across the whole construct. Two algorithms (called CM_1 and CM_2) are presented. They differ according to the nature of their inputs. CM_1 is the core algorithm for construct-matching and is to be used when input sequences are long enough to cover constructs in their entirety (e.g., obtained with methods such as next generation sequencing). CM_2 is an extension designed to deal with shorter data (e.g., obtained with Sanger sequencing), and that need recombining. Both algorithms are shown to yield accurate construct-matching in a few minutes (even on hardware with limited processing power), together with a set of metrics that can be used to improve the robustness of the decision-making process. To ensure reliability and reproducibility, cMatch builds on the highly validated pairwise-matching Smith-Waterman algorithm. All the tests presented have been conducted on synthetic data for challenging, yet realistic constructs - and on real data gathered during studies on a metabolic engineering example (lycopene production).

摘要

我们展示了一种名为cMatch的软件工具，用于根据合成基因构建体的序列或基于两个实用信息的一组子序列来重建和识别它们：模块化结构和组件库。尽管cMatch是为组合途径工程问题而开发，并解决其质量控制（QC）瓶颈，但它并不局限于这些应用。质量控制在组装、转化和生长之后进行。它有一个简单的目标，即验证细胞中包含的遗传物质是否与预期构建的物质相匹配——如果不匹配，则找出差异并估计其严重程度。就可重复性/可靠性而言，质量控制步骤至关重要。此步骤失败需要重复构建和/或测序步骤。当手动或半手动执行时，质量控制是一个极其耗时、容易出错的过程，其随着构建体数量及其复杂性的增加扩展性很差。为了使质量控制更顺畅、更可靠，cMatch执行了一种我们称为“构建体匹配”的操作并将其自动化。构建体匹配比简单的序列匹配更全面，因为它在功能层面进行匹配，并在单个组件层面以及整个构建体层面量化匹配情况。我们提出了两种算法（称为CM_1和CM_2）。它们根据输入的性质而有所不同。CM_1是构建体匹配的核心算法，当输入序列足够长以完全覆盖构建体时（例如，通过下一代测序等方法获得）使用。CM_2是为处理较短数据（例如，通过桑格测序获得）而设计的扩展算法，这些数据需要重新组合。结果表明，这两种算法都能在几分钟内（即使在处理能力有限的硬件上）实现准确的构建体匹配，并提供一组可用于提高决策过程稳健性的指标。为确保可靠性和可重复性，cMatch基于经过高度验证的成对匹配史密斯 - 沃特曼算法构建。所展示的所有测试均针对具有挑战性但现实的构建体的合成数据以及在一个代谢工程实例（番茄红素生产）研究期间收集的真实数据进行。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/303e/8784771/5f115129e58d/fbioe-09-785131-g001.jpg

相似文献

Removing the Bottleneck: Introducing cMatch - A Lightweight Tool for Construct-Matching in Synthetic Biology.消除瓶颈：引入cMatch——一种用于合成生物学中构建体匹配的轻量级工具。

Front Bioeng Biotechnol. 2022 Jan 10;9:785131. doi: 10.3389/fbioe.2021.785131. eCollection 2021.

Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象：化学与物理邂逅生物学（瑞士阿斯科纳，2012年6月10日至14日）

Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.

SMRT Gate: A method for validation of synthetic constructs on Pacific Biosciences sequencing platforms.SMRT Gate：一种用于在太平洋生物科学测序平台上验证合成构建体的方法。

Biotechniques. 2017 Jul 1;63(1):13-20. doi: 10.2144/000114565.

Gene composer: database software for protein construct design, codon engineering, and gene synthesis.基因编写器：用于蛋白质构建体设计、密码子工程和基因合成的数据库软件。

BMC Biotechnol. 2009 Apr 21;9:36. doi: 10.1186/1472-6750-9-36.

Assembly of Complex Pathways Using Type IIs Restriction Enzymes.使用IIs型限制酶组装复杂途径

Methods Mol Biol. 2019;1927:93-109. doi: 10.1007/978-1-4939-9142-6_7.

Lightweight Pattern Matching Method for DNA Sequencing in Internet of Medical Things.物联网中 DNA 测序的轻量级模式匹配方法。

Comput Intell Neurosci. 2022 Sep 8;2022:6980335. doi: 10.1155/2022/6980335. eCollection 2022.

Assembly of Multigene Constructs Using the Modular Cloning System MoClo.使用模块化克隆系统 MoClo 进行多基因构建体的组装。

Methods Mol Biol. 2020;2205:125-141. doi: 10.1007/978-1-0716-0908-8_8.

Software for pre-processing Illumina next-generation sequencing short read sequences.用于预处理Illumina下一代测序短读序列的软件。

Source Code Biol Med. 2014 May 3;9:8. doi: 10.1186/1751-0473-9-8. eCollection 2014.

Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学：基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍

Engineering Aspects of Olfaction嗅觉的工程学方面

引用本文的文献

Bioengineering Outer-Membrane Vesicles for Vaccine Development: Strategies, Advances, and Perspectives.用于疫苗开发的生物工程外膜囊泡：策略、进展与展望

Vaccines (Basel). 2025 Jul 20;13(7):767. doi: 10.3390/vaccines13070767.

Engineering biology and automation-Replicability as a design principle.工程生物学与自动化——作为一种设计原则的可重复性

Eng Biol. 2024 Jul 12;8(4):53-68. doi: 10.1049/enb2.12035. eCollection 2024 Dec.

Opportunities for engineering outer membrane vesicles using synthetic biology approaches.利用合成生物学方法改造外膜囊泡的机会。

Extracell Vesicles Circ Nucl Acids. 2023 Jun 8;4(2):255-261. doi: 10.20517/evcna.2023.21. eCollection 2023.

basicsynbio and the BASIC SEVA collection: software and vectors for an established DNA assembly method.基础合成生物学与基础SEVA文库：用于成熟DNA组装方法的软件和载体。

Synth Biol (Oxf). 2022 Oct 11;7(1):ysac023. doi: 10.1093/synbio/ysac023. eCollection 2022.

本文引用的文献

Construction and Characterization of a Gradient Strength Promoter Library for Fine-Tuned Gene Expression in .构建和表征梯度强度启动子文库，用于精细调控. 中的基因表达。

ACS Synth Biol. 2021 Sep 17;10(9):2331-2339. doi: 10.1021/acssynbio.1c00242. Epub 2021 Aug 27.

Importance of the 5' regulatory region to bacterial synthetic biology applications.5' 调控区对细菌合成生物学应用的重要性。

Microb Biotechnol. 2021 Nov;14(6):2291-2315. doi: 10.1111/1751-7915.13868. Epub 2021 Jun 25.

pLannotate: engineered plasmid annotation.pLannotate：工程质粒注释。

Nucleic Acids Res. 2021 Jul 2;49(W1):W516-W522. doi: 10.1093/nar/gkab374.

Statistical Design of Experiments for Synthetic Biology.合成生物学实验的统计设计。

ACS Synth Biol. 2021 Jan 15;10(1):1-18. doi: 10.1021/acssynbio.0c00385. Epub 2021 Jan 7.

Improving the reaction mix of a cell-free system using a design of experiments approach to minimise experimental effort.采用实验设计方法改进无细胞系统的反应混合物，以尽量减少实验工作量。

Synth Syst Biotechnol. 2020 Jun 23;5(3):137-144. doi: 10.1016/j.synbio.2020.06.003. eCollection 2020 Sep.

Applying Statistical Design of Experiments To Understanding the Effect of Growth Medium Components on Cupriavidus necator H16 Growth.应用实验设计统计学方法理解生长介质成分对necator H16 生长的影响。

Appl Environ Microbiol. 2020 Aug 18;86(17). doi: 10.1128/AEM.00705-20.

Application of combinatorial optimization strategies in synthetic biology.组合优化策略在合成生物学中的应用。

Nat Commun. 2020 May 15;11(1):2446. doi: 10.1038/s41467-020-16175-y.

Repository-based plasmid design.基于库的质粒设计。

PLoS One. 2020 Jan 9;15(1):e0223935. doi: 10.1371/journal.pone.0223935. eCollection 2020.

Automated Design of Diverse Stand-Alone Riboswitches.多种独立核糖开关的自动化设计

ACS Synth Biol. 2019 Aug 16;8(8):1838-1846. doi: 10.1021/acssynbio.9b00142. Epub 2019 Jul 29.

A design of experiments approach for the rapid formulation of a chemically defined medium for metabolic profiling of industrially important microbes.一种用于快速制定工业上重要微生物代谢轮廓分析的化学定义培养基的实验设计方法。

PLoS One. 2019 Jun 12;14(6):e0218208. doi: 10.1371/journal.pone.0218208. eCollection 2019.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

消除瓶颈：引入cMatch——一种用于合成生物学中构建体匹配的轻量级工具。

Removing the Bottleneck: Introducing cMatch - A Lightweight Tool for Construct-Matching in Synthetic Biology.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献