Electrical and Computer Engineering, University of Texas at Austin, Austin, TX 78712.
Department of Molecular Biosciences, University of Texas at Austin, Austin, TX 78712.
Proc Natl Acad Sci U S A. 2023 Sep 12;120(37):e2217330120. doi: 10.1073/pnas.2217330120. Epub 2023 Sep 5.
DNA is an incredibly dense storage medium for digital data. However, computing on the stored information is expensive and slow, requiring rounds of sequencing, in silico computation, and DNA synthesis. Prior work on accessing and modifying data using DNA hybridization or enzymatic reactions had limited computation capabilities. Inspired by the computational power of "DNA strand displacement," we augment DNA storage with "in-memory" molecular computation using strand displacement reactions to algorithmically modify data in a parallel manner. We show programs for binary counting and Turing universal cellular automaton Rule 110, the latter of which is, in principle, capable of implementing any computer algorithm. Information is stored in the nicks of DNA, and a secondary sequence-level encoding allows high-throughput sequencing-based readout. We conducted multiple rounds of computation on 4-bit data registers, as well as random access of data (selective access and erasure). We demonstrate that large strand displacement cascades with 244 distinct strand exchanges (sequential and in parallel) can use naturally occurring DNA sequence from M13 bacteriophage without stringent sequence design, which has the potential to improve the scale of computation and decrease cost. Our work merges DNA storage and DNA computing, setting the foundation of entirely molecular algorithms for parallel manipulation of digital information preserved in DNA.
DNA 是一种用于数字数据的非常密集的存储介质。然而,对存储信息进行计算既昂贵又缓慢,需要进行多次测序、计算机模拟计算和 DNA 合成。先前使用 DNA 杂交或酶反应来访问和修改数据的工作具有有限的计算能力。受“DNA 链置换”的计算能力的启发,我们使用链置换反应在内存中进行分子计算,以并行方式算法式地修改数据,从而增强 DNA 存储的能力。我们展示了用于二进制计数和图灵通用细胞自动机规则 110 的程序,后者原则上能够实现任何计算机算法。信息存储在 DNA 的缺口处,二级序列级别的编码允许基于高通量测序的读出。我们对 4 位数据寄存器进行了多次计算,以及数据的随机访问(选择性访问和擦除)。我们证明,具有 244 个不同链交换(顺序和并行)的大链置换级联可以使用来自 M13 噬菌体的天然存在的 DNA 序列,而无需严格的序列设计,这有可能提高计算的规模并降低成本。我们的工作将 DNA 存储和 DNA 计算相结合,为在 DNA 中保存的数字信息的并行操作建立了完全基于分子的算法基础。