Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA.
Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA.
Nat Commun. 2022 May 27;13(1):2984. doi: 10.1038/s41467-022-30140-x.
DNA-based data storage platforms traditionally encode information only in the nucleotide sequence of the molecule. Here we report on a two-dimensional molecular data storage system that records information in both the sequence and the backbone structure of DNA and performs nontrivial joint data encoding, decoding and processing. Our 2DDNA method efficiently stores images in synthetic DNA and embeds pertinent metadata as nicks in the DNA backbone. To avoid costly worst-case redundancy for correcting sequencing/rewriting errors and to mitigate issues associated with mismatched decoding parameters, we develop machine learning techniques for automatic discoloration detection and image inpainting. The 2DDNA platform is experimentally tested by reconstructing a library of images with undetectable or small visual degradation after readout processing, and by erasing and rewriting copyright metadata encoded in nicks. Our results demonstrate that DNA can serve both as a write-once and rewritable memory for heterogenous data and that data can be erased in a permanent, privacy-preserving manner. Moreover, the storage system can be made robust to degrading channel qualities while avoiding global error-correction redundancy.
基于 DNA 的数据存储平台传统上仅在分子的核苷酸序列中编码信息。在这里,我们报告了一种二维分子数据存储系统,该系统在 DNA 的序列和骨架结构中记录信息,并执行复杂的联合数据编码、解码和处理。我们的 2DDNA 方法可以有效地将图像存储在合成 DNA 中,并将相关元数据嵌入 DNA 骨架中的缺口。为了避免纠正测序/重写错误的昂贵最坏情况冗余,并减轻与解码参数不匹配相关的问题,我们开发了用于自动变色检测和图像修复的机器学习技术。2DDNA 平台通过在读取处理后重建库中不可检测或具有较小视觉降级的图像,并通过擦除和重写缺口中编码的版权元数据来进行实验测试。我们的结果表明,DNA 可以作为异构数据的一次性写入和可重写内存,并且可以以永久的、隐私保护的方式擦除数据。此外,存储系统可以在避免全局纠错冗余的情况下对降级的信道质量具有鲁棒性。