IEEE Trans Nanobioscience. 2021 Apr;20(2):212-222. doi: 10.1109/TNB.2021.3056351. Epub 2021 Mar 31.
With the development of information technology, huge amounts of data are produced at the same time. How to store data efficiently and at low cost has become an urgent problem. DNA is a high-density and persistent medium, making DNA storage a viable solution. In a DNA data storage system, the first consideration is how to encode the data effectively into code words. However, DNA strands are prone to non-specific hybridization during the hybridization reaction process and are prone to errors during synthesis and sequencing. In order to reduce the error rate, a thermodynamic minimum free energy (MFE) constraint is proposed and applied to the construction of coding sets for DNA storage. The Brownian multi-verse optimizer (BMVO) algorithm, based on the Multi-verse optimizer (MVO) algorithm, incorporates the idea of Brownian motion and Nelder-Mead method, and it is used to design a better DNA storage coding set. In addition, compared with previous works, the coding set has been increasing by 4%-50% in size and has better thermodynamic properties. With the improvement of the quality of the DNA coding set, the accuracy of reading and writing and the robustness of the DNA storage system are also enhanced.
随着信息技术的发展,同时产生了大量的数据。如何高效、低成本地存储数据已成为一个紧迫的问题。DNA 是一种高密度、持久性的介质,使得 DNA 存储成为一种可行的解决方案。在 DNA 数据存储系统中,首先要考虑的是如何将数据有效地编码成码字。然而,在杂交反应过程中,DNA 链容易发生非特异性杂交,并且在合成和测序过程中容易出错。为了降低错误率,提出了热力学最小自由能(MFE)约束,并将其应用于 DNA 存储编码集的构建。布朗多宇宙优化器(BMVO)算法是在多宇宙优化器(MVO)算法的基础上,结合了布朗运动和Nelder-Mead 方法的思想,用于设计更好的 DNA 存储编码集。此外,与以前的工作相比,编码集的大小增加了 4%-50%,并且具有更好的热力学特性。随着 DNA 编码集质量的提高,读写的准确性和 DNA 存储系统的鲁棒性也得到了增强。