Department of Computer Science and Engineering, School of Engineering, Shiv Nadar University NCR, India.
Amity Institute of Nanotechnology, Noida, Amity University Uttar Pradesh, India.
IET Nanobiotechnol. 2020 Sep;14(7):635-641. doi: 10.1049/iet-nbt.2020.0157.
There has been exponential growth in the amount of data being generated on a daily basis. Such a huge amount of data creates a need for efficient data storage techniques. Due to the limitations of existing storage media, new storage solutions have always been of interest. There have been recent developments in order to efficiently use synthetic deoxyribonucleic acid (DNA) for information storage. DNA storage has attracted researchers because of its extremely high data storage density, about 1 exabyte/mm and long life under easily achievable conditions. This work presents an encoding scheme for DNA-based data storage system with controllable redundancy and reliability, the authors have also talked about the feasibility of the proposed method. The authors have also analysed the proposed algorithm for time and space complexity. The proposed encoding scheme tries to minimise the bases per letter ratio while controlling the redundancy. They have experimented with three different types of data with a value of redundancy as 0.75. In the randomised simulation setup, it was observed that the proposed algorithm was able to correctly retrieve the stored data in our experiments about 94% of the time. In the situation, where redundancy was increased to 1, the authors were able to retrieve all the information correctly in the proposed experiments.
每天生成的数据量呈指数级增长。如此大量的数据需要高效的数据存储技术。由于现有存储介质的限制,新的存储解决方案一直受到关注。最近已经开发出了一些方法,以便有效地利用合成脱氧核糖核酸(DNA)进行信息存储。由于其极高的数据存储密度(约为 1 艾字节/毫米)和在易于实现的条件下的长寿命,DNA 存储吸引了研究人员的注意。这项工作提出了一种具有可控冗余和可靠性的基于 DNA 的数据存储系统的编码方案,作者还讨论了该方法的可行性。作者还分析了所提出算法的时间和空间复杂度。所提出的编码方案试图在控制冗余的同时最小化每个字母的碱基比。他们用三种不同类型的数据进行了实验,冗余值为 0.75。在随机模拟设置中,观察到所提出的算法能够在大约 94%的时间内正确地检索存储的数据。在冗余增加到 1 的情况下,作者能够在提出的实验中正确地检索所有信息。