Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China.
Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen, 518055, China.
Small Methods. 2024 Oct;8(10):e2301585. doi: 10.1002/smtd.202301585. Epub 2024 May 29.
DNA-based data storage is an emerging technology in computational and synthetic biology that offers a solution for long-term, high-density data archiving. Given the critical importance of medical data in advancing human health, there is growing interest in developing an effective DNA-based storage system for medical data. Data integrity, accuracy, reliability, and efficient retrieval are all significant concerns. This study therefore proposes an Effective DNA Storage (EDS) approach for archiving medical MRI data. The EDS approach incorporates three key components: (i) a novel fraction strategy to address a critical weakness of rotating encoding, which often leads to data loss due to single-base error propagation; (ii) a novel rule-based quaternary transcoding method that satisfies bio-constraints and ensures reliable mapping; and (iii) an indexing technique designed to simplify random search and access. The effectiveness of this approach is validated through computer simulations and biological experiments, confirming its practicality. The EDS approach outperforms existing methods, providing superior control over bio-constraints and reducing computational time. The results and code provided in this study open new avenues for practical DNA storage of medical MRI data, offering promising prospects for the future of medical data archiving and retrieval.
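To make the terms "quaternary transcoding" and "bio-constraints" concrete, the following minimal Python sketch shows a plain two-bits-per-base mapping and a simple constraint check. It is not the rule-based EDS method, which the abstract does not specify. The digit-to-base mapping, the function names, the 40-60% GC window, and the homopolymer limit of 3 are all assumptions chosen for illustration.

```python
# Illustrative sketch only; this is NOT the EDS transcoding from the paper.
BASES = "ACGT"  # assumed mapping of quaternary digits 0, 1, 2, 3 to A, C, G, T


def bytes_to_dna(data: bytes) -> str:
    """Map each byte to four bases, two bits per base, high bits first."""
    return "".join(
        BASES[(byte >> shift) & 0b11] for byte in data for shift in (6, 4, 2, 0)
    )


def dna_to_bytes(seq: str) -> bytes:
    """Inverse of bytes_to_dna; the length must be a multiple of four."""
    if len(seq) % 4 != 0:
        raise ValueError("sequence length must be a multiple of 4")
    digits = [BASES.index(base) for base in seq]
    out = bytearray()
    for i in range(0, len(digits), 4):
        a, b, c, d = digits[i:i + 4]
        out.append((a << 6) | (b << 4) | (c << 2) | d)
    return bytes(out)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def max_homopolymer(seq: str) -> int:
    """Length of the longest run of identical consecutive bases."""
    if not seq:
        return 0
    longest = run = 1
    for prev, cur in zip(seq, seq[1:]):
        run = run + 1 if cur == prev else 1
        longest = max(longest, run)
    return longest


def satisfies_constraints(seq: str, gc_range=(0.4, 0.6), max_run=3) -> bool:
    """Check the assumed GC window and homopolymer limit (hypothetical values)."""
    low, high = gc_range
    return low <= gc_content(seq) <= high and max_homopolymer(seq) <= max_run
```

A naive mapping like this gives no control over the output: a zero byte becomes `AAAA`, which fails both checks. Constraint-aware transcoding schemes are designed to avoid exactly this kind of sequence.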