Suppr超能文献

DNA存储模拟器:一种用于DNA数据存储的计算模拟器。

Dna-storalator: a computational simulator for DNA data storage.

作者信息

Chaykin Gadi, Sabary Omer, Furman Nili, Shabat Dvir Ben, Yaakobi Eitan

机构信息

Henry and Marilyn Taub Faculty of Computer Science, Technion - Israel Institute of Technology, 3200003, Haifa, Israel.

出版信息

BMC Bioinformatics. 2025 Aug 4;26(1):204. doi: 10.1186/s12859-025-06222-0.

Abstract

BACKGROUND

DNA data storage is an emerging technology that caught the attention of many researchers and engineers. This technology uses DNA molecules as a storage medium and thus presents an extremely dense and durable storage device. However, the unique nature of the errors in DNA, which include insertion, deletion, and substitution errors, requires the development of new algorithmic and coding solutions for these storage systems.

RESULTS

The DNA-Storalator is a cross-platform software tool that simulates in a simplified digital point of view biological and computational processes involved in the process of storing data in DNA molecules. The simulator receives an input file with the designed DNA strands that store digital data and emulates the different biological and algorithmical components of DNA-based storage system. The biological component includes simulation of the synthesis, PCR, and sequencing stages which are expensive and complicated and therefore are not widely accessible to the community. These processes amplify the data and generate noisy copies of each DNA strand, where the errors are insertions, deletions, long-deletions, and substitutions. The DNA-Storalator injects errors to the data based on the error rates, as they vary between different synthesis and sequencing technologies. The rates are based on comprehensive analysis of data from previous experiments but can also be customized. Additionally, the tool can analyze new datasets and characterize their error rates to build new error models for future usage in the simulator. The DNA-Storalator also enables control of the amplification process and the distribution of the number of copies per designed strand. The coding and algorithmic components are: 1. Clustering algorithms which partition all output noisy strands into groups according to the designed strand they originated from; 2. State-of-the-art reconstruction algorithms that are invoked on each cluster to output a close/exact estimation of the designed strand; 3. Integration with external error-correcting codes and other encoding and decoding techniques.

CONCLUSIONS

The suggested computational DNA storage simulator grants researchers from all fields an accessible complete simulator to examine new biological technologies, coding techniques, and algorithms for current and future DNA storage systems.

摘要

背景

DNA数据存储是一项新兴技术,引起了众多研究人员和工程师的关注。该技术使用DNA分子作为存储介质,从而呈现出一种极其密集且耐用的存储设备。然而,DNA中错误的独特性质,包括插入、缺失和替换错误,需要为这些存储系统开发新的算法和编码解决方案。

结果

DNA-Storalator是一个跨平台软件工具,它从简化的数字角度模拟了在DNA分子中存储数据过程中涉及的生物学和计算过程。该模拟器接收一个包含设计好的存储数字数据的DNA链的输入文件,并模拟基于DNA的存储系统的不同生物学和算法组件。生物学组件包括对合成、PCR和测序阶段的模拟,这些阶段成本高昂且复杂,因此社区无法广泛使用。这些过程会扩增数据并生成每个DNA链的有噪声副本,其中的错误包括插入、缺失、长缺失和替换。DNA-Storalator根据错误率向数据中注入错误,因为不同的合成和测序技术之间错误率有所不同。这些比率基于对先前实验数据的全面分析,但也可以定制。此外,该工具可以分析新的数据集并表征其错误率,以构建新的错误模型供模拟器未来使用。DNA-Storalator还能够控制扩增过程以及每条设计链的副本数量分布。编码和算法组件包括:1. 聚类算法,它根据输出噪声链所源自的设计链将所有输出噪声链划分为不同组;2. 最先进的重建算法,对每个聚类调用该算法以输出设计链的近似/精确估计;3. 与外部纠错码以及其他编码和解码技术的集成。

结论

所建议的计算DNA存储模拟器为各个领域的研究人员提供了一个易于使用的完整模拟器,用于研究当前和未来DNA存储系统的新生物技术、编码技术和算法。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验