大规模 DNA 数据存储中的随机访问。

Random access in large-scale DNA data storage.

机构信息

Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA.

Microsoft Research, Redmond, Washington, USA.

出版信息

Nat Biotechnol. 2018 Mar;36(3):242-248. doi: 10.1038/nbt.4079. Epub 2018 Feb 19.

DOI:10.1038/nbt.4079

PMID:29457795

Abstract

Synthetic DNA is durable and can encode digital data with high density, making it an attractive medium for data storage. However, recovering stored data on a large-scale currently requires all the DNA in a pool to be sequenced, even if only a subset of the information needs to be extracted. Here, we encode and store 35 distinct files (over 200 MB of data), in more than 13 million DNA oligonucleotides, and show that we can recover each file individually and with no errors, using a random access approach. We design and validate a large library of primers that enable individual recovery of all files stored within the DNA. We also develop an algorithm that greatly reduces the sequencing read coverage required for error-free decoding by maximizing information from all sequence reads. These advances demonstrate a viable, large-scale system for DNA data storage and retrieval.

摘要

合成 DNA 具有耐用性，并且可以高密度地编码数字数据，因此它是一种有吸引力的数据存储介质。然而，目前在大规模上恢复存储的数据需要对池中的所有 DNA 进行测序，即使只需要提取信息的一部分。在这里，我们使用随机访问方法，在超过 1300 万个 DNA 寡核苷酸中编码和存储 35 个不同的文件（超过 200MB 的数据），并表明我们可以单独且无误地恢复每个文件。我们设计并验证了一个大型引物库，该库可以使用单个引物来恢复 DNA 中存储的所有文件。我们还开发了一种算法，该算法通过最大化所有序列读取的信息，大大减少了无错误解码所需的测序读取覆盖率。这些进展证明了一种可行的、大规模的 DNA 数据存储和检索系统。

相似文献

Random access in large-scale DNA data storage.大规模 DNA 数据存储中的随机访问。

Nat Biotechnol. 2018 Mar;36(3):242-248. doi: 10.1038/nbt.4079. Epub 2018 Feb 19.

Iterative Soft Decoding Algorithm for DNA Storage Using Quality Score and Redecoding.基于质量分数和重编码的 DNA 存储迭代软解码算法

IEEE Trans Nanobioscience. 2024 Jan;23(1):81-90. doi: 10.1109/TNB.2023.3284406. Epub 2024 Jan 3.

Driving the Scalability of DNA-Based Information Storage Systems.推动基于DNA的信息存储系统的可扩展性。

ACS Synth Biol. 2019 Jun 21;8(6):1241-1248. doi: 10.1021/acssynbio.9b00100. Epub 2019 May 24.

DNA Fountain enables a robust and efficient storage architecture.DNA 喷泉实现了稳健且高效的存储架构。

Science. 2017 Mar 3;355(6328):950-954. doi: 10.1126/science.aaj2038.

DNA Bloom Filter enables anti-contamination and file version control for DNA-based data storage.DNA Bloom Filter 可实现基于 DNA 的数据存储的防污染和文件版本控制。

Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae125.

DNA storage in thermoresponsive microcapsules for repeated random multiplexed data access.热响应微胶囊中的 DNA 存储，用于重复随机多路复用数据访问。

Nat Nanotechnol. 2023 Aug;18(8):912-921. doi: 10.1038/s41565-023-01377-4. Epub 2023 May 4.

Portable and Error-Free DNA-Based Data Storage.基于 DNA 的便携式无错误数据存储。

Sci Rep. 2017 Jul 10;7(1):5011. doi: 10.1038/s41598-017-05188-1.

A Characterization of the DNA Data Storage Channel.DNA 数据存储信道的特性描述。

Sci Rep. 2019 Jul 4;9(1):9663. doi: 10.1038/s41598-019-45832-6.

[DNA for information storage?].[用于信息存储的DNA？]

Med Sci (Paris). 2018 Jun-Jul;34(6-7):622-625. doi: 10.1051/medsci/20183406025. Epub 2018 Jul 31.

In search of perfect reads.寻找完美的读数。

BMC Bioinformatics. 2015;16 Suppl 17(Suppl 17):S7. doi: 10.1186/1471-2105-16-S17-S7. Epub 2015 Dec 7.

引用本文的文献

A compact cassette tape for DNA-based data storage.一种用于基于DNA的数据存储的紧凑型盒式磁带。

Sci Adv. 2025 Sep 12;11(37):eady3406. doi: 10.1126/sciadv.ady3406. Epub 2025 Sep 10.

Advancing synthesis-free and enzyme-free rewritable DNA memory through frameshift encoding and nanopore duplex interruption decoding.通过移码编码和纳米孔双链中断解码推进无合成和无酶的可重写DNA存储器。

PNAS Nexus. 2025 Sep 5;4(9):pgaf233. doi: 10.1093/pnasnexus/pgaf233. eCollection 2025 Sep.

High-speed 3D DNA PAINT and unsupervised clustering for unlocking 3D DNA origami cryptography.用于解锁3D DNA折纸密码学的高速3D DNA PAINT和无监督聚类

bioRxiv. 2025 Aug 19:2023.08.29.555281. doi: 10.1101/2023.08.29.555281.

Dna-storalator: a computational simulator for DNA data storage.DNA存储模拟器：一种用于DNA数据存储的计算模拟器。

BMC Bioinformatics. 2025 Aug 4;26(1):204. doi: 10.1186/s12859-025-06222-0.

Electro-switchable addressing system for achieving repetitive random data access.用于实现重复随机数据访问的电可切换寻址系统。

Nucleic Acids Res. 2025 Jul 19;53(14). doi: 10.1093/nar/gkaf733.

Exploring the intersection of natural sciences and information technology via entropy and randomness.通过熵与随机性探索自然科学与信息技术的交叉领域。

Nat Commun. 2025 Jul 29;16(1):6969. doi: 10.1038/s41467-025-62353-1.

Random access and semantic search in DNA data storage enabled by Cas9 and machine-guided design.由Cas9和机器引导设计实现的DNA数据存储中的随机访问和语义搜索。

Nat Commun. 2025 Jul 10;16(1):6388. doi: 10.1038/s41467-025-61264-5.

Hybridization-encoded DNA tags with paper-based readout for anti-forgery raw material tracking.用于防伪原材料追踪的具有纸质读数的杂交编码DNA标签。

Nat Commun. 2025 Jul 1;16(1):5832. doi: 10.1038/s41467-025-60282-7.

Directed assembly of single-stranded DNA fragments for data storage via protein-free catalytic splint ligation.通过无蛋白质催化夹板连接实现用于数据存储的单链DNA片段的定向组装。

Nucleic Acids Res. 2025 Jun 20;53(12). doi: 10.1093/nar/gkaf582.

INNSE: Invertible neural network-based DNA image storage with self-correction encoding.INNSE：基于可逆神经网络的具有自校正编码的DNA图像存储

Comput Struct Biotechnol J. 2025 Jun 6;27:2492-2502. doi: 10.1016/j.csbj.2025.06.003. eCollection 2025.

本文引用的文献

Portable and Error-Free DNA-Based Data Storage.基于 DNA 的便携式无错误数据存储。

Sci Rep. 2017 Jul 10;7(1):5011. doi: 10.1038/s41598-017-05188-1.

DNA Fountain enables a robust and efficient storage architecture.DNA 喷泉实现了稳健且高效的存储架构。

Science. 2017 Mar 3;355(6328):950-954. doi: 10.1126/science.aaj2038.

A Rewritable, Random-Access DNA-Based Storage System.一种基于DNA的可重写随机存取存储系统。

Sci Rep. 2015 Sep 18;5:14138. doi: 10.1038/srep14138.

Robust chemical preservation of digital information on DNA in silica with error-correcting codes.利用纠错码在硅基片上对 DNA 中的数字信息进行稳健的化学保存。

Angew Chem Int Ed Engl. 2015 Feb 16;54(8):2552-5. doi: 10.1002/anie.201411378. Epub 2015 Feb 4.

Large-scale de novo DNA synthesis: technologies and applications.大规模从头 DNA 合成：技术与应用。

Nat Methods. 2014 May;11(5):499-507. doi: 10.1038/nmeth.2918.

Towards practical, high-capacity, low-maintenance information storage in synthesized DNA.在合成 DNA 中实现实用、大容量、低维护的信息存储。

Nature. 2013 Feb 7;494(7435):77-80. doi: 10.1038/nature11875. Epub 2013 Jan 23.

Next-generation digital information storage in DNA.DNA 中的下一代数字信息存储。

Science. 2012 Sep 28;337(6102):1628. doi: 10.1126/science.1226355. Epub 2012 Aug 16.

NUPACK: Analysis and design of nucleic acid systems.NUPACK：核酸系统的分析与设计。

J Comput Chem. 2011 Jan 15;32(1):170-3. doi: 10.1002/jcc.21596.

Design of 240,000 orthogonal 25mer DNA barcode probes.240,000个正交25聚体DNA条形码探针的设计

Proc Natl Acad Sci U S A. 2009 Feb 17;106(7):2289-94. doi: 10.1073/pnas.0812506106. Epub 2009 Jan 26.

Long-term data storage in DNA.DNA中的长期数据存储。

Trends Biotechnol. 2001 Jul;19(7):247-50. doi: 10.1016/s0167-7799(01)01671-7.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

大规模 DNA 数据存储中的随机访问。

Random access in large-scale DNA data storage.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献