New York Genome Center, New York, NY 10013, USA.
Department of Computer Science, Fu Foundation School of Engineering, Columbia University, New York, NY 10027, USA.
Science. 2017 Mar 3;355(6328):950-954. doi: 10.1126/science.aaj2038.
DNA is an attractive medium to store digital information. Here we report a storage strategy, called DNA Fountain, that is highly robust and approaches the information capacity per nucleotide. Using our approach, we stored a full computer operating system, movie, and other files with a total of 2.14 × 10 bytes in DNA oligonucleotides and perfectly retrieved the information from a sequencing coverage equivalent to a single tile of Illumina sequencing. We also tested a process that can allow 2.18 × 10 retrievals using the original DNA sample and were able to perfectly decode the data. Finally, we explored the limit of our architecture in terms of bytes per molecule and obtained a perfect retrieval from a density of 215 petabytes per gram of DNA, orders of magnitude higher than previous reports.
DNA 是存储数字信息的有吸引力的介质。在这里,我们报告了一种存储策略,称为 DNA 喷泉,它具有高度的鲁棒性,接近每个核苷酸的信息容量。使用我们的方法,我们将一个完整的计算机操作系统、电影和其他文件总计 2.14×10 字节存储在 DNA 寡核苷酸中,并从相当于单个 Illumina 测序模块的测序覆盖度中完美地检索到了信息。我们还测试了一个可以使用原始 DNA 样本进行 2.18×10 次检索的过程,并能够完美地解码数据。最后,我们探索了我们的架构在每分子字节方面的极限,并从每克 DNA 215 拍字节的密度中获得了完美的检索,这比以前的报告高出几个数量级。