Lee Ji-Hoon, Lee Seung Hwan, Baek Christina, Chun Hyosun, Ryu Je-Hwan, Kim Jin-Woo, Deaton Russell, Zhang Byoung-Tak
Graduate Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea.
School of Chemical and Biological Engineering, Seoul National University, Seoul, Republic of Korea.
Biosystems. 2017 Aug;158:1-9. doi: 10.1016/j.biosystems.2017.04.005. Epub 2017 Apr 29.
Programmable biomolecules, such as DNA strands, deoxyribozymes, and restriction enzymes, have been used to solve computational problems, construct large-scale logic circuits, and program simple molecular games. Although studies have shown the potential of molecular computing, the capability of computational learning with DNA molecules, i.e., molecular machine learning, has yet to be experimentally verified. Here, we present a novel molecular learning in vitro model in which symmetric internal loops of double-stranded DNA are exploited to measure the differences between training instances, thus enabling the molecules to learn from small errors. The model was evaluated on a data set of twenty dialogue sentences obtained from the television shows Friends and Prison Break. The wet DNA-computing experiments confirmed that the molecular learning machine was able to generalize the dialogue patterns of each show and successfully identify the show from which the sentences originated. The molecular machine learning model described here opens the way for solving machine learning problems in computer science and biology using in vitro molecular computing with the data encoded in DNA molecules.
可编程生物分子,如DNA链、脱氧核酶和限制酶,已被用于解决计算问题、构建大规模逻辑电路以及编写简单的分子游戏程序。尽管研究已经显示了分子计算的潜力,但利用DNA分子进行计算学习的能力,即分子机器学习,尚未得到实验验证。在此,我们提出了一种新型的体外分子学习模型,其中利用双链DNA的对称内部环来测量训练实例之间的差异,从而使分子能够从小误差中学习。该模型在从电视剧《老友记》和《越狱》中获取的20个对话句子的数据集上进行了评估。湿DNA计算实验证实,分子学习机器能够归纳出每个电视剧的对话模式,并成功识别出句子所源自的电视剧。这里描述的分子机器学习模型为利用体外分子计算以及DNA分子中编码的数据来解决计算机科学和生物学中的机器学习问题开辟了道路。