Department of Materials Science and Engineering, University of California, Irvine, California 92697, USA.
Department of Physics and Astronomy, University of California, Irvine, California 92697, USA.
Chem Commun (Camb). 2023 Aug 24;59(69):10360-10375. doi: 10.1039/d3cc02890a.
Sequence-encoded biomolecules such as DNA and peptides are powerful programmable building blocks for nanomaterials. This paradigm is enabled by decades of prior research into how nucleic acid and amino acid sequences dictate biomolecular interactions. The properties of biomolecular materials can be significantly expanded with non-natural interactions, including metal ion coordination of nucleic acids and amino acids. However, these approaches present design challenges because it is often not well-understood how biomolecular sequence dictates such non-natural interactions. This Feature Article presents a case study in overcoming challenges in biomolecular materials with emerging approaches in data mining and machine learning for chemical design. We review progress in this area for a specific class of DNA-templated metal nanomaterials with complex sequence-to-property relationships: DNA-stabilized silver nanoclusters (Ag-DNAs) with bright, sequence-tuned fluorescence colors and promise for biophotonics applications. A brief overview of machine learning concepts is presented, and high-throughput experimental synthesis and characterization of Ag-DNAs are discussed. Then, recent progress in machine learning-guided design of DNA sequences that select for specific Ag-DNA fluorescence properties is reviewed. We conclude with emerging opportunities in machine learning-guided design and discovery of Ag-DNAs and other sequence-encoded biomolecular nanomaterials.
序列编码的生物分子,如 DNA 和肽,是用于纳米材料的强大可编程构建块。这一范例得益于几十年来对核酸和氨基酸序列如何决定生物分子相互作用的研究。通过非天然相互作用,可以显著扩展生物分子材料的性能,包括核酸和氨基酸的金属离子配位。然而,这些方法提出了设计挑战,因为通常不太清楚生物分子序列如何决定这种非天然相互作用。本文通过数据挖掘和机器学习在化学设计中的新兴方法,针对具有复杂序列-性能关系的一类特定生物分子材料(即 DNA 模板金属纳米材料),提供了克服挑战的案例研究。我们回顾了该领域的进展,该领域涉及具有明亮、序列可调荧光颜色的 DNA 稳定银纳米团簇(Ag-DNAs),有望应用于生物光子学。本文还简要介绍了机器学习概念,并讨论了 Ag-DNAs 的高通量实验合成和表征。然后,我们回顾了机器学习指导设计选择特定 Ag-DNA 荧光特性的 DNA 序列的最新进展。最后,我们探讨了机器学习指导 Ag-DNAs 和其他序列编码生物分子纳米材料的设计和发现的新兴机会。