Department of Chemistry, University of Warwick , Coventry CV4 7AL, United Kingdom.
Clarendon Laboratory, Department of Physics, University of Oxford , Parks Road, Oxford OX1 3PU, United Kingdom.
Acc Chem Res. 2017 Oct 17;50(10):2496-2509. doi: 10.1021/acs.accounts.7b00280. Epub 2017 Sep 15.
Precise control over reactivity and molecular structure is a fundamental goal of the chemical sciences. Billions of years of evolution by natural selection have resulted in chemical systems capable of information storage, self-replication, catalysis, capture and production of light, and even cognition. In all these cases, control over molecular structure is required to achieve a particular function: without structural control, function may be impaired, unpredictable, or impossible. The search for molecules with a desired function is often achieved by synthesizing a combinatorial library, which contains many or all possible combinations of a set of chemical building blocks (BBs), and then screening this library to identify "successful" structures. The largest libraries made by conventional synthesis are currently of the order of 10 distinct molecules. To put this in context, there are 10 ways of arranging the 21 proteinogenic amino acids in chains up to 10 units long. Given that we know that a number of these compounds have potent biological activity, it would be highly desirable to be able to search them all to identify leads for new drug molecules. Large libraries of oligonucleotides can be synthesized combinatorially and translated into peptides using systems based on biological replication such as mRNA display, with selected molecules identified by DNA sequencing; but these methods are limited to BBs that are compatible with cellular machinery. In order to search the vast tracts of chemical space beyond nucleic acids and natural peptides, an alternative approach is required. DNA-templated synthesis (DTS) could enable us to meet this challenge. DTS controls chemical product formation by using the specificity of DNA hybridization to bring selected reactants into close proximity, and is capable of the programmed synthesis of many distinct products in the same reaction vessel. By making use of dynamic, programmable DNA processes, it is possible to engineer a system that can translate instructions coded as a sequence of DNA bases into a chemical structure-a process analogous to the action of the ribosome in living organisms but with the potential to create a much more chemically diverse set of products. It is also possible to ensure that each product molecule is tagged with its identifying DNA sequence. Compound libraries synthesized in this way can be exposed to selection against suitable targets, enriching successful molecules. The encoding DNA can then be amplified using the polymerase chain reaction and decoded by DNA sequencing. More importantly, the DNA instruction sequences can be mutated and reused during multiple rounds of amplification, translation, and selection. In other words, DTS could be used as the foundation for a system of synthetic molecular evolution, which could allow us to efficiently search a vast chemical space. This has huge potential to revolutionize materials discovery-imagine being able to evolve molecules for light harvesting, or catalysts for CO fixation. The field of DTS has developed to the point where a wide variety of reactions can be performed on a DNA template. Complex architectures and autonomous "DNA robots" have been implemented for the controlled assembly of BBs, and these mechanisms have in turn enabled the one-pot synthesis of large combinatorial libraries. Indeed, DTS libraries are being exploited by pharmaceutical companies and have already found their way into drug lead discovery programs. This Account explores the processes involved in DTS and highlights the challenges that remain in creating a general system for molecular discovery by evolution.
精确控制反应性和分子结构是化学科学的基本目标。通过自然选择,数十亿年的进化产生了能够进行信息存储、自我复制、催化、捕获和产生光,甚至认知的化学系统。在所有这些情况下,都需要控制分子结构来实现特定的功能:没有结构控制,功能可能会受到损害、不可预测或不可能。寻找具有所需功能的分子通常是通过合成组合文库来实现的,该文库包含一组化学构建块 (BB) 的许多或所有可能组合,然后对该文库进行筛选以识别“成功”的结构。目前,通过传统合成方法合成的最大文库的规模约为 10 个不同的分子。为了说明这一点,有 10 种排列方式可以将 21 种蛋白质氨基酸排列成 10 个单位长的链。鉴于我们知道其中一些化合物具有很强的生物活性,那么能够搜索它们以识别新药分子的先导化合物将是非常理想的。可以通过基于生物复制的系统(如 mRNA 显示)组合合成和翻译寡核苷酸文库成肽,通过 DNA 测序鉴定选定的分子;但这些方法仅限于与细胞机制兼容的 BB。为了搜索超出核酸和天然肽的广阔化学空间,需要采用替代方法。DNA 模板合成 (DTS) 可以使我们能够应对这一挑战。DTS 通过利用 DNA 杂交的特异性将选定的反应物拉近,从而控制化学产物的形成,并且能够在同一反应容器中进行许多不同产物的程序化合成。通过利用动态、可编程的 DNA 过程,可以设计出一种能够将编码为 DNA 碱基序列的指令转化为化学结构的系统——这一过程类似于活生物体中核糖体的作用,但具有创造更具化学多样性的产物集的潜力。还可以确保每个产物分子都带有其识别的 DNA 序列。以这种方式合成的化合物文库可以暴露于对合适靶标的选择,从而富集成功的分子。然后可以使用聚合酶链反应扩增编码 DNA,并通过 DNA 测序进行解码。更重要的是,在多次扩增、翻译和选择过程中,可以突变和重复使用 DNA 指令序列。换句话说,DTS 可以用作合成分子进化系统的基础,这可以使我们能够有效地搜索广阔的化学空间。这具有彻底改变材料发现的巨大潜力——想象一下能够为光收集或 CO 固定的催化剂进化分子。DTS 领域已经发展到可以在 DNA 模板上进行各种反应的程度。已经为 BB 的受控组装实施了复杂的架构和自主的“DNA 机器人”,这些机制反过来又能够一锅合成大型组合文库。事实上,制药公司正在利用 DTS 文库,并且已经将其纳入药物先导发现计划。本账户探讨了 DTS 中涉及的过程,并强调了在创建通过进化进行分子发现的通用系统方面仍然存在的挑战。