Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K.
Acc Chem Res. 2023 Mar 21;56(6):712-727. doi: 10.1021/acs.accounts.3c00029. Epub 2023 Mar 9.
ConspectusNucleic acids represent a unique class of highly programmable molecules, where the sequence of monomer units incorporated into the polymer chain can be read through duplex formation with a complementary oligomer. It should be possible to encode information in synthetic oligomers as a sequence of different monomer units in the same way that the four different bases program information into DNA and RNA. In this Account, we describe our efforts to develop synthetic duplex-forming oligomers composed of sequences of two complementary recognition units that can base-pair in organic solvents through formation of a single H-bond, and we outline some general guidelines for the design of new sequence-selective recognition systems.The design strategy has focused on three interchangeable modules that control recognition, synthesis, and backbone geometry. For a single H-bond to be effective as a base-pairing interaction, very polar recognition units, such as phosphine oxide and phenol, are required. Reliable base-pairing in organic solvents requires a nonpolar backbone, so that the only polar functional groups present are the donor and acceptor sites on the two recognition units. This criterion limits the range of functional groups that can be produced in the synthesis of oligomers. In addition, the chemistry used for polymerization should be orthogonal to the recognition units. Several compatible high yielding coupling chemistries that are suitable for the synthesis of recognition-encoded polymers are explored. Finally, the conformational properties of the backbone module play an important role in determining the supramolecular assembly pathways that are accessible to mixed sequence oligomers.Almost all complementary homo-oligomers will form duplexes provided the product of the association constant for formation of a base-pair and the effective molarity for the intramolecular base-pairing interactions that zip up the duplex is significantly greater than one. For these systems, the structure of the backbone does not play a major role, and the effective molarities for duplex formation tend to fall in the range 10-100 mM for both rigid and flexible backbones. For mixed sequences, intramolecular H-bonding interactions lead to folding. The competition between folding and duplex formation depends critically on the conformational properties of the backbone, and high-fidelity sequence-selective duplex formation is only observed for backbones that are sufficiently rigid to prevent short-range folding between bases that are close in sequence. The final section of the Account highlights the prospects for functional properties, other than duplex formation, that might be encoded with sequence.
核酸是一类独特的高度可编程分子,其中聚合物链中掺入的单体单元的序列可以通过与互补寡聚物形成双链来读取。应该可以像用四个不同碱基将信息编入 DNA 和 RNA 一样,将不同的单体单元序列编码为合成寡聚物中的信息。在本报告中,我们描述了开发由两个互补识别单元序列组成的合成双链形成寡聚物的努力,这些寡聚物可以通过形成单个氢键在有机溶剂中碱基配对,并且我们概述了一些新的序列选择性识别系统设计的一般准则。
设计策略集中在三个可互换的模块上,这些模块控制识别、合成和骨架几何形状。为了使单个氢键有效地作为碱基对相互作用,需要非常极性的识别单元,例如氧化膦和苯酚。在有机溶剂中可靠的碱基配对需要非极性骨架,因此存在的唯一极性官能团是两个识别单元上的供体和受体位点。该标准限制了可在寡聚物合成中产生的官能团的范围。此外,用于聚合的化学应与识别单元正交。探索了几种适合于识别编码聚合物合成的兼容高产偶联化学。最后,骨架模块的构象性质在决定混合序列寡聚物可及的超分子组装途径方面起着重要作用。
几乎所有互补的同寡聚物都会形成双链,只要形成碱基对的缔合常数与将双链拉链起来的分子内碱基配对相互作用的有效摩尔浓度显著大于一。对于这些系统,骨架的结构不起主要作用,并且刚性和柔性骨架的双链形成有效摩尔浓度倾向于落在 10-100mM 的范围内。对于混合序列,分子内氢键相互作用导致折叠。折叠和双链形成之间的竞争取决于骨架的构象性质,只有在足够刚性以防止序列相近的碱基之间短程折叠的情况下,才能观察到高保真序列选择性双链形成。报告的最后一部分强调了除双链形成之外的功能特性,这些特性可能被序列编码。