Filipovska Aleksandra, Rackham Oliver
Mol Biosyst. 2012 Mar;8(3):699-708. doi: 10.1039/c2mb05392f. Epub 2012 Jan 10.
Sequence specific binding of DNA and RNA is of fundamental importance in the regulation of cellular gene expression. Because of their modular structure repeat domain proteins are particularly well suited for these processes and have been widely adopted throughout evolution. Detailed biochemical and structural data has revealed the key residues responsible for recognition of RNA by Pumilio and FBF homology (PUF) repeat proteins and shown that the base specificity can be predicted and re-engineered. Recent work on the DNA-binding properties of transcription activator-like effector (TALE) proteins has shown that their specificity also relies on only a few key residues with a predictable code that can be used to design new DNA-binding proteins. Although less well understood, pentatricopeptide repeat (PPR) proteins contain motifs that appear to contribute to RNA recognition and comparisons to TALE and PUF proteins may help elucidate the code by which they recognize their RNA targets. Understanding how repeat proteins bind nucleic acids enables their biological roles to be uncovered and the design of engineered proteins with predictable RNA and DNA targets for use in biotechnology.
DNA和RNA的序列特异性结合在细胞基因表达调控中至关重要。由于其模块化结构,重复结构域蛋白特别适合这些过程,并且在整个进化过程中被广泛采用。详细的生化和结构数据揭示了负责Pumilio和FBF同源性(PUF)重复蛋白识别RNA的关键残基,并表明碱基特异性可以被预测和重新设计。最近关于转录激活因子样效应物(TALE)蛋白DNA结合特性的研究表明,它们的特异性也仅依赖于少数具有可预测编码的关键残基,该编码可用于设计新的DNA结合蛋白。尽管了解较少,但五肽重复(PPR)蛋白包含似乎有助于RNA识别的基序,与TALE和PUF蛋白的比较可能有助于阐明它们识别RNA靶标的编码。了解重复蛋白如何结合核酸能够揭示它们的生物学作用,并设计出具有可预测RNA和DNA靶标的工程蛋白用于生物技术。