Zeitlinger Julia
Stowers Institute for Medical Research, Kansas City, MO, USA.
The University of Kansas Medical Center, Kansas City, KS, USA.
Curr Opin Syst Biol. 2020 Oct;23:22-31. doi: 10.1016/j.coisb.2020.08.002. Epub 2020 Sep 4.
Genomics data are now being generated at large quantities, of exquisite high resolution and from single cells. They offer a unique opportunity to develop powerful machine learning algorithms, including neural networks, to uncover the rules of the cis-regulatory code. However, current modeling assumptions are often not based on state-of-the-art knowledge of the cis-regulatory code from transcription, developmental genetics, imaging and structural studies. Here I aim to fill this gap by giving a brief historical overview of the field, describing common misconceptions and providing knowledge that might help to guide computational approaches. I will describe the principles and mechanisms involved in the combinatorial requirement of transcription factor binding motifs for enhancer activity, including the role of chromatin accessibility, repressors and low-affinity motifs in the cis-regulatory code. Deciphering the cis-regulatory code would unlock an enormous amount of regulatory information in the genome and would allow us to locate cis-regulatory genetic variants involved in development and disease.
基因组学数据如今正以前所未有的高分辨率大量生成,且来自单细胞。它们为开发强大的机器学习算法(包括神经网络)提供了独特机遇,以揭示顺式调控密码的规则。然而,当前的建模假设往往并非基于转录、发育遗传学、成像和结构研究中关于顺式调控密码的最新知识。在此,我旨在通过简要回顾该领域的历史、描述常见误解并提供可能有助于指导计算方法的知识来填补这一空白。我将描述转录因子结合基序对增强子活性的组合需求所涉及的原理和机制,包括染色质可及性、阻遏物和低亲和力基序在顺式调控密码中的作用。解读顺式调控密码将解锁基因组中大量的调控信息,并使我们能够定位参与发育和疾病的顺式调控基因变异。