Department of Computational and Systems Biology, Dorothy P. and Richard P. Simmons Center for Interstitial Lung Disease, Division of Pulmonary, Allergy and Critical Care Medicine and Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15260, USA.
Nucleic Acids Res. 2011 Jun;39(11):e76. doi: 10.1093/nar/gkr205. Epub 2011 Apr 12.
DNA sequences bound by a transcription factor (TF) are presumed to contain sequence elements that reflect its DNA binding preferences and its downstream-regulatory effects. Experimentally identified TF binding sites (TFBSs) are usually similar enough to be summarized by a 'consensus' motif, representative of the TF DNA binding specificity. Studies have shown that groups of nucleotide TFBS variants (subtypes) can contribute to distinct modes of downstream regulation by the TF via differential recruitment of cofactors. A TF(A) may bind to TFBS subtypes a(1) or a(2) depending on whether it associates with cofactors TF(B) or TF(C), respectively. While some approaches can discover motif pairs (dyads), none address the problem of identifying 'variants' of dyads. TFs are key components of multiple regulatory pathways targeting different sets of genes perhaps with different binding preferences. Identifying the discriminating TF-DNA associations that lead to the differential downstream regulation is thus essential. We present DiSCo (Discovery of Subtypes and Cofactors), a novel approach for identifying variants of dyad motifs (and their respective target sequence sets) that are instrumental for differential downstream regulation. Using both simulated and experimental datasets, we demonstrate how current motif discovery can be successfully leveraged to address this question.
DNA 序列与转录因子 (TF) 结合,假定包含反映其 DNA 结合偏好及其下游调节效应的序列元件。实验鉴定的 TF 结合位点 (TFBS) 通常足够相似,可以用代表 TF DNA 结合特异性的“共识”基序来概括。研究表明,核苷酸 TFBS 变体(亚型)组可通过 TF 募集的不同辅助因子,有助于 TF 的下游调控的不同模式。TF(A)可能根据其与辅助因子 TF(B)或 TF(C)的结合情况,分别与 TFBS 亚型 a(1)或 a(2)结合。虽然有些方法可以发现基序对(对偶),但没有一种方法可以解决对偶变体的问题。TF 是靶向不同基因集(可能具有不同结合偏好)的多个调控途径的关键组成部分。因此,识别导致下游调控差异的区分 TF-DNA 关联至关重要。我们提出了 DiSCo(对偶基序的亚型和辅助因子的发现),这是一种用于识别对偶基序变体(及其各自的靶序列集)的新方法,对偶基序变体对差异下游调控至关重要。我们使用模拟和实验数据集演示了如何成功利用当前的基序发现来解决这个问题。