Suppr超能文献

用于生成具有生物学意义和准确的系统发育足迹分析的TF-DNA结合的分子和结构考量:以LysR型转录调节因子家族作为研究模型

Molecular and structural considerations of TF-DNA binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the LysR-type transcriptional regulator family as a study model.

作者信息

Oliver Patricia, Peralta-Gil Martín, Tabche María-Luisa, Merino Enrique

机构信息

Departmento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México.

Escuela Superior de Apan de la Universidad Autónoma del Estado de Hidalgo, Carretera Apan-Calpulalpan, Km 8, Chimalpa Tlalayote s/n, Colonia Chimalpa, Apan, Hidalgo, México.

出版信息

BMC Genomics. 2016 Aug 27;17(1):686. doi: 10.1186/s12864-016-3025-3.

Abstract

BACKGROUND

The goal of most programs developed to find transcription factor binding sites (TFBSs) is the identification of discrete sequence motifs that are significantly over-represented in a given set of sequences where a transcription factor (TF) is expected to bind. These programs assume that the nucleotide conservation of a specific motif is indicative of a selective pressure required for the recognition of a TF for its corresponding TFBS. Despite their extensive use, the accuracies reached with these programs remain low. In many cases, true TFBSs are excluded from the identification process, especially when they correspond to low-affinity but important binding sites of regulatory systems.

RESULTS

We developed a computational protocol based on molecular and structural criteria to perform biologically meaningful and accurate phylogenetic footprinting analyses. Our protocol considers fundamental aspects of the TF-DNA binding process, such as: i) the active homodimeric conformations of TFs that impose symmetric structures on the TFBSs, ii) the cooperative binding of TFs, iii) the effects of the presence or absence of co-inducers, iv) the proximity between two TFBSs or one TFBS and a promoter that leads to very long spurious motifs, v) the presence of AT-rich sequences not recognized by the TF but that are required for DNA flexibility, and vi) the dynamic order in which the different binding events take place to determine a regulatory response (i.e., activation or repression). In our protocol, the abovementioned criteria were used to analyze a profile of consensus motifs generated from canonical Phylogenetic Footprinting Analyses using a set of analysis windows of incremental sizes. To evaluate the performance of our protocol, we analyzed six members of the LysR-type TF family in Gammaproteobacteria.

CONCLUSIONS

The identification of TFBSs based exclusively on the significance of the over-representation of motifs in a set of sequences might lead to inaccurate results. The consideration of different molecular and structural properties of the regulatory systems benefits the identification of TFBSs and enables the development of elaborate, biologically meaningful and precise regulatory models that offer a more integrated view of the dynamics of the regulatory process of transcription.

摘要

背景

大多数用于寻找转录因子结合位点(TFBSs)的程序的目标是识别离散的序列基序,这些基序在一组预期转录因子(TF)会结合的序列中显著富集。这些程序假定特定基序的核苷酸保守性表明了识别TF与其相应TFBS所需的选择压力。尽管它们被广泛使用,但这些程序所达到的准确率仍然很低。在许多情况下,真正的TFBSs在识别过程中被排除,特别是当它们对应于调节系统的低亲和力但重要的结合位点时。

结果

我们开发了一种基于分子和结构标准的计算协议,以进行具有生物学意义和准确的系统发育足迹分析。我们的协议考虑了TF-DNA结合过程的基本方面,例如:i)TF的活性同二聚体构象,其在TFBSs上施加对称结构;ii)TF的协同结合;iii)共诱导剂存在或不存在的影响;iv)两个TFBSs之间或一个TFBS与启动子之间的接近度,这会导致非常长的假基序;v)TF不识别但DNA灵活性所需的富含AT序列的存在;vi)不同结合事件发生的动态顺序以确定调节反应(即激活或抑制)。在我们的协议中,上述标准用于分析使用一组增量大小的分析窗口从经典系统发育足迹分析生成的共有基序概况。为了评估我们协议的性能,我们分析了γ-变形菌中LysR型TF家族的六个成员。

结论

仅基于一组序列中基序富集的显著性来识别TFBSs可能会导致不准确的结果。考虑调节系统的不同分子和结构特性有利于TFBSs的识别,并能够开发出精细的、具有生物学意义和精确的调节模型,从而提供对转录调节过程动态的更综合的观点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6517/5002191/8a4c04385075/12864_2016_3025_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验