Mao Linyong, Mackenzie Chris, Roh Jung H, Eraso Jesus M, Kaplan Samuel, Resat Haluk
Pacific Northwest National Laboratory, Computational Biology and Bioinformatics Group, PO Box 999, MS: K7-90, Richland, WA 99352, USA.
Department of Microbiology and Molecular Genetics, The University of Texas Health Science Center, Medical School, Houston, TX 77030, USA.
Microbiology (Reading). 2005 Oct;151(Pt 10):3197-3213. doi: 10.1099/mic.0.28167-0.
The ability to detect regulatory elements within genome sequences is important in understanding how gene expression is controlled in biological systems. In this work, microarray data analysis is combined with genome sequence analysis to predict DNA sequences in the photosynthetic bacterium Rhodobacter sphaeroides that bind the regulators PrrA, PpsR and FnrL. These predictions were made by using hierarchical clustering to detect genes that share similar expression patterns. The DNA sequences upstream of these genes were then searched for possible transcription factor recognition motifs that may be involved in their co-regulation. The approach used promises to be widely applicable for the prediction of cis-acting DNA binding elements. Using this method the authors were independently able to detect and extend the previously described consensus sequences that have been suggested to bind FnrL and PpsR. In addition, sequences that may be recognized by the global regulator PrrA were predicted. The results support the earlier suggestions that the DNA binding sequence of PrrA may have a variable-sized gap between its conserved block elements. Using the predicted DNA binding sequences, a whole-genome-scale analysis was performed to determine the relative importance of the interplay between the three regulators PpsR, FnrL and PrrA. Results of this analysis showed that, compared to the regulation by PpsR and FnrL, a much larger number of genes are candidates to be regulated by PrrA. The study demonstrates by example that integration of multiple data types can be a powerful approach for inferring transcriptional regulatory patterns in microbial systems, and it allowed the detection of photosynthesis-related regulatory patterns in R. sphaeroides.
检测基因组序列中的调控元件对于理解生物系统中基因表达的控制方式至关重要。在这项研究中,微阵列数据分析与基因组序列分析相结合,以预测光合细菌球形红杆菌中与调控因子PrrA、PpsR和FnrL结合的DNA序列。这些预测是通过使用层次聚类来检测具有相似表达模式的基因来进行的。然后在这些基因的上游DNA序列中搜索可能参与其共同调控的转录因子识别基序。所采用的方法有望广泛应用于顺式作用DNA结合元件的预测。使用这种方法,作者能够独立地检测并扩展先前描述的、被认为与FnrL和PpsR结合的共有序列。此外,还预测了可能被全局调控因子PrrA识别的序列。结果支持了早期的推测,即PrrA的DNA结合序列在其保守块元件之间可能有大小可变的间隙。利用预测的DNA结合序列,进行了全基因组规模的分析,以确定PpsR、FnrL和PrrA这三种调控因子之间相互作用的相对重要性。该分析结果表明,与PpsR和FnrL的调控相比,有更多基因可能受PrrA调控。该研究通过实例证明,整合多种数据类型可能是推断微生物系统中转录调控模式的有力方法,并且它能够检测球形红杆菌中与光合作用相关的调控模式。