Alkema Wynand B L, Lenhard Boris, Wasserman Wyeth W
Center for Genomics and Bioinformatics, Karolinska Institutet, Stockholm, Sweden.
Genome Res. 2004 Jul;14(7):1362-73. doi: 10.1101/gr.2242604.
A transcriptional regulatory network encompasses sets of genes (regulons) whose expression states are directly altered in response to an activating signal, mediated by trans-acting regulatory proteins and cis-acting regulatory sequences. Enumeration of these network components is an essential step toward the creation of a framework for systems-based analysis of biological processes. Profile-based methods for the detection of cis-regulatory elements are often applied to predict regulon members, but they suffer from poor specificity. In this report we describe Regulogger, a novel computational method that uses comparative genomics to eliminate spurious members of predicted gene regulons. Regulogger produces regulogs, sets of coregulated genes for which the regulatory sequence has been conserved across multiple organisms. The quantitative method assigns a confidence score to each predicted regulog member on the basis of the degree of conservation of protein sequence and regulatory mechanisms. When applied to a reference collection of regulons from Escherichia coli, Regulogger increased the specificity of predictions up to 25-fold over methods that use cis-element detection in isolation. The enhanced specificity was observed across a wide range of biologically meaningful parameter combinations, indicating a robust and broad utility for the method. The power of computational pattern discovery methods coupled with Regulogger to unravel transcriptional networks was demonstrated in an analysis of the genome of Staphylococcus aureus. A total of 125 regulogs were found in this organism, including both well-defined functional groups and a subset with unknown functions.
转录调控网络包含这样一些基因集(调控子),其表达状态会响应激活信号而直接改变,这种改变由反式作用调控蛋白和顺式作用调控序列介导。列举这些网络组件是创建基于系统的生物过程分析框架的关键一步。基于图谱的顺式调控元件检测方法常被用于预测调控子成员,但它们的特异性较差。在本报告中,我们描述了Regulogger,这是一种利用比较基因组学来消除预测基因调控子中虚假成员的新型计算方法。Regulogger产生调控记录,即一组共调控基因,其调控序列在多种生物体中保守。这种定量方法基于蛋白质序列的保守程度和调控机制为每个预测的调控子成员赋予一个置信度分数。当应用于来自大肠杆菌的调控子参考集合时,与单独使用顺式元件检测的方法相比,Regulogger将预测的特异性提高了25倍。在广泛的生物学意义参数组合中都观察到了特异性的增强,这表明该方法具有强大且广泛的实用性。在对金黄色葡萄球菌基因组的分析中,证明了计算模式发现方法与Regulogger相结合在解析转录网络方面的能力。在该生物体中总共发现了125个调控记录,包括明确的功能组和一个功能未知的子集。