Pavesi Giulio, Mauri Giancarlo, Pesole Graziano
University of Milan, Milano, Italy.
Brief Bioinform. 2004 Sep;5(3):217-36. doi: 10.1093/bib/5.3.217.
Understanding the complex mechanisms governing basic biological processes requires the characterisation of regulatory motifs modulating gene expression at transcriptional and post-transcriptional level. In particular, extent, chronology and cell-specificity of transcription are modulated by the interaction of transcription factors with their corresponding binding sites, mostly located near (or sometimes quite far away from) the transcription start site of the gene. The constantly growing amount of genomic data, complemented by other sources of information such as expression data derived from microarray experiments, has opened new opportunities to researchers in this field. Many different methods have been proposed for the identification of transcription factor binding sites in the regulatory regions of co-expressed genes: unfortunately this is a very challenging problem both from the computational and the biological viewpoint. This paper provides a survey of existing methods proposed for the problem, focusing both on the ideas underlying them and their availability to the scientific community.
要理解调控基本生物学过程的复杂机制,需要对在转录和转录后水平调节基因表达的调控基序进行表征。特别是,转录的范围、时间顺序和细胞特异性是由转录因子与其相应结合位点的相互作用所调节的,这些结合位点大多位于基因转录起始位点附近(有时距离转录起始位点很远)。不断增长的基因组数据,辅以其他信息来源,如来自微阵列实验的表达数据,为该领域的研究人员带来了新的机遇。已经提出了许多不同的方法来识别共表达基因调控区域中的转录因子结合位点:不幸的是,从计算和生物学角度来看,这都是一个极具挑战性的问题。本文对针对该问题提出的现有方法进行了综述,重点关注其背后的思想以及科学界对这些方法的可获取性。