Kittler Josef T., Oliver Peter L.
Over the last 10 years, large-scale genomic sequencing has resulted in the completion of the mouse and human genomes and those of several invertebrate model organisms, including the nematode worm and fruit fly. Of the approximately 30,000 proteins that are likely to exist in a particular mammalian genome, as many as 30% of these are estimated to be represented in the brain. However, reductionist approaches (such as one gene or one protein at a time) have to date provided functional information for only 10 to 15% of predicted proteins. Consequently, a large gap exists between the number of known genes and the identification of the corresponding proteins’ function. Similarly, the synaptic function is not known for the vast majority of all proteins expressed in the brain, or even the 500 to 1000 components of the post-synaptic density (PSD) revealed by proteomic analyses [1–6]. Therefore, one long-term goal of neuroscience research is to understand the role each and every gene plays in regulating aspects of nervous system function and synaptic physiology. The availability of genomic data sets is greatly facilitating the implementation of additional “genome-wide” projects, providing new research tools and reagents to the wider scientific community. Although varied in approach, these programs have all benefited from the availability of genomic sequence information and have the common goal of determining functions for the entire set of genes of a particular organism. These resources therefore have direct applicability to gene function determination in the nervous system. Several of the many projects include: the identification and cataloguing of the entire collection of expressed proteins (transcriptome) for a particular organism, including human, mouse, rat, fly and worm, and the determination of their expression pattern and subcellular localization; the identification of the protein constituents (proteome) of particular tissues, cell types or subcellular compartments (for instance, the PSD); the systematic identification of the entire complement of protein-protein interactions for a particular organism or biological process within that organism; the systematic knockdown/disruption of every gene in a particular organism by RNA interference (RNAi) or gene knockout approaches; and the production of large sets of physiologically relevant mutant phenotypes in several model organisms, including the mouse. These ongoing projects are beginning to provide new data sets and resources that can be directly accessed by the neuroscience community and which, over the next few years, will provide an expanding source of information and reagents that can be directly implemented into studies of neuronal physiology. The availability and standardization of such datasets will benefit biologists in the long term, whereby less time producing similar reagents on a small scale will be required. In addition, these resources will also allow a “system’s biology” approach to understanding brain function and the regulation of the synapse [7]. Here, we review some of the newly available genomic and post-genomic resources, which could have a direct relevance for current and future studies of synapse biology.
在过去十年中,大规模基因组测序已完成了小鼠、人类以及包括线虫和果蝇在内的几种无脊椎动物模式生物的基因组测序。在特定哺乳动物基因组中可能存在的约30000种蛋白质中,估计多达30%在大脑中有表达。然而,迄今为止,还原论方法(如一次研究一个基因或一种蛋白质)只为10%至15%的预测蛋白质提供了功能信息。因此,已知基因的数量与相应蛋白质功能的鉴定之间存在很大差距。同样,对于大脑中表达的绝大多数蛋白质,甚至蛋白质组学分析揭示的突触后致密区(PSD)的500至1000个成分,其突触功能也尚不清楚[1 - 6]。因此,神经科学研究的一个长期目标是了解每个基因在调节神经系统功能和突触生理学方面所起的作用。基因组数据集的可用性极大地促进了更多“全基因组”项目的实施,为更广泛的科学界提供了新的研究工具和试剂。尽管这些项目方法各异,但都受益于基因组序列信息的可用性,并且有一个共同目标,即确定特定生物体的全套基因的功能。因此,这些资源可直接应用于神经系统中基因功能的确定。众多项目中的几个包括:鉴定和编目特定生物体(包括人类、小鼠、大鼠、果蝇和线虫)表达的所有蛋白质(转录组),并确定其表达模式和亚细胞定位;鉴定特定组织、细胞类型或亚细胞区室(如PSD)的蛋白质成分(蛋白质组);系统鉴定特定生物体或该生物体内特定生物学过程中蛋白质 - 蛋白质相互作用的全部组成;通过RNA干扰(RNAi)或基因敲除方法系统敲低/破坏特定生物体中的每个基因;以及在包括小鼠在内的几种模式生物中产生大量生理相关的突变表型。这些正在进行的项目开始提供新的数据集和资源,神经科学界可直接获取,并且在未来几年将提供不断扩大的信息和试剂来源,可直接应用于神经元生理学研究。这些数据集的可用性和标准化从长远来看将使生物学家受益,从而减少小规模生产类似试剂所需的时间。此外,这些资源还将允许采用“系统生物学”方法来理解脑功能和突触调节[7]。在这里,我们回顾一些新可用的基因组和后基因组资源,它们可能与当前和未来的突触生物学研究直接相关。