Novel retinal genes discovered by mining the mouse embryonic RetinalExpress database.
Author information
Liang Shuguang, Zhao Sheng, Mu Xiuqian, Thomas Terry, Klein William H
Affiliation
Department of Biochemistry and Molecular Biology, The University of Texas M. D. Anderson Cancer Center, Houston, TX 77030, USA.
Publication information
Mol Vis. 2004 Oct 8;10:773-86.
PURPOSE
Bioinformatics has emerged as a powerful tool for identifying novel genes and pathways associated with retinal biology and disease. The developing mouse retina expresses an exceedingly large and complex variety of genes. Many of these genes have not been characterized but nevertheless are likely to have important developmental or physiological functions. The purpose of this study was to use an in silico approach with a mouse embryonic retinal database of cDNAs/expressed sequence tags (ESTs) named RetinalExpress to identify previously uncharacterized genes that are represented in the developing retina.
METHODS
cDNA clones unique to the RetinalExpress database were identified by comparing clones in the RetinalExpress database with those in other cDNA/EST databases. We used a hierarchical filtering procedure with high stringency criteria that included sequence quality, colinearity with hypothetical gene sequences, and absence of any substantial existing annotation to select clones that were likely to represent novel genes. Selected clones were located on mouse chromosomes using National Center for Biotechnology Information Map Viewer software and the database from the University of California at Santa Cruz Genome Bioinformatics Web browser. The expression of selected retinal transcripts was determined using reverse transcriptase (RT)-PCR. In situ hybridization of sectioned embryonic and postnatal retinas was performed to determine spatial expression patterns of selected transcripts.
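The hierarchical filtering idea described above can be illustrated with a minimal sketch. This is not the authors' pipeline: the `Clone` fields, the `min_quality` threshold, and the order of the stages are all assumptions made for illustration, and the per-clone flags are taken as precomputed inputs rather than derived from real database searches.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass(frozen=True)
class Clone:
    """Hypothetical record for one cDNA clone (all field names are assumptions)."""
    clone_id: str
    mean_quality: float             # assumed per-clone sequence quality score
    found_elsewhere: bool           # True if matched in another cDNA/EST database
    colinear_with_gene_model: bool  # True if colinear with a hypothetical gene sequence
    has_annotation: bool            # True if substantial annotation already exists


def hierarchical_filter(
    clones: Iterable[Clone], min_quality: float = 20.0
) -> Tuple[List[Clone], List[Tuple[str, int]]]:
    """Apply each criterion in turn; return survivors and per-stage counts.

    The stage order and the default threshold are illustrative assumptions.
    """
    stages = [
        ("unique_to_database", lambda c: not c.found_elsewhere),
        ("sequence_quality", lambda c: c.mean_quality >= min_quality),
        ("colinear", lambda c: c.colinear_with_gene_model),
        ("unannotated", lambda c: not c.has_annotation),
    ]
    survivors = list(clones)
    counts = [("input", len(survivors))]
    for name, keep in stages:
        # Each stage only sees clones that passed every earlier stage.
        survivors = [c for c in survivors if keep(c)]
        counts.append((name, len(survivors)))
    return survivors, counts
```

Recording the count after each stage makes it easy to see where candidates are lost, which is the kind of funnel the RESULTS section reports (from all clones down to a small set of unique sequences).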
RESULTS
Of the 27,765 cDNA clones from RetinalExpress that we filtered through several public cDNA/EST databases, 26 cDNA/EST sequences were identified that, at the time of the analysis, were unique to RetinalExpress. Seventeen clones were selected for RT-PCR analysis, and retinal transcripts corresponding to previously uncharacterized genes were unambiguously detected for six clones. Three genes encoded open reading frames containing putative functional domains; one sequence contained an HMG DNA binding domain, another, an RFX DNA binding domain, and another, a phospholipase C catalytic domain X. Transcripts from the genes encoding DNA binding domains were expressed in embryonic and postnatal retinas with distinct spatial patterns.
CONCLUSIONS
The characterization of 26 mouse genes whose partial nucleotide sequences were uniquely represented in the RetinalExpress cDNA/EST database demonstrated the feasibility of retinal gene discovery using in silico analysis. Two of these genes had distinctive spatial expression patterns in the retina and one was likely to function as a DNA binding protein in embryonic and postnatal retinas. The gene identification approach described here demonstrates the usefulness of establishing large cDNA/EST databases from highly specialized neuronal tissues such as the retina to find novel genes.