Margulies E H, Kardia S L, Innis J W
Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA.
Genome Res. 2001 Oct;11(10):1686-98. doi: 10.1101/gr.192601.
The analysis of differentially expressed genes is a powerful approach to elucidate the genetic mechanisms underlying the morphological and evolutionary diversity among serially homologous structures, both within the same organism (e.g., hand vs. foot) and between different species (e.g., hand vs. wing). In the developing embryo, limb-specific expression of Pitx1, Tbx4, and Tbx5 regulates the determination of limb identity. However, numerous lines of evidence, including the fact that these three genes encode transcription factors, indicate that additional genes are involved in the Pitx1-Tbx hierarchy. To examine the molecular distinctions coded for by these factors, and to identify novel genes involved in the determination of limb identity, we have used Serial Analysis of Gene Expression (SAGE) to generate comprehensive gene expression profiles from intact, developing mouse forelimbs and hindlimbs. To minimize the extraction of erroneous SAGE tags from low-quality sequence data, we used a new algorithm to extract tags from -analyzed sequence data and obtained 68,406 and 68,450 SAGE tags from forelimb and hindlimb SAGE libraries, respectively. We also developed an improved method for determining the identity of SAGE tags that increases the specificity of and provides additional information about the confidence of the tag-UniGene cluster match. The most differentially expressed gene between our SAGE libraries was Pitx1. The differential expression of Tbx4, Tbx5, and several limb-specific Hox genes was also detected; however, their abundances in the SAGE libraries were low. Because numerous other tags were differentially expressed at this low level, we performed a 'virtual' subtraction with 362,344 tags from six additional nonlimb SAGE libraries to further refine this set of candidate genes. This subtraction reduced the number of candidate genes by 74%, yet preserved the previously identified regulators of limb identity. This study presents the gene expression complexity of the developing limb and identifies candidate genes involved in the regulation of limb identity. We propose that our computational tools and the overall strategy used here are broadly applicable to other SAGE-based studies in a variety of organisms. [SAGE data are all available at GEO (http://www.ncbi.nlm.nih.gov/geo/) under accession nos. GSM55 and GSM56, which correspond to the forelimb and hindlimb raw SAGE data.]
差异表达基因的分析是一种强有力的方法,可用于阐明在同一生物体内部(例如手与脚)以及不同物种之间(例如手与翅膀),系列同源结构的形态和进化多样性背后的遗传机制。在发育中的胚胎中,Pitx1、Tbx4和Tbx5的肢体特异性表达调节肢体身份的确定。然而,包括这三个基因编码转录因子这一事实在内的大量证据表明,其他基因也参与了Pitx1 - Tbx层级关系。为了研究这些因子所编码的分子差异,并鉴定参与肢体身份确定的新基因,我们使用基因表达序列分析(SAGE)技术,从完整的、发育中的小鼠前肢和后肢生成全面的基因表达谱。为了尽量减少从低质量序列数据中提取错误的SAGE标签,我们使用一种新算法从分析过的序列数据中提取标签,分别从前肢和后肢SAGE文库中获得了68,406个和68,450个SAGE标签。我们还开发了一种改进方法来确定SAGE标签的身份,该方法提高了标签与基因簇匹配的特异性,并提供了有关匹配可信度的额外信息。我们的SAGE文库之间差异表达最明显的基因是Pitx1。还检测到了Tbx4、Tbx5和几个肢体特异性Hox基因的差异表达;然而,它们在SAGE文库中的丰度较低。由于许多其他标签在这个低水平上也有差异表达,我们用来自另外六个非肢体SAGE文库的362,344个标签进行了一次“虚拟”减法,以进一步优化这组候选基因。这次减法使候选基因的数量减少了74%,但保留了先前鉴定出的肢体身份调节因子。这项研究展示了发育中肢体的基因表达复杂性,并鉴定出参与肢体身份调节的候选基因。我们提出,我们的计算工具和这里使用的总体策略广泛适用于各种生物体中其他基于SAGE的研究。[SAGE数据均可在GEO(http://www.ncbi.nlm.nih.gov/geo/)上获取,登录号分别为GSM55和GSM56,分别对应前肢和后肢的原始SAGE数据。]