Department of Human Genetics, Molecular Biology Institute, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, USA.
PLoS One. 2009 Dec 31;4(12):e8491. doi: 10.1371/journal.pone.0008491.
In the post genome era, a major goal of biology is the identification of specific roles for individual genes. We report a new genomic tool for gene characterization, the UCLA Gene Expression Tool (UGET).
Celsius, the largest co-normalized microarray dataset of Affymetrix based gene expression, was used to calculate the correlation between all possible gene pairs on all platforms, and generate stored indexes in a web searchable format. The size of Celsius makes UGET a powerful gene characterization tool. Using a small seed list of known cartilage-selective genes, UGET extended the list of known genes by identifying 32 new highly cartilage-selective genes. Of these, 7 of 10 tested were validated by qPCR including the novel cartilage-specific genes SDK2 and FLJ41170. In addition, we retrospectively tested UGET and other gene expression based prioritization tools to identify disease-causing genes within known linkage intervals. We first demonstrated this utility with UGET using genetically heterogeneous disorders such as Joubert syndrome, microcephaly, neuropsychiatric disorders and type 2 limb girdle muscular dystrophy (LGMD2) and then compared UGET to other gene expression based prioritization programs which use small but discrete and well annotated datasets. Finally, we observed a significantly higher gene correlation shared between genes in disease networks associated with similar complex or Mendelian disorders.
UGET is an invaluable resource for a geneticist that permits the rapid inclusion of expression criteria from one to hundreds of genes in genomic intervals linked to disease. By using thousands of arrays UGET annotates and prioritizes genes better than other tools especially with rare tissue disorders or complex multi-tissue biological processes. This information can be critical in prioritization of candidate genes for sequence analysis.
在后基因组时代,生物学的主要目标之一是确定单个基因的特定作用。我们报告了一种新的基因特征描述基因组工具,即加州大学洛杉矶分校基因表达工具(UGET)。
使用 Celsius (基于 Affymetrix 的最大共归一化微阵列基因表达数据集)计算所有平台上所有可能基因对之间的相关性,并以可在网络上搜索的格式生成存储索引。Celsius 的大小使 UGET 成为一种强大的基因特征描述工具。使用已知软骨选择性基因的小种子列表,UGET 通过鉴定 32 个新的高度软骨选择性基因扩展了已知基因列表。其中,10 个测试中有 7 个通过 qPCR 得到验证,包括新的软骨特异性基因 SDK2 和 FLJ41170。此外,我们还回顾性地测试了 UGET 和其他基于基因表达的优先级排序工具,以在已知连锁区间内识别致病基因。我们首先使用 UGET 证明了这种用途,例如使用遗传异质性疾病,如 Joubert 综合征、小头症、神经精神疾病和 2 型肢体带肌营养不良症(LGMD2),然后将 UGET 与其他使用小而离散且注释良好的数据集的基于基因表达的优先级排序程序进行了比较。最后,我们观察到与类似复杂或孟德尔疾病相关的疾病网络中的基因之间具有更高的基因相关性。
UGET 是遗传学家的宝贵资源,它允许快速将一个到数百个与疾病相关的基因组区间中的表达标准纳入基因表达。通过使用数千个数组,UGET 比其他工具更好地注释和优先级排序基因,特别是对于罕见的组织疾病或复杂的多组织生物学过程。在候选基因序列分析的优先级排序中,这些信息可能至关重要。