The Morris Kahn Laboratory of Human Genetics, National Institute for Biotechnology in the Negev, Ben Gurion University, Beer-Sheva, Israel.
Hum Mutat. 2010 Mar;31(3):229-36. doi: 10.1002/humu.21171.
The identification of genomic loci associated with human genetic syndromes has been significantly facilitated through the generation of high density SNP arrays. However, optimal selection of candidate genes from within such loci is still a tedious labor-intensive bottleneck. Syndrome to Gene (S2G) is based on novel algorithms which allow an efficient search for candidate genes in a genomic locus, using known genes whose defects cause phenotypically similar syndromes. S2G (http://fohs.bgu.ac.il/s2g/index.html) includes two components: a phenotype Online Mendelian Inheritance in Man (OMIM)-based search engine that alleviates many of the problems in the existing OMIM search engine (negation phrases, overlapping terms, etc.). The second component is a gene prioritizing engine that uses a novel algorithm to integrate information from 18 databases. When the detailed phenotype of a syndrome is inserted to the web-based software, S2G offers a complete improved search of the OMIM database for similar syndromes. The software then prioritizes a list of genes from within a genomic locus, based on their association with genes whose defects are known to underlie similar clinical syndromes. We demonstrate that in all 30 cases of novel disease genes identified in the past year, the disease gene was within the top 20% of candidate genes predicted by S2G, and in most cases--within the top 10%. Thus, S2G provides clinicians with an efficient tool for diagnosis and researchers with a candidate gene prediction tool based on phenotypic data and a wide range of gene data resources. S2G can also serve in studies of polygenic diseases, and in finding interacting molecules for any gene of choice.
通过生成高密度 SNP 芯片,与人类遗传综合征相关的基因组基因座的鉴定已经得到了显著的促进。然而,从这些基因座中选择候选基因仍然是一个繁琐的、劳动密集型的瓶颈。“综合征到基因(S2G)”是基于新的算法,允许在基因组基因座中高效搜索候选基因,使用已知的缺陷导致表型相似综合征的基因。S2G(http://fohs.bgu.ac.il/s2g/index.html)包括两个组件:基于在线人类孟德尔遗传(OMIM)的表型搜索引擎,它缓解了现有 OMIM 搜索引擎中的许多问题(否定短语、重叠术语等)。第二个组件是一个基因优先级引擎,它使用一种新的算法来整合来自 18 个数据库的信息。当插入综合征的详细表型时,S2G 为类似综合征的 OMIM 数据库提供了一个完整的改进搜索。然后,该软件根据与已知缺陷导致类似临床综合征的基因的关联,从基因组基因座内对基因进行优先级排序。我们证明,在过去一年中确定的 30 个新疾病基因的所有病例中,疾病基因都在 S2G 预测的候选基因的前 20%之列,在大多数情况下,都在前 10%之列。因此,S2G 为临床医生提供了一种高效的诊断工具,为研究人员提供了一种基于表型数据和广泛的基因数据资源的候选基因预测工具。S2G 还可以用于多基因疾病的研究,以及为任何选择的基因寻找相互作用的分子。