Glusman G, Bahar A, Sharon D, Pilpel Y, White J, Lancet D
Department of Molecular Genetics and the Crown Human Genome Center, The Weizmann Institute of Science, Rehovot, Israel.
Mamm Genome. 2000 Nov;11(11):1016-23. doi: 10.1007/s003350010196.
The vertebrate olfactory receptor (OR) subgenome harbors the largest known gene family, which has been expanded by the need to provide recognition capacity for millions of potential odorants. We implemented an automated procedure to identify all OR coding regions from published sequences. This led us to the identification of 831 OR coding regions (including pseudogenes) from 24 vertebrate species. The resulting dataset was subjected to neighbor-joining phylogenetic analysis and classified into 32 distinct families, 14 of which include only genes from tetrapodan species (Class II ORs). We also report here the first identification of OR sequences from a marsupial (koala) and a monotreme (platypus). Analysis of these OR sequences suggests that the ancestral mammal had a small OR repertoire, which expanded independently in all three mammalian subclasses. Classification of "fish-like" (Class I) ORs indicates that some of these ancient ORs were maintained and even expanded in mammals. A nomenclature system for the OR gene superfamily is proposed, based on a divergence evolutionary model. The nomenclature consists of the root symbol 'OR', followed by a family numeral, subfamily letter(s), and a numeral representing the individual gene within the subfamily. For example, OR3A1 is an OR gene of family 3, subfamily A, and OR7E12P is an OR pseudogene of family 7, subfamily E. The symbol is to be preceded by a species indicator. We have assigned the proposed nomenclature symbols for all 330 human OR genes in the database. A WWW tool for automated name assignment is provided.
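The symbol format described above (root 'OR', family numeral, subfamily letter(s), gene numeral) can be illustrated with a minimal parsing sketch. This is not the authors' WWW tool. The names `ORSymbol` and `parse_or_symbol` are hypothetical, and treating a trailing `P` as the pseudogene marker is an assumption inferred from the OR7E12P example. The species indicator is not handled, because the abstract does not specify its format.

```python
import re
from typing import NamedTuple, Optional


class ORSymbol(NamedTuple):
    """Parsed parts of an OR gene symbol (hypothetical helper type)."""
    family: int        # family numeral, e.g. 7 in OR7E12P
    subfamily: str     # subfamily letter(s), e.g. "E"
    member: int        # individual gene number within the subfamily
    pseudogene: bool   # True if the symbol ends in "P" (assumed convention)


# Root "OR", family numeral, subfamily letter(s), gene numeral, optional "P".
_OR_PATTERN = re.compile(r"OR(\d+)([A-Z]+)(\d+)(P?)")


def parse_or_symbol(symbol: str) -> Optional[ORSymbol]:
    """Split a symbol such as 'OR7E12P' into its parts.

    Returns None if the string does not follow the assumed format.
    """
    match = _OR_PATTERN.fullmatch(symbol)
    if match is None:
        return None
    family, subfamily, member, suffix = match.groups()
    return ORSymbol(int(family), subfamily, int(member), suffix == "P")


if __name__ == "__main__":
    for name in ("OR3A1", "OR7E12P"):
        print(name, parse_or_symbol(name))
```

Applied to the two examples in the abstract, the sketch reports OR3A1 as family 3, subfamily A, gene 1, and OR7E12P as family 7, subfamily E, gene 12, flagged as a pseudogene.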