Li Chuan-Yun, Liu Qing-Rong, Zhang Ping-Wu, Li Xiao-Mo, Wei Liping, Uhl George R
Molecular Neurobiology Branch, NIH-IRP (NIDA), Baltimore, MD 21224, USA.
Nucleic Acids Res. 2009 Jan;37(Database issue):D251-60. doi: 10.1093/nar/gkn568. Epub 2008 Sep 12.
'Cell adhesion molecules' (CAMs) are essential elements of cell/cell communication that are important for proper development and plasticity of a variety of organs and tissues. In the brain, appropriate assembly and tuning of neuronal connections is likely to require appropriate function of many cell adhesion processes. Genetic studies have linked and/or associated CAM variants with psychiatric, neurologic, neoplastic, immunologic and developmental phenotypes. However, despite increasing recognition of their functional and pathological significance, no systematic study has enumerated CAMs or documented their global features. We now report compilation of 496 human CAM genes in six gene families based on manual curation of protein domain structures, Gene Ontology annotations, and 1487 NCBI Entrez annotations. We map these genes onto a cell adhesion molecule ontology that contains 850 terms, up to seven levels of depth and provides a hierarchical description of these molecules and their functions. We develop OKCAM, a CAM knowledgebase that provides ready access to these data and ontologic system at http://okcam.cbi.pku.edu.cn. We identify global CAM properties that include: (i) functional enrichment, (ii) over-represented regulation modes and expression patterns and (iii) relationships to human Mendelian and complex diseases, and discuss the strengths and limitations of these data.
“细胞黏附分子”(CAMs)是细胞/细胞通讯的关键要素,对多种器官和组织的正常发育及可塑性至关重要。在大脑中,神经元连接的恰当组装和调节可能需要众多细胞黏附过程的正常功能。遗传学研究已将CAM变体与精神、神经、肿瘤、免疫及发育表型联系起来和/或关联起来。然而,尽管对其功能和病理意义的认识不断增加,但尚无系统研究对CAMs进行过枚举或记录其整体特征。我们现在报告基于对蛋白质结构域结构、基因本体注释以及1487条NCBI Entrez注释的人工整理,在六个基因家族中汇编了496个人类CAM基因。我们将这些基因映射到一个细胞黏附分子本体上,该本体包含850个术语,深度可达七个层次,并对这些分子及其功能进行了分层描述。我们开发了OKCAM,一个CAM知识库,可通过http://okcam.cbi.pku.edu.cn随时访问这些数据和本体系统。我们确定了CAM的整体特性,包括:(i)功能富集,(ii)过度代表的调节模式和表达模式,以及(iii)与人类孟德尔疾病和复杂疾病的关系,并讨论了这些数据的优势和局限性。