Dolan Jackie, Walshe Karen, Alsbury Samantha, Hokamp Karsten, O'Keeffe Sean, Okafuji Tatsuya, Miller Suzanne F C, Tear Guy, Mitchell Kevin J
Smurfit Institute of Genetics, Trinity College Dublin, Dublin 2, Ireland.
BMC Genomics. 2007 Sep 14;8:320. doi: 10.1186/1471-2164-8-320.
Leucine-rich repeats (LRRs) are highly versatile and evolvable protein-ligand interaction motifs found in a large number of proteins with diverse functions, including innate immunity and nervous system development. Here we catalogue all of the extracellular LRR (eLRR) proteins in worms, flies, mice and humans. We use convergent evidence from several transmembrane-prediction and motif-detection programs, including a customised algorithm, LRRscan, to identify eLRR proteins, and a hierarchical clustering method based on TribeMCL to establish their evolutionary relationships.
This yields a total of 369 proteins (29 in worm, 66 in fly, 135 in mouse and 139 in human), many of them of unknown function. We group eLRR proteins into several classes: those with only LRRs, those that cluster with Toll-like receptors (Tlrs), those with immunoglobulin or fibronectin-type 3 (FN3) domains and those with some other domain. These groups show differential patterns of expansion and diversification across species. Our analyses reveal several clusters of novel genes, including two Elfn genes, encoding transmembrane proteins with eLRRs and an FN3 domain, and six genes encoding transmembrane proteins with eLRRs only (the Elron cluster). Many of these are expressed in discrete patterns in the developing mouse brain, notably in the thalamus and cortex. We have also identified a number of novel fly eLRR proteins with discrete expression in the embryonic nervous system.
This study provides the necessary foundation for a systematic analysis of the functions of this class of genes. These functions are likely to prominently include roles in innate immunity, inflammation and neural development, especially the specification of neuronal connectivity.