Hill-Terán Guillermina, Petrich Julieta, Falcone Ferreyra Maria Lorena, Aybar Manuel J, Coux Gabriela
Instituto Superior de Investigaciones Biológicas (INSIBIO, CONICET-UNT), CONICET-UNT, San Miguel de Tucumán, Tucumán, Argentina.
Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Suipacha 531, (S2002LRK), Rosario, Santa Fe., Argentina.
J Mol Evol. 2024 Dec;92(6):744-760. doi: 10.1007/s00239-024-10200-0. Epub 2024 Sep 13.
Treacher Collins syndrome (TCS) is a genetic disorder affecting facial development, primarily caused by mutations in the TCOF1 gene. TCOF1, along with NOLC1, play important roles in ribosomal RNA transcription and processing. Previously, a zebrafish model of TCS successfully recapitulated the main characteristics of the syndrome by knocking down the expression of a gene on chromosome 13 (coding for Uniprot ID B8JIY2), which was identified as the TCOF1 orthologue. However, database updates renamed this gene as nolc1 and the zebrafish database (ZFIN) identified a different gene on chromosome 14 as the TCOF1 orthologue (coding for Uniprot ID E7F9D9). NOLC1 and TCOF1 are large proteins with unstructured regions and repetitive sequences that complicate alignments and comparisons. Also, the additional whole genome duplication of teleosts sets further difficulty. In this study, we present evidence that endorses that NOLC1 and TCOF1 are paralogs, and that the zebrafish gene on chromosome 14 is a low-complexity LisH domain-containing factor that displays homology to NOLC1 but lacks essential sequence features to accomplish TCOF1 nucleolar functions. Our analysis also supports the idea that zebrafish, as has been suggested for other non-tetrapod vertebrates, lack the TCOF1 gene that is associated with tripartite nucleolus. Using BLAST searches in a group of teleost genomes, we identified fish-specific sequences similar to E7F9D9 zebrafish protein. We propose naming them "LisH-containing Low Complexity Proteins" (LLCP). Interestingly, the gene on chromosome 13 (nolc1) displays the sequence features, developmental expression patterns, and phenotypic impact of depletion that are characteristic of TCOF1 functions. These findings suggest that in teleost fish, the nucleolar functions described for both NOLC1 and TCOF1 mediated by their repeated motifs, are carried out by a single gene, nolc1. Our study, which is mainly based on computational tools available as free web-based algorithms, could help to solve similar conflicts regarding gene orthology in zebrafish.
特雷彻·柯林斯综合征(TCS)是一种影响面部发育的遗传性疾病,主要由TCOF1基因突变引起。TCOF1与NOLC1一起在核糖体RNA转录和加工中发挥重要作用。此前,一个TCS斑马鱼模型通过敲低13号染色体上一个基因(编码Uniprot ID B8JIY2)的表达,成功重现了该综合征的主要特征,该基因被鉴定为TCOF1的直系同源基因。然而,数据库更新将该基因重新命名为nolc1,并且斑马鱼数据库(ZFIN)将14号染色体上的另一个基因鉴定为TCOF1的直系同源基因(编码Uniprot ID E7F9D9)。NOLC1和TCOF1是具有无结构区域和重复序列的大蛋白,这使得比对和比较变得复杂。此外,硬骨鱼额外的全基因组复制带来了更多困难。在本研究中,我们提供的证据支持NOLC1和TCOF1是旁系同源基因,并且14号染色体上的斑马鱼基因是一种含有低复杂性LisH结构域的因子,它与NOLC1具有同源性,但缺乏完成TCOF1核仁功能所需的关键序列特征。我们的分析还支持这样一种观点,即正如对其他非四足动物脊椎动物所提出的那样,斑马鱼缺乏与三联体核仁相关的TCOF1基因。通过在一组硬骨鱼基因组中进行BLAST搜索,我们鉴定出了与斑马鱼E7F9D9蛋白相似的鱼类特异性序列。我们建议将它们命名为“含LisH的低复杂性蛋白”(LLCP)。有趣的是,13号染色体上的基因(nolc1)显示出TCOF1功能所特有的序列特征、发育表达模式以及缺失后的表型影响。这些发现表明,在硬骨鱼中,由NOLC1和TCOF1的重复基序介导的核仁功能由单个基因nolc1执行。我们的研究主要基于可作为免费网络算法使用的计算工具,有助于解决斑马鱼中类似的基因直系同源性冲突问题。