Ershov Pavel V, Yablokov Evgeniy O, Mezentsev Yuri V, Ivanov Alexis S
Institute of Biomedical Chemistry, Moscow 119121, Russia.
Biomedicines. 2024 Feb 28;12(3):544. doi: 10.3390/biomedicines12030544.
Malignant neoplasms are characterized by high molecular heterogeneity due to multilevel deregulation of gene expression and cellular functions. It is known that non-coding RNAs, including long intergenic non-coding RNAs (lincRNAs), can play significant roles in cancer biology. The current review focuses on a systematical analysis of genomic, transcriptomic, epigenomic, interactomic, and literature data on 65 lincRNAs of human chromosome 18 in the context of pan-cancer studies. The entire group of lincRNAs can be conditionally divided into 4 subgroups depending on experimental evidence on direct or indirect involvement in cancers and the biological associations with cancers, which we found during the data-mining process: the most studied (5 lincRNAs), moderately or poorly studied (11 lincRNAs), and understudied (31 lincRNAs). For the remaining 18 lincRNAs, data for analysis were fragmentary or missing. Among the key findings were the following: Of the lincRNAs of human chromosome 18, 40% have tissue-specific expression patterns, 22% of lincRNAs are known to have gene fusions, 40% of lincRNAs are prone to gene amplifications and/or deletions in cancers at a frequency greater than 3%, and 23% of lincRNAs are differentially expressed across cancer types, whereas 7% have subtype-specific expression patterns. LincRNAs' interactomes consist of 'master' microRNAs and 47 proteins (including cancer-associated proteins and microRNAs) that can interact with 3 or more lincRNAs. Functional enrichment analysis of a set of highly co-expressed genes retrieved for 17 lincRNAs in different cancer types indicated the potential associations of these lincRNAs with cellular signaling pathways. Six lincRNAs encoded small open-reading frame (smORF) proteins with emerging roles in cancers, and microRNAs as well as proteins with known functions in molecular carcinogenesis can bind to coding regions of smORFs. We identified seven transcriptomic signatures with potential prognostic value, consisting of two to seven different lincRNAs only. Taken together, the literature, biomedical, and molecular biology data analyzed indicated that only five of all lincRNAs of human chromosome 18 are cancer-associated, while eleven other lincRNAs have the tendency to be associated with cancers.
恶性肿瘤的特征是由于基因表达和细胞功能的多级失调而具有高分子异质性。已知包括长链基因间非编码RNA(lincRNA)在内的非编码RNA在癌症生物学中可发挥重要作用。本综述重点对泛癌研究背景下人类18号染色体上65种lincRNA的基因组、转录组、表观基因组、相互作用组和文献数据进行系统分析。根据在数据挖掘过程中发现的关于lincRNA直接或间接参与癌症以及与癌症的生物学关联的实验证据,lincRNA的整个群体可分为4个亚组:研究最多的(5种lincRNA)、研究中等或较少的(11种lincRNA)以及研究不足的(31种lincRNA)。对于其余18种lincRNA,分析数据不完整或缺失。主要发现如下:在人类18号染色体的lincRNA中,40%具有组织特异性表达模式,22%的lincRNA已知存在基因融合,40%的lincRNA在癌症中容易发生基因扩增和/或缺失,频率大于3%,23%的lincRNA在不同癌症类型中差异表达,而7%具有亚型特异性表达模式。lincRNA的相互作用组由“主要”微小RNA和47种蛋白质(包括癌症相关蛋白质和微小RNA)组成,这些蛋白质可与3种或更多lincRNA相互作用。对在不同癌症类型中为17种lincRNA检索到的一组高度共表达基因进行功能富集分析,表明这些lincRNA与细胞信号通路存在潜在关联。6种lincRNA编码在癌症中发挥新作用的小开放阅读框(smORF)蛋白,微小RNA以及在分子致癌过程中具有已知功能的蛋白质可与smORF的编码区域结合。我们鉴定出7种具有潜在预后价值的转录组特征,仅由2至7种不同的lincRNA组成。综合分析的文献、生物医学和分子生物学数据表明,人类18号染色体所有lincRNA中只有5种与癌症相关,而其他11种lincRNA有与癌症相关的倾向。