Li Yang, Liu Yahui, Xie Yingxin, Wang Yaxuan, Wang Jing, Wang Huan, Xia Lin, Xie Dan
Laboratory of Omics Technology and Bioinformatics, Frontiers Science Center for Disease-Related Molecular Network, State Key Laboratory of Biotherapy, West China Hospital, Sichuan University, Chengdu, 610041, Sichuan, China.
West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, 610041, Sichuan, China.
BMC Cancer. 2025 Mar 15;25(1):482. doi: 10.1186/s12885-025-13888-5.
Transposable elements (TEs), which constitute nearly half of the human genome, have long been regarded as genomic "dark matter". However, their reactivation in tumor cells, resulting in the production of TE-chimeric transcripts (TCTs), has emerged as a potential driver of cancer progression. The complexity and full extent of these transcripts remain elusive, largely due to the limitations of short-read next-generation sequencing technologies. These methods have struggled to comprehensively capture the diversity and structure of TCTs, particularly those involving short interspersed nuclear elements (SINEs) or closely co-transcribed TEs.
Leveraging full-length cDNA sequencing technology based on nanopore sequencing platform, we developed a customized pipeline for identifying and quantifying TCTs in 19 lung adenocarcinoma (LUAD) cell lines. The short-read RNA-seq dataset from a LUAD corhort (~ 200 tumor samples) was employed to validate the identified TCTs and explore their association with tumor progression. To assess the functional roles of a specific TCTs, cell migration and cell proliferation assays were performed.
We uncovered 208 unique TCT candidates in the LUAD cell lines. Our approach allowed for the identification of cryptic promoters and terminators within non-transposing TEs. Notably, we identified a chimeric transcript involving MIR_HKDC1, which appears to play a significant role in the progression of LUAD. Furthermore, the expression of these TCTs were associated with poor clinical outcomes in a cohort of LUAD patients, suggesting their potential as novel biomarkers for both LUAD progression and prognosis.
Our study underscores the application of long-read sequencing to unravel the complex landscape of TCTs in LUAD. We provide a comprehensive characterization of TCTs in LUAD, exploring their potential regulatory roles in cancer progression. These findings contribute to a deeper understanding of the genomic intricacies underlying cancer, and offer new directions for the development of targeted therapies and personalized treatment strategies for LUAD. This research highlights the potential of TCTs as both biomarkers and therapeutic targets in the oncogenesis, offering new insights into the interplay between transposable elements and gene regulation in cancer.
转座元件(TEs)构成了近一半的人类基因组,长期以来一直被视为基因组的“暗物质”。然而,它们在肿瘤细胞中的重新激活,导致产生TE嵌合转录本(TCTs),已成为癌症进展的潜在驱动因素。这些转录本的复杂性和全貌仍然难以捉摸,这主要是由于短读长下一代测序技术的局限性。这些方法难以全面捕捉TCTs的多样性和结构,特别是那些涉及短散在核元件(SINEs)或紧密共转录的TEs。
利用基于纳米孔测序平台的全长cDNA测序技术,我们开发了一个定制流程,用于鉴定和定量19种肺腺癌(LUAD)细胞系中的TCTs。使用来自LUAD队列(约200个肿瘤样本)的短读长RNA-seq数据集来验证鉴定出的TCTs,并探索它们与肿瘤进展的关联。为了评估特定TCTs的功能作用,进行了细胞迁移和细胞增殖测定。
我们在LUAD细胞系中发现了208个独特的TCT候选物。我们的方法能够识别非转座TEs中的隐秘启动子和终止子。值得注意的是,我们鉴定出一种涉及MIR_HKDC1的嵌合转录本,它似乎在LUAD的进展中起重要作用。此外,这些TCTs的表达与LUAD患者队列中的不良临床结果相关,表明它们作为LUAD进展和预后的新型生物标志物的潜力。
我们的研究强调了长读长测序在揭示LUAD中TCTs复杂格局方面的应用。我们对LUAD中的TCTs进行了全面表征,探索了它们在癌症进展中的潜在调控作用。这些发现有助于更深入地理解癌症背后的基因组复杂性,并为LUAD的靶向治疗和个性化治疗策略的开发提供新方向。这项研究突出了TCTs作为肿瘤发生中的生物标志物和治疗靶点的潜力,为癌症中转座元件与基因调控之间的相互作用提供了新见解。