Hsu Min-Kung, Wu I-Ching, Cheng Ching-Chia, Su Jen-Liang, Hsieh Chang-Huain, Lin Yeong-Shin, Chen Feng-Chi
Department of Biological Science and Technology, National Chiao-Tung University, Hsinchu, Taiwan.
Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Taiwan.
Oncotarget. 2015 Oct 6;6(30):28755-73. doi: 10.18632/oncotarget.4810.
Lung adenocarcinoma is one of the most deadly human diseases. However, the molecular mechanisms underlying this disease, particularly RNA splicing, have remained underexplored. Here, we report a triple-level (gene-, transcript-, and exon-level) analysis of lung adenocarcinoma transcriptomes from 77 paired tumor and normal tissues, as well as an analysis pipeline to overcome genetic variability for accurate differentiation between tumor and normal tissues. We report three major results. First, more than 5,000 differentially expressed transcripts/exonic regions occur repeatedly in lung adenocarcinoma patients. These transcripts/exonic regions are enriched in nicotine metabolism and ribosomal functions in addition to the pathways enriched for differentially expressed genes (cell cycle, extracellular matrix receptor interaction, and axon guidance). Second, classification models based on rationally selected transcripts or exonic regions can reach accuracies of 0.93 to 1.00 in differentiating tumor from normal tissues. Of the 28 selected exonic regions, 26 regions correspond to alternative exons located in such regulators as tumor suppressor (GDF10), signal receptor (LYVE1), vascular-specific regulator (RASIP1), ubiquitination mediator (RNF5), and transcriptional repressor (TRIM27). Third, classification systems based on 13 to 14 differentially expressed genes yield accuracies near 100%. Genes selected by both detection methods include C16orf59, DAP3, ETV4, GABARAPL1, PPAR, RADIL, RSPO1, SERTM1, SRPK1, ST6GALNAC6, and TNXB. Our findings imply a multilayered lung adenocarcinoma regulome in which transcript-/exon-level regulation may be dissociated from gene-level regulation. Our described method may be used to identify potentially important genes/transcripts/exonic regions for the tumorigenesis of lung adenocarcinoma and to construct accurate tumor vs. normal classification systems for this disease.
肺腺癌是最致命的人类疾病之一。然而,该疾病背后的分子机制,尤其是RNA剪接,仍未得到充分探索。在此,我们报告了对来自77对肿瘤和正常组织的肺腺癌转录组进行的三级(基因、转录本和外显子水平)分析,以及一种克服基因变异性以准确区分肿瘤和正常组织的分析流程。我们报告了三个主要结果。首先,超过5000个差异表达的转录本/外显子区域在肺腺癌患者中反复出现。除了差异表达基因富集的通路(细胞周期、细胞外基质受体相互作用和轴突导向)外,这些转录本/外显子区域还富集于尼古丁代谢和核糖体功能。其次,基于合理选择的转录本或外显子区域的分类模型在区分肿瘤和正常组织时准确率可达0.93至1.00。在所选的28个外显子区域中,26个区域对应于位于肿瘤抑制因子(GDF10)、信号受体(LYVE1)、血管特异性调节因子(RASIP1)、泛素化介质(RNF5)和转录抑制因子(TRIM27)等调节因子中的可变外显子。第三,基于13至14个差异表达基因的分类系统准确率接近100%。两种检测方法都选择的基因包括C16orf59、DAP3、ETV4、GABARAPL1、PPAR、RADIL、RSPO1、SERTM1、SRPK1、ST6GALNAC6和TNXB。我们的发现暗示了一个多层的肺腺癌调控组,其中转录本/外显子水平的调控可能与基因水平的调控分离。我们所描述的方法可用于识别肺腺癌肿瘤发生中潜在重要的基因/转录本/外显子区域,并构建针对该疾病的准确的肿瘤与正常分类系统。