Li Qi-Gang, He Yong-Han, Wu Huan, Yang Cui-Ping, Pu Shao-Yan, Fan Song-Qing, Jiang Li-Ping, Shen Qiu-Shuo, Wang Xiao-Xiong, Chen Xiao-Qiong, Yu Qin, Li Ying, Sun Chang, Wang Xiangting, Zhou Jumin, Li Hai-Peng, Chen Yong-Bin, Kong Qing-Peng
State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China.
KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming 650223, China.
Theranostics. 2017 Jul 8;7(11):2888-2899. doi: 10.7150/thno.19425. eCollection 2017.
Heterogeneity in transcriptional data hampers the identification of differentially expressed genes (DEGs) and understanding of cancer, essentially because current methods rely on cross-sample normalization and/or distribution assumption-both sensitive to heterogeneous values. Here, we developed a new method, Cross-Value Association Analysis (CVAA), which overcomes the limitation and is more robust to heterogeneous data than the other methods. Applying CVAA to a more complex pan-cancer dataset containing 5,540 transcriptomes discovered numerous new DEGs and many previously rarely explored pathways/processes; some of them were validated, both and , to be crucial in tumorigenesis, e.g., alcohol metabolism (), chromosome remodeling () and complement system (). Together, we present a sharper tool to navigate large-scale expression data and gain new mechanistic insights into tumorigenesis.
转录数据中的异质性阻碍了差异表达基因(DEG)的识别以及对癌症的理解,主要是因为当前方法依赖于跨样本归一化和/或分布假设——两者都对异质值敏感。在此,我们开发了一种新方法,交叉值关联分析(CVAA),它克服了这一局限性,并且比其他方法对异质数据更具鲁棒性。将CVAA应用于包含5540个转录组的更复杂的泛癌数据集,发现了许多新的DEG以及许多以前很少探索的途径/过程;其中一些已在体内和体外得到验证,对肿瘤发生至关重要,例如酒精代谢(体内)、染色体重塑(体外)和补体系统(体内和体外)。总之,我们提供了一个更有效的工具来处理大规模表达数据,并获得对肿瘤发生的新机制见解。