Yu Xiangtian, Pan XiaoYong, Zhang ShiQi, Zhang Yu-Hang, Chen Lei, Wan Sibao, Huang Tao, Cai Yu-Dong
Clinical Research Center, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China.
Key Laboratory of System Control and Information Processing, Ministry of Education of China, Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai, China.
Front Genet. 2021 Jan 28;11:605012. doi: 10.3389/fgene.2020.605012. eCollection 2020.
Cancer, which refers to abnormal cell proliferative diseases with systematic pathogenic potential, is one of the leading threats to human health. The final causes for patients' deaths are usually cancer recurrence, metastasis, and drug resistance against continuing therapy. Epithelial-to-mesenchymal transition (EMT), which is the transformation of tumor cells (TCs), is a prerequisite for pathogenic cancer recurrence, metastasis, and drug resistance. Conventional biomarkers can only define and recognize large tissues with obvious EMT markers but cannot accurately monitor detailed EMT processes. In this study, a systematic workflow was established integrating effective feature selection, multiple machine learning models [Random forest (RF), Support vector machine (SVM)], rule learning, and functional enrichment analyses to find new biomarkers and their functional implications for distinguishing single-cell isolated TCs with unique epithelial or mesenchymal markers using public single-cell expression profiling. Our discovered signatures may provide an effective and precise transcriptomic reference to monitor EMT progression at the single-cell level and contribute to the exploration of detailed tumorigenesis mechanisms during EMT.
癌症是指具有系统性致病潜力的异常细胞增殖性疾病,是对人类健康的主要威胁之一。患者死亡的最终原因通常是癌症复发、转移以及对持续治疗产生耐药性。上皮-间质转化(EMT),即肿瘤细胞(TCs)的转变,是癌症致病性复发、转移和耐药性的先决条件。传统生物标志物只能定义和识别具有明显EMT标志物的大组织,但无法准确监测详细的EMT过程。在本研究中,建立了一个系统的工作流程,整合了有效的特征选择、多种机器学习模型[随机森林(RF)、支持向量机(SVM)]、规则学习和功能富集分析,以利用公开的单细胞表达谱寻找新的生物标志物及其功能意义,用于区分具有独特上皮或间质标志物的单细胞分离TCs。我们发现的特征可能提供一个有效且精确的转录组参考,以在单细胞水平监测EMT进展,并有助于探索EMT过程中详细的肿瘤发生机制。