Suppr超能文献

RPS学习者:一种基于随机投影和深度堆叠学习的非小细胞肺癌分类新方法。

RPSLearner: A novel approach based on random projection and deep stacking learning for categorizing NSCLC.

作者信息

Wu Xinchao, Wang Jieqiong, Wan Shibiao

机构信息

Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE.

Department of Neurological Sciences, University of Nebraska Medical Center, Omaha, NE.

出版信息

bioRxiv. 2025 May 7:2025.05.01.651699. doi: 10.1101/2025.05.01.651699.

Abstract

BACKGROUND

Lung cancer is the leading cause of cancer death, and non-small cell lung cancer (NSCLC) comprises the largest subtype with most cases. Lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) are two NSCLC subtypes that pose challenges for accurate diagnosis using conventional methods. Existing methods are histological examination and imaging which lacks definitive histologic features and requires intense time.

METHODS

To address these concerns, we propose RPSLearner, which combines Random Projection (RP) for dimensionality reduction and stacking ensemble learning to accurately predict lung cancer subtypes. Specifically, multiple independent RP matrices were first generated to project the high-dimensional RNA-seq data into lower-dimensional space, whose features were subsequently concatenated. After that, we fed the fused features into a stack of diverse base classifiers and integrated the predictions from base models via a deep linear layer network.

RESULTS

Benchmarking tests on 1,333 NSCLC patients demonstrated that RPSLearner outperformed state-of-the-art approaches for lung cancer subtype classification. Specifically, RPSLearner efficiently preserved sample-to-sample distances even after significant dimension reduction, and the meta-model in RPSLearner yielded consistently higher accuracy, F1 and AUC scores than individual base models and state-of-the-art approaches for lung cancer subtyping. Besides, the feature fusion method applied in RPSLearner shown better performance than conventional scores ensemble methods.

CONCLUSION

We developed a novel stacking learning method called RPSLearner which combines RP and stacking learning, enabling efficient and accurate identification of NSCLC subtypes. RPSLearner is a promising lung cancer subtyping model for downstream lung cancer clinical diagnosis and personalized treatment, and the framework holds the potentiality to be extended to subtyping of other types of cancer.

摘要

背景

肺癌是癌症死亡的主要原因,非小细胞肺癌(NSCLC)是最大的亚型,病例最多。肺腺癌(LUAD)和肺鳞状细胞癌(LUSC)是两种NSCLC亚型,使用传统方法进行准确诊断具有挑战性。现有方法是组织学检查和成像,缺乏明确的组织学特征且需要大量时间。

方法

为了解决这些问题,我们提出了RPSLearner,它结合了用于降维的随机投影(RP)和堆叠集成学习来准确预测肺癌亚型。具体来说,首先生成多个独立的RP矩阵,将高维RNA测序数据投影到低维空间,随后将其特征连接起来。之后,我们将融合后的特征输入到一堆不同的基分类器中,并通过深度线性层网络整合基模型的预测结果。

结果

对1333例NSCLC患者进行的基准测试表明,RPSLearner在肺癌亚型分类方面优于现有方法。具体而言,即使在显著降维后,RPSLearner仍能有效地保留样本间距离,并且RPSLearner中的元模型在肺癌亚型分类方面始终比单个基模型和现有方法产生更高的准确率、F1值和AUC分数。此外,RPSLearner中应用的特征融合方法比传统的分数集成方法表现更好。

结论

我们开发了一种名为RPSLearner的新型堆叠学习方法,它结合了RP和堆叠学习,能够高效准确地识别NSCLC亚型。RPSLearner是一种有前途的肺癌亚型分类模型,可用于下游肺癌临床诊断和个性化治疗,并且该框架具有扩展到其他类型癌症亚型分类的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2688/12247899/06265f6f7b93/nihpp-2025.05.01.651699v1-f0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验