Suppr超能文献

基于下一代测序检测与肿瘤细胞大小整合的机器学习模型改善了成熟B细胞肿瘤的亚型分类。

Machine learning models-based on integration of next-generation sequencing testing and tumor cell sizes improve subtype classification of mature B-cell neoplasms.

作者信息

Mu Yafei, Chen Yuxin, Meng Yuhuan, Chen Tao, Fan Xijie, Yuan Jiecheng, Lin Junwei, Pan Jianhua, Li Guibin, Feng Jinghua, Diao Kaiyuan, Li Yinghua, Yu Shihui, Liu Lingling

机构信息

Department of Hematology, The Third Affiliated Hospital of Sun Yat-sen University and Sun Yat-sen Institute of Hematology, Guangzhou, China.

KingMed School of Laboratory Medicine, Guangzhou Medical University, Guangzhou, China.

出版信息

Front Oncol. 2023 Aug 3;13:1160383. doi: 10.3389/fonc.2023.1160383. eCollection 2023.

Abstract

BACKGROUND

Next-generation sequencing (NGS) panels for mature B-cell neoplasms (MBNs) are widely applied clinically but have yet to be routinely used in a manner that is suitable for subtype differential diagnosis. This study retrospectively investigated newly diagnosed cases of MBNs from our laboratory to investigate mutation landscapes in Chinese patients with MBNs and to combine mutational information and machine learning (ML) into clinical applications for MBNs, especially for subtype classification.

METHODS

Samples from the Catalogue Of Somatic Mutations In Cancer (COSMIC) database were collected for ML model construction and cases from our laboratory were used for ML model validation. Five repeats of 10-fold cross-validation Random Forest algorithm was used for ML model construction. Mutation detection was performed by NGS and tumor cell size was confirmed by cell morphology and/or flow cytometry in our laboratory.

RESULTS

Totally 849 newly diagnosed MBN cases from our laboratory were retrospectively identified and included in mutational landscape analyses. Patterns of gene mutations in a variety of MBN subtypes were found, important to investigate tumorigenesis in MBNs. A long list of novel mutations was revealed, valuable to both functional studies and clinical applications. By combining gene mutation information revealed by NGS and ML, we established ML models that provide valuable information for MBN subtype classification. In total, 8895 cases of 8 subtypes of MBNs in the COSMIC database were collected and utilized for ML model construction, and the models were validated on the 849 MBN cases from our laboratory. A series of ML models was constructed in this study, and the most efficient model, with an accuracy of 0.87, was based on integration of NGS testing and tumor cell sizes.

CONCLUSIONS

The ML models were of great significance in the differential diagnosis of all cases and different MBN subtypes. Additionally, using NGS results to assist in subtype classification of MBNs by method of ML has positive clinical potential.

摘要

背景

用于成熟B细胞肿瘤(MBNs)的新一代测序(NGS)检测板在临床上已得到广泛应用,但尚未以适合亚型鉴别诊断的方式常规使用。本研究回顾性调查了我们实验室新诊断的MBNs病例,以研究中国MBNs患者的突变图谱,并将突变信息与机器学习(ML)结合应用于MBNs的临床,特别是用于亚型分类。

方法

收集癌症体细胞突变目录(COSMIC)数据库中的样本用于ML模型构建,我们实验室的病例用于ML模型验证。采用五重10折交叉验证随机森林算法构建ML模型。在我们实验室,通过NGS进行突变检测,并通过细胞形态学和/或流式细胞术确认肿瘤细胞大小。

结果

我们实验室共回顾性鉴定了849例新诊断的MBN病例,并纳入突变图谱分析。发现了多种MBN亚型中的基因突变模式,这对研究MBNs的肿瘤发生很重要。揭示了一长串新突变,对功能研究和临床应用都有价值。通过将NGS揭示的基因突变信息与ML相结合,我们建立了为MBN亚型分类提供有价值信息的ML模型。总共收集了COSMIC数据库中8895例8种MBN亚型的病例用于ML模型构建,并在我们实验室的849例MBN病例上对模型进行了验证。本研究构建了一系列ML模型,最有效的模型准确率为0.87,该模型基于NGS检测和肿瘤细胞大小的整合。

结论

ML模型在所有病例和不同MBN亚型的鉴别诊断中具有重要意义。此外,通过ML方法利用NGS结果辅助MBNs的亚型分类具有积极的临床潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0257/10436202/b7c56da8badb/fonc-13-1160383-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验