• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的乳腺癌类型分类

Breast Cancer Type Classification Using Machine Learning.

作者信息

Wu Jiande, Hicks Chindo

机构信息

Department of Genetics, School of Medicine, Louisiana State University Health Sciences Center, 533 Bolivar, New Orleans, LA 70112, USA.

出版信息

J Pers Med. 2021 Jan 20;11(2):61. doi: 10.3390/jpm11020061.

DOI:10.3390/jpm11020061
PMID:33498339
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7909418/
Abstract

BACKGROUND

Breast cancer is a heterogeneous disease defined by molecular types and subtypes. Advances in genomic research have enabled use of precision medicine in clinical management of breast cancer. A critical unmet medical need is distinguishing triple negative breast cancer, the most aggressive and lethal form of breast cancer, from non-triple negative breast cancer. Here we propose use of a machine learning (ML) approach for classification of triple negative breast cancer and non-triple negative breast cancer patients using gene expression data.

METHODS

We performed analysis of RNA-Sequence data from 110 triple negative and 992 non-triple negative breast cancer tumor samples from The Cancer Genome Atlas to select the features (genes) used in the development and validation of the classification models. We evaluated four different classification models including Support Vector Machines, K-nearest neighbor, Naïve Bayes and Decision tree using features selected at different threshold levels to train the models for classifying the two types of breast cancer. For performance evaluation and validation, the proposed methods were applied to independent gene expression datasets.

RESULTS

Among the four ML algorithms evaluated, the Support Vector Machine algorithm was able to classify breast cancer more accurately into triple negative and non-triple negative breast cancer and had less misclassification errors than the other three algorithms evaluated.

CONCLUSIONS

The prediction results show that ML algorithms are efficient and can be used for classification of breast cancer into triple negative and non-triple negative breast cancer types.

摘要

背景

乳腺癌是一种由分子类型和亚型定义的异质性疾病。基因组研究的进展使得精准医学能够应用于乳腺癌的临床管理。一个关键的未满足医疗需求是区分三阴性乳腺癌(最具侵袭性和致命性的乳腺癌形式)和非三阴性乳腺癌。在此,我们提出使用机器学习(ML)方法,利用基因表达数据对三阴性乳腺癌患者和非三阴性乳腺癌患者进行分类。

方法

我们对来自癌症基因组图谱的110例三阴性乳腺癌肿瘤样本和992例非三阴性乳腺癌肿瘤样本的RNA序列数据进行分析,以选择用于分类模型开发和验证的特征(基因)。我们使用在不同阈值水平选择的特征评估了四种不同的分类模型,包括支持向量机、K近邻、朴素贝叶斯和决策树,以训练用于区分这两种类型乳腺癌的模型。为了进行性能评估和验证,将所提出的方法应用于独立的基因表达数据集。

结果

在所评估的四种机器学习算法中,支持向量机算法能够更准确地将乳腺癌分类为三阴性和非三阴性乳腺癌,并且与所评估的其他三种算法相比,误分类错误更少。

结论

预测结果表明,机器学习算法是有效的,可用于将乳腺癌分类为三阴性和非三阴性乳腺癌类型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f0b/7909418/af88f360d685/jpm-11-00061-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f0b/7909418/ef2eb6225055/jpm-11-00061-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f0b/7909418/ae439b6b1c60/jpm-11-00061-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f0b/7909418/af88f360d685/jpm-11-00061-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f0b/7909418/ef2eb6225055/jpm-11-00061-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f0b/7909418/ae439b6b1c60/jpm-11-00061-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f0b/7909418/af88f360d685/jpm-11-00061-g003.jpg

相似文献

1
Breast Cancer Type Classification Using Machine Learning.基于机器学习的乳腺癌类型分类
J Pers Med. 2021 Jan 20;11(2):61. doi: 10.3390/jpm11020061.
2
Breast Cancer Subtypes Classification with Hybrid Machine Learning Model.基于混合机器学习模型的乳腺癌亚型分类。
Methods Inf Med. 2022 Sep;61(3-04):68-83. doi: 10.1055/s-0042-1751043. Epub 2022 Sep 12.
3
Evaluation of Machine Learning Algorithm Utilization for Lung Cancer Classification Based on Gene Expression Levels.基于基因表达水平的肺癌分类机器学习算法应用评估
Asian Pac J Cancer Prev. 2016;17(2):835-8. doi: 10.7314/apjcp.2016.17.2.835.
4
Mammography Image-Based Diagnosis of Breast Cancer Using Machine Learning: A Pilot Study.基于乳腺 X 光图像的机器学习乳腺癌诊断:一项试点研究。
Sensors (Basel). 2021 Dec 28;22(1):203. doi: 10.3390/s22010203.
5
An integrative machine learning framework for classifying SEER breast cancer.用于 SEER 乳腺癌分类的集成机器学习框架
Sci Rep. 2023 Apr 1;13(1):5362. doi: 10.1038/s41598-023-32029-1.
6
Analysis of Decision Tree and K-Nearest Neighbor Algorithm in the Classification of Breast Cancer.决策树和K近邻算法在乳腺癌分类中的分析
Asian Pac J Cancer Prev. 2019 Dec 1;20(12):3777-3781. doi: 10.31557/APJCP.2019.20.12.3777.
7
MLSeq: Machine learning interface for RNA-sequencing data.MLSeq:用于 RNA-seq 数据的机器学习接口。
Comput Methods Programs Biomed. 2019 Jul;175:223-231. doi: 10.1016/j.cmpb.2019.04.007. Epub 2019 Apr 29.
8
Hubness weighted SVM ensemble for prediction of breast cancer subtypes.基于 Hubness 权重的支持向量机集成模型预测乳腺癌亚型。
Technol Health Care. 2022;30(3):565-578. doi: 10.3233/THC-212825.
9
Machine learning models in breast cancer survival prediction.用于乳腺癌生存预测的机器学习模型。
Technol Health Care. 2016;24(1):31-42. doi: 10.3233/THC-151071.
10
Personal Health Information Inference Using Machine Learning on RNA Expression Data from Patients With Cancer: Algorithm Validation Study.利用癌症患者 RNA 表达数据进行机器学习的个人健康信息推断:算法验证研究。
J Med Internet Res. 2020 Aug 10;22(8):e18387. doi: 10.2196/18387.

引用本文的文献

1
A Machine-Learning Model for the Prediction of Triple-Negative Breast Cancer Based on Multiparameter MRI.基于多参数磁共振成像的三阴性乳腺癌预测机器学习模型
Breast Cancer (Dove Med Press). 2025 Jul 15;17:611-625. doi: 10.2147/BCTT.S513779. eCollection 2025.
2
Cohesive data analysis for the identification of prognostic hub genes and significant pathways associated with HER2 + and TN breast cancer types.用于鉴定与HER2阳性和三阴型乳腺癌相关的预后关键基因及重要通路的连贯数据分析
Sci Rep. 2025 Jul 2;15(1):23675. doi: 10.1038/s41598-025-94084-0.
3
Early detection and analysis of accurate breast cancer for improved diagnosis using deep supervised learning for enhanced patient outcomes.

本文引用的文献

1
Radiomic Analysis in Contrast-Enhanced Spectral Mammography for Predicting Breast Cancer Histological Outcome.对比增强光谱乳腺摄影术的放射组学分析用于预测乳腺癌组织学结果
Diagnostics (Basel). 2020 Sep 17;10(9):708. doi: 10.3390/diagnostics10090708.
2
Mechanisms of Chemotherapy Resistance in Triple-Negative Breast Cancer-How We Can Rise to the Challenge.三阴性乳腺癌化疗耐药的机制——我们如何应对挑战。
Cells. 2019 Aug 22;8(9):957. doi: 10.3390/cells8090957.
3
Breast cancer outcome prediction with tumour tissue images and machine learning.
利用深度监督学习进行早期准确乳腺癌检测与分析,以改善诊断,提高患者治疗效果。
PeerJ Comput Sci. 2025 Apr 24;11:e2784. doi: 10.7717/peerj-cs.2784. eCollection 2025.
4
Breast Cancer Detection Using Convolutional Neural Networks: A Deep Learning-Based Approach.使用卷积神经网络进行乳腺癌检测:一种基于深度学习的方法。
Cureus. 2025 May 3;17(5):e83421. doi: 10.7759/cureus.83421. eCollection 2025 May.
5
Impact of surgical approach and survival prediction of malignant phyllode tumor by machine learning.手术方式对恶性叶状肿瘤的影响及机器学习对其生存预测
Updates Surg. 2025 Apr 7. doi: 10.1007/s13304-025-02191-4.
6
Artificial intelligence based classification and prediction of medical imaging using a novel framework of inverted and self-attention deep neural network architecture.基于人工智能的医学影像分类与预测:使用一种新颖的倒置与自注意力深度神经网络架构框架
Sci Rep. 2025 Mar 13;15(1):8724. doi: 10.1038/s41598-025-93718-7.
7
Enhancing breast cancer prediction through stacking ensemble and deep learning integration.通过堆叠集成和深度学习集成增强乳腺癌预测
PeerJ Comput Sci. 2025 Feb 3;11:e2461. doi: 10.7717/peerj-cs.2461. eCollection 2025.
8
Radiomics Integration of Mammography and DCE-MRI for Predicting Molecular Subtypes in Breast Cancer Patients.乳腺钼靶和动态对比增强磁共振成像的影像组学整合用于预测乳腺癌患者的分子亚型
Breast Cancer (Dove Med Press). 2025 Feb 18;17:187-200. doi: 10.2147/BCTT.S488200. eCollection 2025.
9
Utilizing Feature Selection Techniques for AI-Driven Tumor Subtype Classification: Enhancing Precision in Cancer Diagnostics.利用特征选择技术进行人工智能驱动的肿瘤亚型分类:提高癌症诊断的精度。
Biomolecules. 2025 Jan 8;15(1):81. doi: 10.3390/biom15010081.
10
Synthetic Boosted Resampling Using Deep Generative Adversarial Networks: A Novel Approach to Improve Cancer Prediction from Imbalanced Datasets.使用深度生成对抗网络的合成增强重采样:一种从不平衡数据集中改善癌症预测的新方法。
Cancers (Basel). 2024 Dec 2;16(23):4046. doi: 10.3390/cancers16234046.
利用肿瘤组织图像和机器学习预测乳腺癌的预后。
Breast Cancer Res Treat. 2019 Aug;177(1):41-52. doi: 10.1007/s10549-019-05281-1. Epub 2019 May 22.
4
Cancer statistics, 2019.癌症统计数据,2019 年。
CA Cancer J Clin. 2019 Jan;69(1):7-34. doi: 10.3322/caac.21551. Epub 2019 Jan 8.
5
Machine learning for diagnostic ultrasound of triple-negative breast cancer.机器学习在三阴性乳腺癌诊断超声中的应用。
Breast Cancer Res Treat. 2019 Jan;173(2):365-373. doi: 10.1007/s10549-018-4984-7. Epub 2018 Oct 20.
6
Survival Study of Triple-Negative and Non-Triple-Negative Breast Cancer in a Brazilian Cohort.巴西队列中三阴性和非三阴性乳腺癌的生存研究
Clin Med Insights Oncol. 2018 Jul 27;12:1179554918790563. doi: 10.1177/1179554918790563. eCollection 2018.
7
Diagnostic Accuracy of Different Machine Learning Algorithms for Breast Cancer Risk Calculation: a Meta-Analysis.不同机器学习算法用于乳腺癌风险计算的诊断准确性:一项荟萃分析
Asian Pac J Cancer Prev. 2018 Jul 27;19(7):1747-1752. doi: 10.22034/APJCP.2018.19.7.1747.
8
Prediction of breast cancer risk using a machine learning approach embedded with a locality preserving projection algorithm.基于局部保持投影算法的机器学习方法预测乳腺癌风险。
Phys Med Biol. 2018 Jan 30;63(3):035020. doi: 10.1088/1361-6560/aaa1ca.
9
Immunohistochemical versus molecular (BluePrint and MammaPrint) subtyping of breast carcinoma. Outcome results from the EORTC 10041/BIG 3-04 MINDACT trial.免疫组化与分子(BluePrint 和 MammaPrint)亚型在乳腺癌中的应用。EORTC 10041/BIG 3-04 MINDACT 试验的结果。
Breast Cancer Res Treat. 2018 Jan;167(1):123-131. doi: 10.1007/s10549-017-4509-9. Epub 2017 Sep 19.
10
Prospective comparison of outcome after treatment for triple-negative and non-triple-negative breast cancer.三阴性乳腺癌与非三阴性乳腺癌治疗后结局的前瞻性比较。
Surgeon. 2017 Oct;15(5):272-277. doi: 10.1016/j.surge.2016.10.001. Epub 2016 Oct 27.