• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RF-PCA:一种基于属性选择和特征提取的乳腺癌分类数据快速识别新解决方案。

RF-PCA: A New Solution for Rapid Identification of Breast Cancer Categorical Data Based on Attribute Selection and Feature Extraction.

作者信息

Bian Kai, Zhou Mengran, Hu Feng, Lai Wenhao

机构信息

School of Electrical and Information Engineering, Anhui University of Science and Technology, Huainan, China.

State Key Laboratory of Mining Response and Disaster Prevention and Control in Deep Coal Mines, Anhui University of Science and Technology, Huainan, China.

出版信息

Front Genet. 2020 Sep 9;11:566057. doi: 10.3389/fgene.2020.566057. eCollection 2020.

DOI:10.3389/fgene.2020.566057
PMID:33033496
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7510777/
Abstract

Breast cancer is one of the most common cancer diseases in women. The rapid and accurate diagnosis of breast cancer is of great significance for the treatment of cancer. Artificial intelligence and machine learning algorithms are used to identify breast malignant tumors, which can effectively solve the problems of insufficient recognition accuracy and long time-consuming in traditional breast cancer diagnosis methods. To solve these problems, we proposed a method of attribute selection and feature extraction based on random forest (RF) combined with principal component analysis (PCA) for rapid and accurate diagnosis of breast cancer. Firstly, RF was used to reduce 30 attributes of breast cancer categorical data. According to the average importance of attributes and out of bag error, 21 relatively important attribute data were selected for feature extraction based on PCA. The seven features extracted from PCA were used to establish an extreme learning machine (ELM) classification model with different activation functions. By comparing the classification accuracy and training time of these different models, the activation function of the hidden layer was determined as the sigmoid function. When the number of neurons in the hidden layer was 27, the accuracy of the test set was 98.75%, the accuracy of the training set was 99.06%, and the training time was only 0.0022 s. Finally, in order to verify the superiority of this method in breast cancer diagnosis, we compared with the ELM model based on the original breast cancer data and other intelligent classification algorithm models. The algorithm used in this article has a faster recognition time and a higher recognition accuracy than other algorithms. We also used the breast cancer data of breast tissue reactance features to verify the reliability of this method, and ideal results were obtained. The experimental results show that RF-PCA combined with ELM can significantly reduce the time required for the diagnosis of breast cancer, which has the ability of rapid and accurate identification of breast cancer and provides a theoretical basis for the intelligent diagnosis of breast cancer.

摘要

乳腺癌是女性最常见的癌症疾病之一。乳腺癌的快速准确诊断对于癌症治疗具有重要意义。人工智能和机器学习算法被用于识别乳腺恶性肿瘤,这可以有效解决传统乳腺癌诊断方法中识别准确率不足和耗时较长的问题。为了解决这些问题,我们提出了一种基于随机森林(RF)结合主成分分析(PCA)的属性选择和特征提取方法,用于乳腺癌的快速准确诊断。首先,利用RF对乳腺癌分类数据的30个属性进行约简。根据属性的平均重要性和袋外误差,选择21个相对重要的属性数据基于PCA进行特征提取。从PCA中提取的7个特征用于建立具有不同激活函数的极限学习机(ELM)分类模型。通过比较这些不同模型的分类准确率和训练时间,确定隐藏层的激活函数为sigmoid函数。当隐藏层神经元数量为27时,测试集准确率为98.75%,训练集准确率为99.06%,训练时间仅为0.0022秒。最后,为了验证该方法在乳腺癌诊断中的优越性,我们与基于原始乳腺癌数据的ELM模型及其他智能分类算法模型进行了比较。本文所使用的算法比其他算法具有更快的识别时间和更高的识别准确率。我们还使用乳腺组织电抗特征的乳腺癌数据验证了该方法的可靠性,并获得了理想结果。实验结果表明,RF-PCA结合ELM可以显著减少乳腺癌诊断所需时间,具有快速准确识别乳腺癌的能力,为乳腺癌的智能诊断提供了理论依据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fd/7510777/d6d53a753673/fgene-11-566057-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fd/7510777/b650717ac213/fgene-11-566057-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fd/7510777/53dd3b9cb524/fgene-11-566057-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fd/7510777/784c99fe3bcd/fgene-11-566057-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fd/7510777/ed404e883e8d/fgene-11-566057-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fd/7510777/e732115509a1/fgene-11-566057-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fd/7510777/d6d53a753673/fgene-11-566057-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fd/7510777/b650717ac213/fgene-11-566057-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fd/7510777/53dd3b9cb524/fgene-11-566057-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fd/7510777/784c99fe3bcd/fgene-11-566057-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fd/7510777/ed404e883e8d/fgene-11-566057-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fd/7510777/e732115509a1/fgene-11-566057-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fd/7510777/d6d53a753673/fgene-11-566057-g006.jpg

相似文献

1
RF-PCA: A New Solution for Rapid Identification of Breast Cancer Categorical Data Based on Attribute Selection and Feature Extraction.RF-PCA:一种基于属性选择和特征提取的乳腺癌分类数据快速识别新解决方案。
Front Genet. 2020 Sep 9;11:566057. doi: 10.3389/fgene.2020.566057. eCollection 2020.
2
A New Method Based on CEEMD Combined With Iterative Feature Reduction for Aided Diagnosis of Epileptic EEG.一种基于CEEMD结合迭代特征约简的癫痫脑电辅助诊断新方法。
Front Bioeng Biotechnol. 2020 Jun 30;8:669. doi: 10.3389/fbioe.2020.00669. eCollection 2020.
3
Comparison of SVM, RF and ELM on an Electronic Nose for the Intelligent Evaluation of Paraffin Samples.支持向量机、随机森林和极限学习机在电子鼻用于石蜡样品智能评估方面的比较。
Sensors (Basel). 2018 Jan 18;18(1):285. doi: 10.3390/s18010285.
4
A computer aided diagnosis system for thyroid disease using extreme learning machine.基于极限学习机的甲状腺疾病计算机辅助诊断系统。
J Med Syst. 2012 Oct;36(5):3327-37. doi: 10.1007/s10916-012-9825-3. Epub 2012 Feb 12.
5
[Identification of varieties of black bean using ground based hyperspectral imaging].[基于地面高光谱成像技术的黑豆品种鉴别]
Guang Pu Xue Yu Guang Pu Fen Xi. 2014 Mar;34(3):746-50.
6
Identification of the interference spectra of edible oil samples based on neighborhood rough set attribute reduction.基于邻域粗糙集属性约简的食用油样本干涉光谱识别
Appl Opt. 2023 Feb 20;62(6):1537-1546. doi: 10.1364/AO.475459.
7
Spoken language identification based on the enhanced self-adjusting extreme learning machine approach.基于增强型自调节极限学习机方法的口语识别。
PLoS One. 2018 Apr 19;13(4):e0194770. doi: 10.1371/journal.pone.0194770. eCollection 2018.
8
Remote Sensing Image Recognition Based on LOG-T-SSA-LSSVM and AE-ELM Network.基于 LOG-T-SSA-LSSVM 和 AE-ELM 网络的遥感图像识别。
Comput Intell Neurosci. 2022 Jan 25;2022:8077563. doi: 10.1155/2022/8077563. eCollection 2022.
9
Improved lung nodule diagnosis accuracy using lung CT images with uncertain class.利用不确定类别的肺部 CT 图像提高肺结节诊断准确性。
Comput Methods Programs Biomed. 2018 Aug;162:197-209. doi: 10.1016/j.cmpb.2018.05.028. Epub 2018 May 18.
10
Accurate quantification of alkalinity of sintered ore by random forest model based on PCA and variable importance (PCA-VI-RF).基于主成分分析和变量重要性的随机森林模型(PCA-VI-RF)对烧结矿碱度进行准确量化。
Appl Opt. 2020 Mar 1;59(7):2042-2049. doi: 10.1364/AO.382102.

引用本文的文献

1
Construction and evaluation of a liver cancer risk prediction model based on machine learning.基于机器学习的肝癌风险预测模型的构建与评估
World J Gastrointest Oncol. 2024 Sep 15;16(9):3839-3850. doi: 10.4251/wjgo.v16.i9.3839.
2
An integrative machine learning framework for classifying SEER breast cancer.用于 SEER 乳腺癌分类的集成机器学习框架
Sci Rep. 2023 Apr 1;13(1):5362. doi: 10.1038/s41598-023-32029-1.
3
Research on SPDTRS-PNN based intelligent assistant diagnosis for breast cancer.基于 SPDTRS-PNN 的乳腺癌智能辅助诊断研究。

本文引用的文献

1
Cancer statistics, 2020.癌症统计数据,2020 年。
CA Cancer J Clin. 2020 Jan;70(1):7-30. doi: 10.3322/caac.21590. Epub 2020 Jan 8.
2
Breast cancer statistics, 2019.乳腺癌统计数据,2019 年。
CA Cancer J Clin. 2019 Nov;69(6):438-451. doi: 10.3322/caac.21583. Epub 2019 Oct 2.
3
Transfer learning based deep CNN for segmentation and detection of mitoses in breast cancer histopathological images.基于迁移学习的深度卷积神经网络用于乳腺癌组织病理学图像中有丝分裂的分割与检测
Sci Rep. 2023 Mar 16;13(1):4386. doi: 10.1038/s41598-023-28316-6.
4
Intelligent Diagnosis Based on Double-Optimized Artificial Hydrocarbon Networks for Mechanical Faults of In-Wheel Motor.基于双重优化人工碳氢网络的轮内电机机械故障智能诊断。
Sensors (Basel). 2022 Aug 22;22(16):6316. doi: 10.3390/s22166316.
5
E-Learning Performance Prediction: Mining the Feature Space of Effective Learning Behavior.电子学习绩效预测:挖掘有效学习行为的特征空间
Entropy (Basel). 2022 May 19;24(5):722. doi: 10.3390/e24050722.
Microscopy (Oxf). 2019 Jun 1;68(3):216-233. doi: 10.1093/jmicro/dfz002.
4
Long-Term Endocrine and Metabolic Consequences of Cancer Treatment: A Systematic Review.癌症治疗的长期内分泌和代谢后果:系统评价。
Endocr Rev. 2019 Jun 1;40(3):711-767. doi: 10.1210/er.2018-00092.
5
Second Primary Lung Cancer After Breast Cancer: A Population-Based Study of 6,269 Women.乳腺癌后发生的第二原发性肺癌:一项基于人群的6269名女性的研究。
Front Oncol. 2018 Oct 9;8:427. doi: 10.3389/fonc.2018.00427. eCollection 2018.
6
Principal component analysis.
Transfusion. 2018 Jul;58(7):1580-1582. doi: 10.1111/trf.14639. Epub 2018 May 6.
7
e-Bitter: Bitterant Prediction by the Consensus Voting From the Machine-Learning Methods.电子苦味:通过机器学习方法的共识投票进行苦味剂预测。
Front Chem. 2018 Mar 29;6:82. doi: 10.3389/fchem.2018.00082. eCollection 2018.
8
Breast cancer identification via modeling of peripherally circulating miRNAs.通过外周循环微小RNA建模进行乳腺癌识别
PeerJ. 2018 Mar 26;6:e4551. doi: 10.7717/peerj.4551. eCollection 2018.
9
Quantitative proteomic profiling of primary cancer-associated fibroblasts in oesophageal adenocarcinoma.食管腺癌中原发性癌相关成纤维细胞的定量蛋白质组学分析。
Br J Cancer. 2018 May;118(9):1200-1207. doi: 10.1038/s41416-018-0042-9. Epub 2018 Mar 29.
10
Bioinspired Architecture Selection for Multitask Learning.用于多任务学习的生物启发式架构选择
Front Neuroinform. 2017 Jun 22;11:39. doi: 10.3389/fninf.2017.00039. eCollection 2017.