• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于支持向量机的吸烟诱导肺癌特征风险通路的鉴定。

Identification of feature risk pathways of smoking-induced lung cancer based on SVM.

机构信息

Department of Surgery, Anji Third People's Hospital, Zhejiang China.

Department of Radiology, Taizhou Municipal Hospital, Zhejiang China.

出版信息

PLoS One. 2020 Jun 4;15(6):e0233445. doi: 10.1371/journal.pone.0233445. eCollection 2020.

DOI:10.1371/journal.pone.0233445
PMID:32497048
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7272018/
Abstract

OBJECTIVE

The present study aims to explore the role of smoking factors in the risk of lung cancer and screen the feature risk pathways of smoking-induced lung cancer.

METHODS

The expression profiles of the patient data from GEO database were standardized, and differentially expressed genes (DEGs) were analyzed by limma algorithm. Samples and genes were analyzed by Unsupervised hierarchical clustering method, while GO and KEGG enrichment analyses were performed on DEGs. The data of the protein-protein interaction (PPI) network were downloaded from the BioGrid and HPRD databases, and the DEGs were mapped into the PPI network to identify the interaction relationship. The enriched significant pathways were used to calculate the anomaly score and RFE method was used to optimize the feature sets. The model was trained using the support vector machine (SVM) and the predicted results were plotted into ROC curves. The AUC value was calculated to evaluate the predictive performance of the SVM model.

RESULTS

A total of 1923 DEGs were obtained, of which 826 were down-regulated and 1097 were up-regulated. Unsupervised hierarchical clustering analysis showed that the diagnosis accuracy of lung cancer smokers was 74%, and that of non-lung cancer smokers was 75%. Five optimal feature pathway sets were obtained by screening, the clinical diagnostic ability of which was detected by SVM model with the accuracy improved to 84%. The diagnostic accuracy was 90% after combining clinical information.

CONCLUSION

We verified that five signaling pathways combined with clinical information could be used as a feature risk pathway for identifying lung cancer smokers and non-lung cancer smokers and increased the diagnostic accuracy.

摘要

目的

本研究旨在探讨吸烟因素在肺癌风险中的作用,并筛选吸烟诱导肺癌的特征风险通路。

方法

对 GEO 数据库中患者数据的表达谱进行标准化,采用 limma 算法分析差异表达基因(DEGs)。通过无监督层次聚类方法对样本和基因进行分析,对 DEGs 进行 GO 和 KEGG 富集分析。从 BioGrid 和 HPRD 数据库下载蛋白质-蛋白质相互作用(PPI)网络的数据,将 DEGs 映射到 PPI 网络中以识别相互作用关系。富集显著通路,计算异常评分,并采用 RFE 方法对特征集进行优化。采用支持向量机(SVM)对模型进行训练,并将预测结果绘制为 ROC 曲线。计算 AUC 值以评估 SVM 模型的预测性能。

结果

共获得 1923 个 DEGs,其中 826 个下调,1097 个上调。无监督层次聚类分析显示,肺癌吸烟者的诊断准确率为 74%,非肺癌吸烟者的诊断准确率为 75%。通过筛选获得了五个最优特征通路集,SVM 模型检测其临床诊断能力,准确率提高到 84%。结合临床信息后,诊断准确率提高到 90%。

结论

验证了五个信号通路结合临床信息可作为鉴别肺癌吸烟者和非肺癌吸烟者的特征风险通路,提高了诊断准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56e5/7272018/e80c64eb4651/pone.0233445.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56e5/7272018/e80c64eb4651/pone.0233445.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56e5/7272018/e80c64eb4651/pone.0233445.g001.jpg

相似文献

1
Identification of feature risk pathways of smoking-induced lung cancer based on SVM.基于支持向量机的吸烟诱导肺癌特征风险通路的鉴定。
PLoS One. 2020 Jun 4;15(6):e0233445. doi: 10.1371/journal.pone.0233445. eCollection 2020.
2
Construction of a 26‑feature gene support vector machine classifier for smoking and non‑smoking lung adenocarcinoma sample classification.构建一个 26 特征基因支持向量机分类器,用于吸烟和非吸烟肺腺癌样本分类。
Mol Med Rep. 2018 Feb;17(2):3005-3013. doi: 10.3892/mmr.2017.8220. Epub 2017 Dec 7.
3
Identification of gene markers in the development of smoking-induced lung cancer.吸烟诱导肺癌发生过程中基因标志物的鉴定
Gene. 2016 Jan 15;576(1 Pt 3):451-7. doi: 10.1016/j.gene.2015.10.060. Epub 2015 Oct 27.
4
Support Vector Machine for Lung Adenocarcinoma Staging Through Variant Pathways.基于变异通路的肺腺癌分期的支持向量机
G3 (Bethesda). 2020 Jul 7;10(7):2423-2434. doi: 10.1534/g3.120.401207.
5
A support vector machine classifier for the prediction of osteosarcoma metastasis with high accuracy.一种支持向量机分类器,可高精度预测骨肉瘤转移。
Int J Mol Med. 2017 Nov;40(5):1357-1364. doi: 10.3892/ijmm.2017.3126. Epub 2017 Sep 7.
6
Feature genes in metastatic breast cancer identified by MetaDE and SVM classifier methods.通过 MetaDE 和 SVM 分类器方法鉴定转移性乳腺癌的特征基因。
Mol Med Rep. 2018 Mar;17(3):4281-4290. doi: 10.3892/mmr.2018.8398. Epub 2018 Jan 9.
7
Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction.风险基因识别和支持向量机学习构建心肌梗死早期诊断模型。
Mol Med Rep. 2020 Sep;22(3):1775-1782. doi: 10.3892/mmr.2020.11247. Epub 2020 Jun 17.
8
Construction of a 5-feature gene model by support vector machine for classifying osteoporosis samples.基于支持向量机的五特征基因模型构建用于骨质疏松症样本分类。
Bioengineered. 2021 Dec;12(1):6821-6830. doi: 10.1080/21655979.2021.1971026.
9
A multigene support vector machine predictor for metastasis of cutaneous melanoma.一种用于预测皮肤黑色素瘤转移的多基因支持向量机预测器。
Mol Med Rep. 2018 Feb;17(2):2907-2914. doi: 10.3892/mmr.2017.8219. Epub 2017 Dec 7.
10
Bioinformatics analysis of gene expression profile data to screen key genes involved in intracranial aneurysms.生物信息学分析基因表达谱数据,筛选颅内动脉瘤相关的关键基因。
Mol Med Rep. 2019 Nov;20(5):4415-4424. doi: 10.3892/mmr.2019.10696. Epub 2019 Sep 23.

引用本文的文献

1
Harnessing machine learning in contemporary tobacco research.当代烟草研究中机器学习的应用
Toxicol Rep. 2024 Dec 19;14:101877. doi: 10.1016/j.toxrep.2024.101877. eCollection 2025 Jun.
2
Implementation of PCA enabled Support Vector Machine using cytokines to differentiate smokers versus nonsmokers.使用细胞因子实施主成分分析支持向量机以区分吸烟者与非吸烟者。
Proc (Int Conf Comput Sci Comput Intell). 2021 Dec;2021:312-317. doi: 10.1109/csci54926.2021.00125. Epub 2022 Jun 22.
3
Intelligent assistant diagnosis for pediatric inguinal hernia based on a multilayer and unbalanced classification model.

本文引用的文献

1
Multiple Human-Behaviour Indicators for Predicting Lung Cancer Mortality with Support Vector Machine.基于支持向量机的多个人类行为指标预测肺癌死亡率。
Sci Rep. 2018 Nov 9;8(1):16596. doi: 10.1038/s41598-018-34945-z.
2
Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries.全球癌症统计数据 2018:GLOBOCAN 对全球 185 个国家/地区 36 种癌症的发病率和死亡率的估计。
CA Cancer J Clin. 2018 Nov;68(6):394-424. doi: 10.3322/caac.21492. Epub 2018 Sep 12.
3
Meta-analysis of mRNA expression profiles to identify differentially expressed genes in lung adenocarcinoma tissue from smokers and non-smokers.
基于多层非平衡分类模型的小儿腹股沟疝智能辅助诊断
Front Physiol. 2023 Mar 14;14:1105891. doi: 10.3389/fphys.2023.1105891. eCollection 2023.
4
COVID-19: The Impact in Oncology Care.新型冠状病毒肺炎:对肿瘤护理的影响
SN Compr Clin Med. 2020;2(12):2621-2630. doi: 10.1007/s42399-020-00592-7. Epub 2020 Oct 23.
5
XGBoost-Based Framework for Smoking-Induced Noncommunicable Disease Prediction.基于 XGBoost 的吸烟相关非传染性疾病预测框架。
Int J Environ Res Public Health. 2020 Sep 7;17(18):6513. doi: 10.3390/ijerph17186513.
6
Correction: Identification of feature risk pathways of smoking-induced lung cancer based on SVM.更正:基于支持向量机识别吸烟诱导肺癌的特征风险途径。
PLoS One. 2020 Jul 1;15(7):e0235854. doi: 10.1371/journal.pone.0235854. eCollection 2020.
对吸烟和不吸烟的肺腺癌组织中差异表达基因的 mRNA 表达谱进行荟萃分析。
Oncol Rep. 2018 Mar;39(3):929-938. doi: 10.3892/or.2018.6197. Epub 2018 Jan 8.
4
Applications of Support Vector Machine (SVM) Learning in Cancer Genomics.支持向量机(SVM)学习在癌症基因组学中的应用。
Cancer Genomics Proteomics. 2018 Jan-Feb;15(1):41-51. doi: 10.21873/cgp.20063.
5
SVM and SVM Ensembles in Breast Cancer Prediction.支持向量机及其集成方法在乳腺癌预测中的应用
PLoS One. 2017 Jan 6;12(1):e0161501. doi: 10.1371/journal.pone.0161501. eCollection 2017.
6
A 15-gene signature for prediction of colon cancer recurrence and prognosis based on SVM.基于支持向量机的用于预测结肠癌复发和预后的15基因特征。
Gene. 2017 Mar 10;604:33-40. doi: 10.1016/j.gene.2016.12.016. Epub 2016 Dec 18.
7
Lung cancer mutation profile of EGFR, ALK, and KRAS: Meta-analysis and comparison of never and ever smokers.表皮生长因子受体(EGFR)、间变性淋巴瘤激酶(ALK)和 Kirsten 大鼠肉瘤病毒癌基因(KRAS)的肺癌突变谱:从不吸烟者与曾经吸烟者的荟萃分析及比较
Lung Cancer. 2016 Dec;102:122-134. doi: 10.1016/j.lungcan.2016.10.010. Epub 2016 Oct 25.
8
A recurrence model for laryngeal cancer based on SVM and gene function clustering.基于支持向量机和基因功能聚类的喉癌复发模型
Acta Otolaryngol. 2017 May;137(5):557-562. doi: 10.1080/00016489.2016.1247984. Epub 2016 Nov 4.
9
Smoking History as a Predictor of Pemetrexed Monotherapy in Patients with Non-Squamous Non-Small Cell Lung Cancer.吸烟史作为非鳞状非小细胞肺癌患者培美曲塞单药治疗的预测指标
Oncology. 2016;91(1):41-7. doi: 10.1159/000446438. Epub 2016 May 27.
10
Smoking, inflammation and small cell lung cancer: recent developments.吸烟、炎症与小细胞肺癌:最新进展
Wien Med Wochenschr. 2015 Oct;165(19-20):379-86. doi: 10.1007/s10354-015-0381-6. Epub 2015 Aug 20.