• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于集成特征选择和机器学习方法的肺癌分类

Classification of lung cancer using ensemble-based feature selection and machine learning methods.

作者信息

Cai Zhihua, Xu Dong, Zhang Qing, Zhang Jiexia, Ngai Sai-Ming, Shao Jianlin

机构信息

Affiliated Cancer Hospital of Guangzhou Medical University, Guangzhou, Guangdong Province, China.

出版信息

Mol Biosyst. 2015 Mar;11(3):791-800. doi: 10.1039/c4mb00659c. Epub 2014 Dec 16.

DOI:10.1039/c4mb00659c
PMID:25512221
Abstract

Lung cancer is one of the leading causes of death worldwide. There are three major types of lung cancers, non-small cell lung cancer (NSCLC), small cell lung cancer (SCLC) and carcinoid. NSCLC is further classified into lung adenocarcinoma (LADC), squamous cell lung cancer (SQCLC) as well as large cell lung cancer. Many previous studies demonstrated that DNA methylation has emerged as potential lung cancer-specific biomarkers. However, whether there exists a set of DNA methylation markers simultaneously distinguishing such three types of lung cancers remains elusive. In the present study, ROC (Receiving Operating Curve), RFs (Random Forests) and mRMR (Maximum Relevancy and Minimum Redundancy) were proposed to capture the unbiased, informative as well as compact molecular signatures followed by machine learning methods to classify LADC, SQCLC and SCLC. As a result, a panel of 16 DNA methylation markers exhibits an ideal classification power with an accuracy of 86.54%, 84.6% and a recall 84.37%, 85.5% in the leave-one-out cross-validation (LOOCV) and independent data set test experiments, respectively. Besides, comparison results indicate that ensemble-based feature selection methods outperform individual ones when combined with the incremental feature selection (IFS) strategy in terms of the informative and compact property of features. Taken together, results obtained suggest the effectiveness of the ensemble-based feature selection approach and the possible existence of a common panel of DNA methylation markers among such three types of lung cancer tissue, which would facilitate clinical diagnosis and treatment.

摘要

肺癌是全球主要死因之一。肺癌主要有三种类型,即非小细胞肺癌(NSCLC)、小细胞肺癌(SCLC)和类癌。非小细胞肺癌又进一步分为肺腺癌(LADC)、肺鳞状细胞癌(SQCLC)以及大细胞肺癌。许多先前的研究表明,DNA甲基化已成为潜在的肺癌特异性生物标志物。然而,是否存在一组DNA甲基化标记物能够同时区分这三种类型的肺癌仍不清楚。在本研究中,我们提出使用ROC(接收操作曲线)、随机森林(RFs)和最大相关最小冗余(mRMR)方法来获取无偏、信息丰富且简洁的分子特征,随后采用机器学习方法对肺腺癌、肺鳞状细胞癌和小细胞肺癌进行分类。结果,一组16个DNA甲基化标记物在留一法交叉验证(LOOCV)和独立数据集测试实验中分别展现出理想的分类能力,准确率分别为86.54%、84.6%,召回率分别为84.37%、85.5%。此外,比较结果表明,在特征的信息性和简洁性方面,基于集成的特征选择方法与增量特征选择(IFS)策略相结合时优于单个方法。综上所述,所得结果表明基于集成的特征选择方法的有效性以及这三种类型肺癌组织中可能存在一组共同的DNA甲基化标记物,这将有助于临床诊断和治疗。

相似文献

1
Classification of lung cancer using ensemble-based feature selection and machine learning methods.基于集成特征选择和机器学习方法的肺癌分类
Mol Biosyst. 2015 Mar;11(3):791-800. doi: 10.1039/c4mb00659c. Epub 2014 Dec 16.
2
Using epigenomics data to predict gene expression in lung cancer.利用表观基因组学数据预测肺癌中的基因表达。
BMC Bioinformatics. 2015;16 Suppl 5(Suppl 5):S10. doi: 10.1186/1471-2105-16-S5-S10. Epub 2015 Mar 18.
3
Mixture classification model based on clinical markers for breast cancer prognosis.基于临床标志物的乳腺癌预后混合分类模型。
Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.
4
Discovery and development of DNA methylation-based biomarkers for lung cancer.基于 DNA 甲基化的肺癌生物标志物的发现和开发。
Epigenomics. 2014 Feb;6(1):59-72. doi: 10.2217/epi.13.81.
5
Hierarchical clustering of lung cancer cell lines using DNA methylation markers.使用DNA甲基化标记对肺癌细胞系进行层次聚类。
Cancer Epidemiol Biomarkers Prev. 2002 Mar;11(3):291-7.
6
Combined use of oligonucleotide and tissue microarrays identifies cancer/testis antigens as biomarkers in lung carcinoma.寡核苷酸与组织微阵列的联合应用将癌胚抗原鉴定为肺癌生物标志物。
Cancer Res. 2002 Jul 15;62(14):3971-9.
7
Analysis of lncRNA expression profiles in non-small cell lung cancers (NSCLC) and their clinical subtypes.非小细胞肺癌(NSCLC)及其临床亚型中lncRNA表达谱分析。
Lung Cancer. 2014 Aug;85(2):110-5. doi: 10.1016/j.lungcan.2014.05.011. Epub 2014 May 22.
8
Epigenetic inactivation of laminin-5-encoding genes in lung cancers.肺癌中层粘连蛋白-5编码基因的表观遗传失活。
Clin Cancer Res. 2003 Jul;9(7):2665-72.
9
Classification of individual lung cancer cell lines based on DNA methylation markers: use of linear discriminant analysis and artificial neural networks.基于DNA甲基化标记的个体肺癌细胞系分类:线性判别分析和人工神经网络的应用
J Mol Diagn. 2004 Feb;6(1):28-36. doi: 10.1016/S1525-1578(10)60488-6.
10
Genome-wide combination profiling of DNA copy number and methylation for deciphering biomarkers in non-small cell lung cancer patients.对非小细胞肺癌患者的 DNA 拷贝数和甲基化进行全基因组组合分析,以破译生物标志物。
Cancer Lett. 2011 Dec 1;311(1):29-37. doi: 10.1016/j.canlet.2011.06.021. Epub 2011 Jun 24.

引用本文的文献

1
Machine Learning-Based Ensemble Feature Selection and Nested Cross-Validation for miRNA Biomarker Discovery in Usher Syndrome.基于机器学习的集成特征选择与嵌套交叉验证用于发现Usher综合征中的miRNA生物标志物
Bioengineering (Basel). 2025 May 8;12(5):497. doi: 10.3390/bioengineering12050497.
2
Utilizing Feature Selection Techniques for AI-Driven Tumor Subtype Classification: Enhancing Precision in Cancer Diagnostics.利用特征选择技术进行人工智能驱动的肿瘤亚型分类:提高癌症诊断的精度。
Biomolecules. 2025 Jan 8;15(1):81. doi: 10.3390/biom15010081.
3
Developing an ensemble machine learning study: Insights from a multi-center proof-of-concept study.
开发集成机器学习研究:来自多中心概念验证研究的见解。
PLoS One. 2024 Sep 10;19(9):e0303217. doi: 10.1371/journal.pone.0303217. eCollection 2024.
4
Artificial Intelligence Applications in Oral Cancer and Oral Dysplasia.人工智能在口腔癌和口腔发育异常中的应用。
Tissue Eng Part A. 2024 Oct;30(19-20):640-651. doi: 10.1089/ten.TEA.2024.0096. Epub 2024 Aug 7.
5
Enhancement of Classifier Performance with Adam and RanAdam Hyper-Parameter Tuning for Lung Cancer Detection from Microarray Data-In Pursuit of Precision.通过Adam和RanAdam超参数调优提高从微阵列数据检测肺癌的分类器性能——追求精准度
Bioengineering (Basel). 2024 Mar 26;11(4):314. doi: 10.3390/bioengineering11040314.
6
Diagnostics of Thyroid Cancer Using Machine Learning and Metabolomics.利用机器学习和代谢组学诊断甲状腺癌
Metabolites. 2023 Dec 22;14(1):11. doi: 10.3390/metabo14010011.
7
Establishment and Analysis of an Artificial Neural Network Model for Early Detection of Polycystic Ovary Syndrome Using Machine Learning Techniques.使用机器学习技术建立和分析用于早期检测多囊卵巢综合征的人工神经网络模型
J Inflamm Res. 2023 Nov 29;16:5667-5676. doi: 10.2147/JIR.S438838. eCollection 2023.
8
Construction and analysis of a conjunctive diagnostic model of HNSCC with random forest and artificial neural network.基于随机森林和人工神经网络构建并分析头颈部鳞状细胞癌联合诊断模型。
Sci Rep. 2023 Apr 25;13(1):6736. doi: 10.1038/s41598-023-32620-6.
9
A comparison of feature selection methodologies and learning algorithms in the development of a DNA methylation-based telomere length estimator.在基于 DNA 甲基化的端粒长度估算器的开发中,特征选择方法和学习算法的比较。
BMC Bioinformatics. 2023 May 1;24(1):178. doi: 10.1186/s12859-023-05282-4.
10
Artificial intelligence-based non-small cell lung cancer transcriptome RNA-sequence analysis technology selection guide.基于人工智能的非小细胞肺癌转录组RNA序列分析技术选择指南
Front Bioeng Biotechnol. 2023 Feb 15;11:1081950. doi: 10.3389/fbioe.2023.1081950. eCollection 2023.