Suppr
超能文献

基于基因表达数据的肺癌预后聚类方法的比较研究。

A comparative study of clustering methods on gene expression data for lung cancer prognosis.

机构信息

Wake Forest University, Winston-Salem, NC, United States of America.

Markey Cancer Center, University of Kentucky, Lexington, KY, USA.

出版信息

BMC Res Notes. 2023 Nov 8;16(1):319. doi: 10.1186/s13104-023-06604-8.

DOI:10.1186/s13104-023-06604-8

PMID:37941025

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10630994/

Abstract

Lung cancer subtyping based on gene expression data is important for identifying patient subgroups with differing survival prognosis to facilitate customized treatment strategies for each subtype of patients. Unsupervised clustering methods are the traditional approach for clustering patients into subtypes. However, since those methods cluster patients based only on gene expression data, the resulting clusters may not always be relevant to the survival outcome of interest. In recent years, semi-supervised and supervised methods have been proposed, which leverage the survival outcome data to identify clusters more relevant to survival prognosis. This paper aims to compare the performance of different clustering methods for identifying clinically prognostic lung cancer subtypes based on two lung adenocarcinoma datasets. For each method, we clustered patients into two clusters and assessed the difference in patient survival time between clusters. Unsupervised methods were found to have large logrank p-values and no significant results in most cases. Semi-supervised and supervised methods had improved performance over unsupervised methods and very significant p-values. These results indicate that unsupervised methods are not capable of identifying clusters with significant differences in survival prognosis in most cases, while supervised and semi-supervised methods can better cluster patients into clinically useful subtypes.

摘要

基于基因表达数据的肺癌亚型分类对于识别具有不同生存预后的患者亚组很重要，有助于为每个患者亚型制定定制化的治疗策略。无监督聚类方法是聚类患者为亚型的传统方法。然而，由于这些方法仅基于基因表达数据对患者进行聚类，因此得到的聚类结果可能并不总是与感兴趣的生存结果相关。近年来，提出了半监督和监督方法，利用生存结果数据来识别与生存预后更相关的聚类。本文旨在比较基于两个肺腺癌数据集的不同聚类方法在识别具有临床预后意义的肺癌亚型方面的性能。对于每种方法，我们将患者聚类为两个聚类，并评估聚类之间患者生存时间的差异。无监督方法的对数秩 p 值较大，在大多数情况下没有显著结果。半监督和监督方法的性能优于无监督方法，p 值非常显著。这些结果表明，在大多数情况下，无监督方法无法识别生存预后存在显著差异的聚类，而监督和半监督方法可以更好地将患者聚类为具有临床意义的亚型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/624f/10630994/9204065ef254/13104_2023_6604_Fig1_HTML.jpg

相似文献

A comparative study of clustering methods on gene expression data for lung cancer prognosis.

BMC Res Notes. 2023 Nov 8;16(1):319. doi: 10.1186/s13104-023-06604-8.

Evaluation of immunohistochemical markers in non-small cell lung cancer by unsupervised hierarchical clustering analysis: a tissue microarray study of 284 cases and 18 markers.

J Pathol. 2004 Sep;204(1):101-9. doi: 10.1002/path.1612.

Supervised Graph Clustering for Cancer Subtyping Based on Survival Analysis and Integration of Multi-Omic Tumor Data.

IEEE/ACM Trans Comput Biol Bioinform. 2022 Mar-Apr;19(2):1193-1202. doi: 10.1109/TCBB.2020.3010509. Epub 2022 Apr 1.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Subtyping of children with developmental dyslexia via bootstrap aggregated clustering and the gap statistic: comparison with the double-deficit hypothesis.

Int J Lang Commun Disord. 2007 Jan-Feb;42(1):77-95. doi: 10.1080/13682820600806680.

Prognostic stratification of stage IIIA pN2 non-small cell lung cancer by hierarchical clustering analysis of tissue microarray immunostaining data: an Alpe Adria Thoracic Oncology Multidisciplinary Group study (ATOM 014).

J Thorac Oncol. 2010 Sep;5(9):1354-60. doi: 10.1097/JTO.0b013e3181e77a78.

Pathway-based deep clustering for molecular subtyping of cancer.

Methods. 2020 Feb 15;173:24-31. doi: 10.1016/j.ymeth.2019.06.017. Epub 2019 Jun 25.

A MicroRNA cluster at 14q32 drives aggressive lung adenocarcinoma.

Clin Cancer Res. 2014 Jun 15;20(12):3107-17. doi: 10.1158/1078-0432.CCR-13-3348. Epub 2014 May 15.

Casein kinase II alpha subunit and C1-inhibitor are independent predictors of outcome in patients with squamous cell carcinoma of the lung.

Clin Cancer Res. 2004 Sep 1;10(17):5792-803. doi: 10.1158/1078-0432.CCR-03-0317.

Identifying and evaluating clinical subtypes of Alzheimer's disease in care electronic health records using unsupervised machine learning.

BMC Med Inform Decis Mak. 2021 Dec 8;21(1):343. doi: 10.1186/s12911-021-01693-6.

引用本文的文献

Identifying and Diagnosing Lytic Cell Death Genes in Atherosclerosis Using Machine Learning and Bioinformatics.

J Inflamm Res. 2025 Jul 23;18:9767-9793. doi: 10.2147/JIR.S520039. eCollection 2025.

Survival guided adaptive clustering enhances mortality risk stratification and radiotherapy guidance in early stage uterine sarcoma.

Sci Rep. 2025 Jul 25;15(1):27055. doi: 10.1038/s41598-025-13139-4.

本文引用的文献

Consensus clustering methodology to improve molecular stratification of non-small cell lung cancer.

Sci Rep. 2023 May 12;13(1):7759. doi: 10.1038/s41598-023-33954-x.

Molecular subtyping in colorectal cancer: A bridge to personalized therapy (Review).

Oncol Lett. 2023 Apr 18;25(6):230. doi: 10.3892/ol.2023.13816. eCollection 2023 Jun.

Supervised clustering of high-dimensional data using regularized mixture modeling.

Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa291.

Pan-cancer identification of clinically relevant genomic subtypes using outcome-weighted integrative clustering.

Genome Med. 2020 Dec 3;12(1):110. doi: 10.1186/s13073-020-00804-8.

Molecular subtyping of cancer: current status and moving toward clinical applications.

Brief Bioinform. 2019 Mar 25;20(2):572-584. doi: 10.1093/bib/bby026.

Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer.

Cell. 2018 Apr 5;173(2):291-304.e6. doi: 10.1016/j.cell.2018.03.022.

Lung Cancer: Understanding Its Molecular Pathology and the 2015 WHO Classification.

Front Oncol. 2017 Aug 28;7:193. doi: 10.3389/fonc.2017.00193. eCollection 2017.

Integrated genomic characterization of oesophageal carcinoma.

Nature. 2017 Jan 12;541(7636):169-175. doi: 10.1038/nature20805. Epub 2017 Jan 4.

Molecular classification of gastric cancer.

Ann Oncol. 2016 May;27(5):763-9. doi: 10.1093/annonc/mdw040. Epub 2016 Feb 9.

Comprehensive molecular profiling of lung adenocarcinoma.

Nature. 2014 Jul 31;511(7511):543-50. doi: 10.1038/nature13385. Epub 2014 Jul 9.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

基于基因表达数据的肺癌预后聚类方法的比较研究。

A comparative study of clustering methods on gene expression data for lung cancer prognosis.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译