Analysis of breast cancer progression using principal component analysis and clustering.

Author information

Alexe G, Dalgin G S, Ganesan S, Delisi C, Bhanot G

Affiliation

The Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA.

Publication information

J Biosci. 2007 Aug;32(5):1027-39. doi: 10.1007/s12038-007-0102-4.

DOI:10.1007/s12038-007-0102-4

PMID:17914245

Abstract

We develop a new technique to analyse microarray data which uses a combination of principal components analysis and consensus ensemble k-clustering to find robust clusters and gene markers in the data. We apply our method to a public microarray breast cancer dataset which has expression levels of genes in normal samples as well as in three pathological stages of disease; namely, atypical ductal hyperplasia or ADH, ductal carcinoma in situ or DCIS and invasive ductal carcinoma or IDC. Our method averages over clustering techniques and data perturbation to find stable, robust clusters and gene markers. We identify the clusters and their pathways with distinct subtypes of breast cancer (Luminal, Basal and Her2+). We confirm that the cancer phenotype develops early (in early hyperplasia or ADH stage) and find from our analysis that each subtype progresses from ADH to DCIS to IDC along its own specific pathway, as if each was a distinct disease.
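The combination described in the abstract (dimension reduction by principal components, then clustering repeated under data perturbation and summarised as a consensus) can be illustrated with a small sketch. This is not the authors' implementation: it assumes a single clustering technique (plain k-means) where the paper averages over several, uses random subsampling as the only perturbation, and runs on synthetic data. All function names (`center`, `top_components`, `consensus_matrix`, `demo`) and parameters (`n_runs`, `frac`) are hypothetical and chosen for illustration.

```python
import random

def center(rows):
    """Subtract each column (gene) mean so PCA sees variation, not offsets."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - means[j] for j in range(d)] for r in rows]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def top_components(rows, n_comp, iters=200, seed=0):
    """Approximate leading principal directions by power iteration with deflation."""
    rng = random.Random(seed)
    d = len(rows[0])
    comps = []
    for _ in range(n_comp):
        v = [rng.gauss(0, 1) for _ in range(d)]
        for _ in range(iters):
            # Remove the directions already found, then apply X^T X to v.
            for c in comps:
                s = dot(v, c)
                v = [a - s * b for a, b in zip(v, c)]
            proj = [dot(r, v) for r in rows]
            v = [sum(p * r[j] for p, r in zip(proj, rows)) for j in range(d)]
            norm = dot(v, v) ** 0.5 or 1.0
            v = [a / norm for a in v]
        comps.append(v)
    return comps

def project(rows, comps):
    """Coordinates of each sample in the reduced principal-component space."""
    return [[dot(r, c) for c in comps] for r in rows]

def dist2(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, k, rng, iters=20):
    """Plain Lloyd's k-means; returns one cluster label per point."""
    centers = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: dist2(p, centers[c])) for p in points]
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old center if a cluster goes empty
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def consensus_matrix(points, k, n_runs=50, frac=0.8, seed=0):
    """Fraction of subsampled runs in which each pair of samples co-clusters."""
    rng = random.Random(seed)
    n = len(points)
    together = [[0] * n for _ in range(n)]
    sampled = [[0] * n for _ in range(n)]
    for _ in range(n_runs):
        idx = rng.sample(range(n), max(k, int(frac * n)))
        labels = kmeans([points[i] for i in idx], k, rng)
        for a, i in enumerate(idx):
            for b, j in enumerate(idx):
                sampled[i][j] += 1
                together[i][j] += labels[a] == labels[b]
    return [[together[i][j] / sampled[i][j] if sampled[i][j] else 0.0
             for j in range(n)] for i in range(n)]

def demo():
    """Synthetic stand-in for expression data: two groups of 10 samples, 6 genes."""
    rng = random.Random(1)
    group_a = [[rng.gauss(0, 0.3) for _ in range(6)] for _ in range(10)]
    group_b = [[rng.gauss(4, 0.3) for _ in range(6)] for _ in range(10)]
    data = center(group_a + group_b)
    reduced = project(data, top_components(data, n_comp=2))
    return consensus_matrix(reduced, k=2)

if __name__ == "__main__":
    m = demo()
    # Compare a within-group pair with a between-group pair.
    print(round(m[0][1], 2), round(m[0][10], 2))
```

Entries of the consensus matrix near 1 mark pairs of samples that cluster together regardless of which subsample was drawn, and entries near 0 mark pairs that are consistently separated. Reading stable clusters off such a matrix is one common way to make "robust" concrete; the paper's own procedure may differ in its details.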

Similar articles

Portraits of breast cancer progression.

BMC Bioinformatics. 2007 Aug 6;8:291. doi: 10.1186/1471-2105-8-291.

Breast cancer stratification from analysis of micro-array data of micro-dissected specimens.

Genome Inform. 2007;18:130-40.

Progression-specific genes identified by expression profiling of matched ductal carcinomas in situ and invasive breast tumors, combining laser capture microdissection and oligonucleotide microarray analysis.

Cancer Res. 2006 May 15;66(10):5278-86. doi: 10.1158/0008-5472.CAN-05-4610.

Evidence that molecular changes in cells occur before morphological alterations during the progression of breast ductal carcinoma.

Breast Cancer Res. 2008;10(5):R87. doi: 10.1186/bcr2157. Epub 2008 Oct 17.

Comparison of HER2 amplification status among breast cancer subgroups offers new insights in pathways of breast cancer progression.

Virchows Arch. 2017 Nov;471(5):575-587. doi: 10.1007/s00428-017-2161-8. Epub 2017 May 31.

HER2 as a prognostic factor in breast cancer.

Oncology. 2001;61 Suppl 2:67-72. doi: 10.1159/000055404.

Selection and evolution in the genomic landscape of copy number alterations in ductal carcinoma in situ (DCIS) and its progression to invasive carcinoma of ductal/no special type: a meta-analysis.

Breast Cancer Res Treat. 2015 Aug;153(1):101-21. doi: 10.1007/s10549-015-3509-x. Epub 2015 Aug 9.

Gene expression profiling of ductal carcinomas in situ and invasive breast tumors.

Anticancer Res. 2003 May-Jun;23(3A):2043-51.

Refinement of breast cancer classification by molecular characterization of histological special types.

J Pathol. 2008 Oct;216(2):141-50. doi: 10.1002/path.2407.

Cited by

Machine Learning Empowered a Graphical User Interface on Native Fluorescence to Predict Breast Cancer.

ACS Omega. 2025 May 14;10(20):20315-20325. doi: 10.1021/acsomega.4c11669. eCollection 2025 May 27.

Autoencoder-based multimodal prediction of non-small cell lung cancer survival.

Sci Rep. 2023 Sep 22;13(1):15761. doi: 10.1038/s41598-023-42365-x.

Patient subgrouping with distinct survival rates via integration of multiomics data on a Grassmann manifold.

BMC Med Inform Decis Mak. 2022 Jul 23;22(1):190. doi: 10.1186/s12911-022-01938-y.

AI applications in functional genomics.

Comput Struct Biotechnol J. 2021 Oct 11;19:5762-5790. doi: 10.1016/j.csbj.2021.10.009. eCollection 2021.

Deep learning-based ovarian cancer subtypes identification using multi-omics data.

BioData Min. 2020 Aug 24;13:10. doi: 10.1186/s13040-020-00222-x. eCollection 2020.

The Application of Deep Learning in Cancer Prognosis Prediction.

Cancers (Basel). 2020 Mar 5;12(3):603. doi: 10.3390/cancers12030603.

Dysregulated lncRNA-miRNA-mRNA Network Reveals Patient Survival-Associated Modules and RNA Binding Proteins in Invasive Breast Carcinoma.

Front Genet. 2020 Jan 15;10:1284. doi: 10.3389/fgene.2019.01284. eCollection 2019.

Deep Learning-Based Multi-Omics Data Integration Reveals Two Prognostic Subtypes in High-Risk Neuroblastoma.

Front Genet. 2018 Oct 18;9:477. doi: 10.3389/fgene.2018.00477. eCollection 2018.

HER2/ErbB2-induced breast cancer cell migration and invasion require p120 catenin activation of Rac1 and Cdc42.

J Biol Chem. 2010 Sep 17;285(38):29491-501. doi: 10.1074/jbc.M110.136770. Epub 2010 Jul 1.

Presence of an in situ component is associated with reduced biological aggressiveness of size-matched invasive breast cancer.

Br J Cancer. 2010 Apr 27;102(9):1391-6. doi: 10.1038/sj.bjc.6605655.

References

Data perturbation independent diagnosis and validation of breast cancer subtypes using clustering and patterns.

Cancer Inform. 2007 Feb 19;2:243-74.

Survivin, a novel anti-apoptosis inhibitor, expression in uterine cervical cancer and relationship with prognostic factors.

Int J Gynecol Cancer. 2005 Jan-Feb;15(1):113-9. doi: 10.1111/j.1048-891X.2005.15011.x.

Survivin, Survivin-2B, and Survivin-deltaEx3 expression in medulloblastoma: biologic markers of tumour morphology and clinical outcome.

Br J Cancer. 2005 Jan 31;92(2):359-65. doi: 10.1038/sj.bjc.6602317.

A gene network for navigating the literature.

Nat Genet. 2004 Jul;36(7):664. doi: 10.1038/ng0704-664.

Repeated observation of breast tumor subtypes in independent gene expression data sets.

Proc Natl Acad Sci U S A. 2003 Jul 8;100(14):8418-23. doi: 10.1073/pnas.0932692100. Epub 2003 Jun 26.

DAVID: Database for Annotation, Visualization, and Integrated Discovery.

Genome Biol. 2003;4(5):P3. Epub 2003 Apr 3.

Gene expression profiles of human breast cancer progression.

Proc Natl Acad Sci U S A. 2003 May 13;100(10):5974-9. doi: 10.1073/pnas.0931261100. Epub 2003 Apr 24.

MatchMiner: a tool for batch navigation among gene and gene product identifiers.

Genome Biol. 2003;4(4):R27. doi: 10.1186/gb-2003-4-4-r27. Epub 2003 Mar 25.

Molecular portraits of human breast tumours.

Nature. 2000 Aug 17;406(6797):747-52. doi: 10.1038/35021093.

The hallmarks of cancer.

Cell. 2000 Jan 7;100(1):57-70. doi: 10.1016/s0092-8674(00)81683-9.
