• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用基于机器学习的无监督聚类分析对原发性乳腺癌患者进行表型映射。

Phenomapping of Patients with Primary Breast Cancer Using Machine Learning-Based Unsupervised Cluster Analysis.

作者信息

Ferro Sara, Bottigliengo Daniele, Gregori Dario, Fabricio Aline S C, Gion Massimo, Baldi Ileana

机构信息

Unit of Biostatistics, Epidemiology and Public Health, Department of Cardiac Thoracic Vascular Sciences and Public Health, University of Padova, Via Loredan 18, 35121 Padova, Italy.

Veneto Institute of Oncology IOV-IRCCS, 35128 Padua, Italy.

出版信息

J Pers Med. 2021 Apr 5;11(4):272. doi: 10.3390/jpm11040272.

DOI:10.3390/jpm11040272
PMID:33916398
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8067194/
Abstract

Primary breast cancer (PBC) is a heterogeneous disease at the clinical, histopathological, and molecular levels. The improved classification of PBC might be important to identify subgroups of the disease, relevant to patient management. Machine learning algorithms may allow a better understanding of the relationships within heterogeneous clinical syndromes. This work aims to show the potential of unsupervised learning techniques for improving classification in PBC. A dataset of 712 women with PBC is used as a motivating example. A set of variables containing biological prognostic parameters is considered to define groups of individuals. Four different clustering methods are used: K-means, self-organising maps, hierarchical agglomerative (HAC), and Gaussian mixture models clustering. HAC outperforms the other clustering methods. With an optimal partitioning parameter, the methods identify two clusters with different clinical profiles. Patients in the first cluster are younger and have lower values of the oestrogen receptor (ER) and progesterone receptor (PgR) than patients in the second cluster. Moreover, cathepsin D values are lower in the first cluster. The three most important variables identified by the HAC are: age, ER, and PgR. Unsupervised learning seems a suitable alternative for the analysis of PBC data, opening up new perspectives in the particularly active domain of dissecting clinical heterogeneity.

摘要

原发性乳腺癌(PBC)在临床、组织病理学和分子水平上是一种异质性疾病。PBC分类的改进对于识别该疾病的亚组可能很重要,这与患者管理相关。机器学习算法可能有助于更好地理解异质性临床综合征之间的关系。这项工作旨在展示无监督学习技术在改善PBC分类方面的潜力。以712名PBC女性患者的数据集作为一个激励性示例。考虑一组包含生物学预后参数的变量来定义个体组。使用了四种不同的聚类方法:K均值、自组织映射、层次凝聚(HAC)和高斯混合模型聚类。HAC优于其他聚类方法。通过一个最优划分参数,这些方法识别出两个具有不同临床特征的聚类。第一聚类中的患者比第二聚类中的患者更年轻,雌激素受体(ER)和孕激素受体(PgR)的值更低。此外,第一聚类中的组织蛋白酶D值更低。HAC识别出的三个最重要变量是:年龄、ER和PgR。无监督学习似乎是分析PBC数据的一种合适替代方法,在剖析临床异质性这个特别活跃的领域开辟了新的视角。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0284/8067194/58e9cb46994a/jpm-11-00272-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0284/8067194/348730c13017/jpm-11-00272-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0284/8067194/58e9cb46994a/jpm-11-00272-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0284/8067194/348730c13017/jpm-11-00272-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0284/8067194/58e9cb46994a/jpm-11-00272-g002.jpg

相似文献

1
Phenomapping of Patients with Primary Breast Cancer Using Machine Learning-Based Unsupervised Cluster Analysis.使用基于机器学习的无监督聚类分析对原发性乳腺癌患者进行表型映射。
J Pers Med. 2021 Apr 5;11(4):272. doi: 10.3390/jpm11040272.
2
Sheep's coping style can be identified by unsupervised machine learning from unlabeled data.通过对无标签数据进行无监督机器学习,可以识别出绵羊的应对方式。
Behav Processes. 2022 Jan;194:104559. doi: 10.1016/j.beproc.2021.104559. Epub 2021 Nov 25.
3
Clinical fracture risk evaluated by hierarchical agglomerative clustering.通过层次凝聚聚类评估临床骨折风险。
Osteoporos Int. 2017 Mar;28(3):819-832. doi: 10.1007/s00198-016-3828-8. Epub 2016 Nov 16.
4
Comparison of machine learning clustering algorithms for detecting heterogeneity of treatment effect in acute respiratory distress syndrome: A secondary analysis of three randomised controlled trials.机器学习聚类算法在急性呼吸窘迫综合征治疗效果异质性检测中的比较:三项随机对照试验的二次分析。
EBioMedicine. 2021 Dec;74:103697. doi: 10.1016/j.ebiom.2021.103697. Epub 2021 Dec 1.
5
Comparison of Unsupervised Machine Learning Approaches for Cluster Analysis to Define Subgroups of Heart Failure with Preserved Ejection Fraction with Different Outcomes.用于聚类分析以定义射血分数保留的心力衰竭不同结局亚组的无监督机器学习方法比较
Bioengineering (Basel). 2022 Apr 16;9(4):175. doi: 10.3390/bioengineering9040175.
6
Identification and analysis of behavioral phenotypes in autism spectrum disorder via unsupervised machine learning.通过无监督机器学习识别和分析自闭症谱系障碍的行为表型。
Int J Med Inform. 2019 Sep;129:29-36. doi: 10.1016/j.ijmedinf.2019.05.006. Epub 2019 May 12.
7
Comparison of unsupervised machine-learning methods to identify metabolomic signatures in patients with localized breast cancer.用于识别局部乳腺癌患者代谢组学特征的无监督机器学习方法比较
Comput Struct Biotechnol J. 2020 Jun 3;18:1509-1524. doi: 10.1016/j.csbj.2020.05.021. eCollection 2020.
8
Molecular descriptor analysis of approved drugs using unsupervised learning for drug repurposing.使用无监督学习对已批准药物进行分子描述符分析,以实现药物再利用。
Comput Biol Med. 2021 Nov;138:104856. doi: 10.1016/j.compbiomed.2021.104856. Epub 2021 Sep 10.
9
Distributed dual vigilance fuzzy adaptive resonance theory learns online, retrieves arbitrarily-shaped clusters, and mitigates order dependence.分布式双警戒模糊自适应共振理论在线学习,检索任意形状的聚类,并减轻顺序相关性。
Neural Netw. 2020 Jan;121:208-228. doi: 10.1016/j.neunet.2019.08.033. Epub 2019 Sep 9.
10
Hormone receptor status of a contralateral breast cancer is independent of the receptor status of the first primary in patients not receiving adjuvant tamoxifen.在未接受辅助性他莫昔芬治疗的患者中,对侧乳腺癌的激素受体状态独立于首个原发性肿瘤的受体状态。
J Clin Oncol. 2005 Jul 20;23(21):4687-94. doi: 10.1200/JCO.2005.04.076. Epub 2005 Apr 18.

引用本文的文献

1
Machine Learning Methods for Small Data Challenges in Molecular Science.机器学习方法在分子科学中小数据挑战中的应用。
Chem Rev. 2023 Jul 12;123(13):8736-8780. doi: 10.1021/acs.chemrev.3c00189. Epub 2023 Jun 29.
2
Cyclin A2 and Ki-67 proliferation markers could be used to identify tumors with poor prognosis in African American women with breast cancer.细胞周期蛋白A2和Ki-67增殖标志物可用于识别患有乳腺癌的非裔美国女性中预后不良的肿瘤。
J Cancer Biol. 2023;4(1):3-16. doi: 10.46439/cancerbiology.4.048.
3
The Circular RNA Circ_0085494 Regulates Prostate Cancer Progression Through NRBP1/miR-497-5p Axis.

本文引用的文献

1
Comparison of unsupervised machine-learning methods to identify metabolomic signatures in patients with localized breast cancer.用于识别局部乳腺癌患者代谢组学特征的无监督机器学习方法比较
Comput Struct Biotechnol J. 2020 Jun 3;18:1509-1524. doi: 10.1016/j.csbj.2020.05.021. eCollection 2020.
2
Machine-learned identification of psychological subgroups with relation to pain interference in patients after breast cancer treatments.机器学习识别乳腺癌治疗后患者疼痛干扰相关的心理亚组。
Breast. 2020 Apr;50:71-80. doi: 10.1016/j.breast.2020.01.042. Epub 2020 Feb 7.
3
Gaussian Mixture Models for Probabilistic Classification of Breast Cancer.
环状 RNA Circ_0085494 通过 NRBP1/miR-497-5p 轴调控前列腺癌进展。
Biochem Genet. 2023 Oct;61(5):1775-1790. doi: 10.1007/s10528-023-10341-9. Epub 2023 Feb 15.
4
Vitamin D Deficiency in Women with Breast Cancer: A Correlation with Osteoporosis? A Machine Learning Approach with Multiple Factor Analysis.维生素 D 缺乏与乳腺癌妇女:与骨质疏松症的相关性?基于多因素分析的机器学习方法。
Nutrients. 2022 Apr 11;14(8):1586. doi: 10.3390/nu14081586.
高斯混合模型在乳腺癌概率分类中的应用。
Cancer Res. 2019 Jul 1;79(13):3492-3502. doi: 10.1158/0008-5472.CAN-19-0573. Epub 2019 May 21.
4
Machine Learning With K-Means Dimensional Reduction for Predicting Survival Outcomes in Patients With Breast Cancer.采用K均值降维的机器学习方法预测乳腺癌患者的生存结局
Cancer Inform. 2018 Nov 9;17:1176935118810215. doi: 10.1177/1176935118810215. eCollection 2018.
5
Hierarchical clustering of activated proteins in the PI3K and MAPK pathways in ER-positive, HER2-negative breast cancer with potential therapeutic consequences.PI3K 和 MAPK 通路中激活蛋白在 ER 阳性、HER2 阴性乳腺癌中的层次聚类,具有潜在的治疗意义。
Br J Cancer. 2018 Oct;119(7):832-839. doi: 10.1038/s41416-018-0221-8. Epub 2018 Oct 5.
6
Artificial Intelligence in Cardiology.人工智能在心脏病学中的应用。
J Am Coll Cardiol. 2018 Jun 12;71(23):2668-2679. doi: 10.1016/j.jacc.2018.03.521.
7
Patient similarity for precision medicine: A systematic review.精准医学中的患者相似性:系统评价。
J Biomed Inform. 2018 Jul;83:87-96. doi: 10.1016/j.jbi.2018.06.001. Epub 2018 Jun 1.
8
optCluster: An R Package for Determining the Optimal Clustering Algorithm.optCluster:一个用于确定最优聚类算法的R软件包。
Bioinformation. 2017 Mar 31;13(3):101-103. doi: 10.6026/97320630013101. eCollection 2017.
9
Integrative clustering reveals a novel split in the luminal A subtype of breast cancer with impact on outcome.综合聚类分析揭示了乳腺癌腔面A型亚型的一种新分类,这对预后有影响。
Breast Cancer Res. 2017 Mar 29;19(1):44. doi: 10.1186/s13058-017-0812-y.
10
mclust 5: Clustering, Classification and Density Estimation Using Gaussian Finite Mixture Models.mclust 5:使用高斯有限混合模型进行聚类、分类和密度估计
R J. 2016 Aug;8(1):289-317.