• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单纯形结构矩阵分解:软聚类在代谢组学数据中的应用。

Simplex-structured matrix factorisation: application of soft clustering to metabolomic data.

作者信息

Liu Wenxuan, Murphy Thomas Brendan, Brennan Lorraine

机构信息

UCD School of Agriculture and Food Science, Institute of Food and Health, University College Dublin, Belfield, Dublin, D04 V1W8, Ireland.

UCD School of Mathematics and Statistics, University College Dublin, Belfield, Dublin, D04 V1W8, Ireland.

出版信息

Sci Rep. 2025 May 22;15(1):17817. doi: 10.1038/s41598-025-02361-9.

DOI:10.1038/s41598-025-02361-9
PMID:40404736
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12098731/
Abstract

Metabolomics is the measurement of metabolites in biological samples to reveal information on metabolic pathways and phenotypes. Cluster analysis is a popular multivariate technique employed in metabolomics to characterise observations with similar features. Previous work in the field has applied hard clustering approaches to group observations into distinct clusters. This approach can be overly restrictive in some practical applications. Therefore, there is a growing need for soft clustering methods that allow for the clustering of observations into more than one cluster. Simplex-structured matrix factorisation (SSMF) is proposed and applied in a simulation study and to a metabolomic dataset to demonstrate its utility for soft clustering. In the simulation study, the cluster prototypes and cluster memberships were well estimated. In the real data application to metabolomic data, the presence of four soft clusters was suggested by the gap statistic. Furthermore, the Shannon diversity index indicated that several observations have memberships in three clusters. Additionally, the introduction of the covariates sex, age and BMI revealed that sex and age mainly associated with the cluster memberships. The results indicate that a majority of men and young people were in the cluster predominantly characterised by high levels of amino acids and low levels of phosphatidylcholines and sphingomyelins. However, a high proportion of older people were characterised by low levels of amino acids, biogenic amines, acylcarnitines and lysophosphatidylcholines. The SSMF presented successfully estimates a soft clustering of the metabolomic data. It provides an interpretable representation of the data structure using the cluster prototypes combined with cluster memberships. A software package called MetabolSSMF has been developed, which is freely available as an R package, to facilitate the implementation of soft clustering in the field of metabolomics.

摘要

代谢组学是对生物样品中的代谢物进行测量,以揭示有关代谢途径和表型的信息。聚类分析是代谢组学中常用的一种多变量技术,用于表征具有相似特征的观察结果。该领域以前的工作采用硬聚类方法将观察结果分组为不同的簇。这种方法在某些实际应用中可能过于严格。因此,越来越需要软聚类方法,该方法允许将观察结果聚类到多个簇中。提出了单纯形结构矩阵分解(SSMF),并将其应用于模拟研究和代谢组学数据集,以证明其在软聚类中的效用。在模拟研究中,簇原型和簇成员得到了很好的估计。在代谢组学数据的实际应用中,间隙统计表明存在四个软簇。此外,香农多样性指数表明,一些观察结果在三个簇中都有成员资格。此外,引入协变量性别、年龄和体重指数表明,性别和年龄主要与簇成员资格相关。结果表明,大多数男性和年轻人处于主要以高水平氨基酸和低水平磷脂酰胆碱及鞘磷脂为特征的簇中。然而,高比例的老年人的特征是氨基酸、生物胺、酰基肉碱和溶血磷脂酰胆碱水平较低。SSMF成功地对代谢组学数据进行了软聚类估计。它使用簇原型和簇成员资格提供了数据结构的可解释表示。已经开发了一个名为MetabolSSMF的软件包,作为R包免费提供,以促进代谢组学领域软聚类的实施。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d40/12098731/baf6c063e246/41598_2025_2361_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d40/12098731/36e64b668581/41598_2025_2361_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d40/12098731/e239c8b677f8/41598_2025_2361_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d40/12098731/1e7176254330/41598_2025_2361_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d40/12098731/e38efeb7e997/41598_2025_2361_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d40/12098731/2bd092b1bf59/41598_2025_2361_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d40/12098731/baf6c063e246/41598_2025_2361_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d40/12098731/36e64b668581/41598_2025_2361_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d40/12098731/e239c8b677f8/41598_2025_2361_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d40/12098731/1e7176254330/41598_2025_2361_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d40/12098731/e38efeb7e997/41598_2025_2361_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d40/12098731/2bd092b1bf59/41598_2025_2361_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d40/12098731/baf6c063e246/41598_2025_2361_Fig6_HTML.jpg

相似文献

1
Simplex-structured matrix factorisation: application of soft clustering to metabolomic data.单纯形结构矩阵分解:软聚类在代谢组学数据中的应用。
Sci Rep. 2025 May 22;15(1):17817. doi: 10.1038/s41598-025-02361-9.
2
Metabolic phenotypes and vitamin D response in the critically ill: A metabolomic cohort study.危重症患者的代谢表型和维生素 D 反应:代谢组学队列研究。
Clin Nutr. 2024 Nov;43(11):10-19. doi: 10.1016/j.clnu.2024.09.030. Epub 2024 Sep 18.
3
Prepregnancy Body Mass Index and Lipoprotein Fractions are Associated with Changes in Women's Serum Metabolome from Late Pregnancy to the First Months of Postpartum.妊娠前体重指数和脂蛋白亚组分与女性晚孕期至产后头几个月血清代谢组学变化相关。
J Nutr. 2023 Jan;153(1):56-65. doi: 10.1016/j.tjnut.2022.12.005. Epub 2022 Dec 26.
4
clusterBMA: Bayesian model averaging for clustering.聚类 BMA:用于聚类的贝叶斯模型平均。
PLoS One. 2023 Aug 21;18(8):e0288000. doi: 10.1371/journal.pone.0288000. eCollection 2023.
5
The human plasma-metabolome: Reference values in 800 French healthy volunteers; impact of cholesterol, gender and age.人类血浆代谢组:800名法国健康志愿者的参考值;胆固醇、性别和年龄的影响。
PLoS One. 2017 Mar 9;12(3):e0173615. doi: 10.1371/journal.pone.0173615. eCollection 2017.
6
Deconstructing the pig sex metabolome: Targeted metabolomics in heavy pigs revealed sexual dimorphisms in plasma biomarkers and metabolic pathways.解析猪的性别代谢组:对育肥猪进行靶向代谢组学研究揭示了血浆生物标志物和代谢途径中的性别二态性。
J Anim Sci. 2015 Dec;93(12):5681-93. doi: 10.2527/jas.2015-9528.
7
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
8
Bi-clustering of metabolic data using matrix factorization tools.基于矩阵分解工具的代谢数据双聚类分析。
Methods. 2018 Dec 1;151:12-20. doi: 10.1016/j.ymeth.2018.02.004. Epub 2018 Feb 10.
9
GLIO-Select: Machine Learning-Based Feature Selection and Weighting of Tissue and Serum Proteomic and Metabolomic Data Uncovers Sex Differences in Glioblastoma.GLIO-Select:基于机器学习的胶质母细胞瘤组织和血清蛋白质组学及代谢组学数据特征选择与加权揭示性别差异
Int J Mol Sci. 2025 May 2;26(9):4339. doi: 10.3390/ijms26094339.
10
Reliability of Serum Metabolites over a Two-Year Period: A Targeted Metabolomic Approach in Fasting and Non-Fasting Samples from EPIC.血清代谢物在两年时间内的可靠性:一项针对欧洲癌症与营养前瞻性调查(EPIC)空腹和非空腹样本的靶向代谢组学研究。
PLoS One. 2015 Aug 14;10(8):e0135437. doi: 10.1371/journal.pone.0135437. eCollection 2015.

本文引用的文献

1
Implementing routine collection of EQ-5D-5L in a breast cancer outpatient clinic.在乳腺癌门诊常规收集 EQ-5D-5L。
PLoS One. 2024 Aug 27;19(8):e0307225. doi: 10.1371/journal.pone.0307225. eCollection 2024.
2
Etiologies underlying subtypes of long-standing type 2 diabetes.长期 2 型糖尿病亚型的潜在病因。
PLoS One. 2024 May 28;19(5):e0304036. doi: 10.1371/journal.pone.0304036. eCollection 2024.
3
Assessment of the reliability and quality of breast cancer related videos on TikTok and Bilibili: cross-sectional study in China.
TikTok和哔哩哔哩上乳腺癌相关视频的可靠性和质量评估:中国的横断面研究
Front Public Health. 2024 Jan 22;11:1296386. doi: 10.3389/fpubh.2023.1296386. eCollection 2023.
4
Serum acylcarnitines levels as a potential predictor for gestational diabetes: a systematic review and meta-analysis.血清酰基肉碱水平作为预测妊娠糖尿病的潜在指标:系统评价和荟萃分析。
Front Public Health. 2023 Jul 4;11:1217237. doi: 10.3389/fpubh.2023.1217237. eCollection 2023.
5
Metabolic Disposition and Elimination of Tritum-Labeled Sulfamethoxazole in Pigs, Chickens and Rats.猪、鸡和大鼠体内氚标记磺胺甲恶唑的代谢处置与消除
Metabolites. 2022 Dec 30;13(1):57. doi: 10.3390/metabo13010057.
6
Applications of machine learning in metabolomics: Disease modeling and classification.机器学习在代谢组学中的应用:疾病建模与分类。
Front Genet. 2022 Nov 24;13:1017340. doi: 10.3389/fgene.2022.1017340. eCollection 2022.
7
Identification and Validation of Yak () Frozen-Thawed Sperm Proteins Associated with Capacitation and the Acrosome Reaction.牦牛()冻融精子蛋白的鉴定和验证与获能和顶体反应相关。
J Proteome Res. 2022 Nov 4;21(11):2754-2770. doi: 10.1021/acs.jproteome.2c00528. Epub 2022 Oct 17.
8
Acylcarnitines: Nomenclature, Biomarkers, Therapeutic Potential, Drug Targets, and Clinical Trials.酰基肉碱:命名、生物标志物、治疗潜力、药物靶点和临床试验。
Pharmacol Rev. 2022 Jul;74(3):506-551. doi: 10.1124/pharmrev.121.000408.
9
Guide to Metabolomics Analysis: A Bioinformatics Workflow.代谢组学分析指南:一种生物信息学工作流程。
Metabolites. 2022 Apr 15;12(4):357. doi: 10.3390/metabo12040357.
10
Four groups of type 2 diabetes contribute to the etiological and clinical heterogeneity in newly diagnosed individuals: An IMI DIRECT study.四组 2 型糖尿病导致新诊断个体的病因学和临床异质性:一项 IMI DIRECT 研究。
Cell Rep Med. 2022 Jan 4;3(1):100477. doi: 10.1016/j.xcrm.2021.100477. eCollection 2022 Jan 18.