
Application of kernel principal component analysis and computational machine learning to exploration of metabolites strongly associated with diet.

Affiliations

RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 235-0045, Japan.

Graduate School of Medical Life Science, Yokohama City University, 1-7-29 Suehiro-cho, Tsurumi-ku, Yokohama, 230-0045, Japan.

Publication information

Sci Rep. 2018 Feb 21;8(1):3426. doi: 10.1038/s41598-018-20121-w.

DOI:10.1038/s41598-018-20121-w
PMID:29467421
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5821832/

Abstract

Computer-based technological innovation provides advancements in sophisticated and diverse analytical instruments, enabling massive amounts of data collection with relative ease. This is accompanied by a fast-growing demand for technological progress in data mining methods for analysis of big data derived from chemical and biological systems. From this perspective, use of a general "linear" multivariate analysis alone limits interpretations due to "non-linear" variations in metabolic data from living organisms. Here we describe a kernel principal component analysis (KPCA)-incorporated analytical approach for extracting useful information from metabolic profiling data. To overcome the limitation of important variable (metabolite) determinations, we incorporated a random forest conditional variable importance measure into our KPCA-based analytical approach to demonstrate the relative importance of metabolites. Using a market basket analysis, hippurate, the most important variable detected in the importance measure, was associated with high levels of some vitamins and minerals present in foods eaten the previous day, suggesting a relationship between increased hippurate and intake of a wide variety of vegetables and fruits. Therefore, the KPCA-incorporated analytical approach described herein enabled us to capture input-output responses, and should be useful not only for metabolic profiling but also for profiling in other areas of biological and environmental systems.
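
The market basket step mentioned in the abstract can be illustrated with a minimal sketch of the standard association-rule metrics (support, confidence, lift). This is not the authors' code or data: the function name `rule_metrics`, the item labels, and the eight toy "transactions" below are all hypothetical, chosen only to show how a rule such as "high vitamin intake the previous day -> high hippurate" might be scored.

```python
def rule_metrics(transactions, antecedent, consequent):
    """Return (support, confidence, lift) for the rule antecedent -> consequent.

    transactions: list of sets of item labels (one set per sample).
    """
    antecedent = frozenset(antecedent)
    consequent = frozenset(consequent)
    n = len(transactions)
    if n == 0:
        raise ValueError("need at least one transaction")

    # Count samples containing the antecedent, the consequent, and both.
    n_a = sum(1 for t in transactions if antecedent <= t)
    n_c = sum(1 for t in transactions if consequent <= t)
    n_both = sum(1 for t in transactions if (antecedent | consequent) <= t)

    support = n_both / n                          # P(A and C)
    confidence = n_both / n_a if n_a else 0.0     # P(C | A)
    lift = confidence / (n_c / n) if n_c else 0.0  # P(C | A) / P(C)
    return support, confidence, lift


# Hypothetical toy data (NOT from the study): each set is one sample,
# holding discretized flags for the previous day's intake and for the
# metabolite level measured afterwards.
TRANSACTIONS = [
    frozenset({"vitamin_high", "mineral_high", "hippurate_high"}),
    frozenset({"vitamin_high", "hippurate_high"}),
    frozenset({"vitamin_high", "mineral_high", "hippurate_high"}),
    frozenset({"mineral_high"}),
    frozenset({"vitamin_high"}),
    frozenset(),
    frozenset({"mineral_high", "hippurate_high"}),
    frozenset(),
]

support, confidence, lift = rule_metrics(
    TRANSACTIONS, {"vitamin_high"}, {"hippurate_high"}
)
print(f"support={support:.3f} confidence={confidence:.3f} lift={lift:.3f}")
```

A lift above 1 means the consequent occurs more often alongside the antecedent than it does overall, which is the sense in which a rule is read as an association. It does not by itself establish a causal link between intake and metabolite level.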
