• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Predicting interpretability of metabolome models based on behavior, putative identity, and biological relevance of explanatory signals.基于解释性信号的行为、假定身份和生物学相关性预测代谢组学模型的可解释性。
Proc Natl Acad Sci U S A. 2006 Oct 3;103(40):14865-70. doi: 10.1073/pnas.0605152103. Epub 2006 Sep 21.
2
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
3
mzGroupAnalyzer--predicting pathways and novel chemical structures from untargeted high-throughput metabolomics data.mzGroupAnalyzer——从非靶向高通量代谢组学数据预测代谢途径和新型化学结构。
PLoS One. 2014 May 20;9(5):e96188. doi: 10.1371/journal.pone.0096188. eCollection 2014.
4
Representation, comparison, and interpretation of metabolome fingerprint data for total composition analysis and quality trait investigation in potato cultivars.马铃薯品种总成分分析和品质性状研究中代谢组指纹数据的表征、比较与解读
J Agric Food Chem. 2007 May 2;55(9):3444-51. doi: 10.1021/jf0701842. Epub 2007 Apr 7.
5
PAIRUP-MS: Pathway analysis and imputation to relate unknowns in profiles from mass spectrometry-based metabolite data.PAIRUP-MS:基于质谱的代谢物数据谱中未知物的途径分析和推断。
PLoS Comput Biol. 2019 Jan 14;15(1):e1006734. doi: 10.1371/journal.pcbi.1006734. eCollection 2019 Jan.
6
Genetic variation in the nuclear and organellar genomes modulates stochastic variation in the metabolome, growth, and defense.核基因组和细胞器基因组中的遗传变异调节了代谢组、生长和防御中的随机变异。
PLoS Genet. 2015 Jan 8;11(1):e1004779. doi: 10.1371/journal.pgen.1004779. eCollection 2015 Jan.
7
Application of metabolomics to plant genotype discrimination using statistics and machine learning.代谢组学在利用统计学和机器学习进行植物基因型鉴别中的应用。
Bioinformatics. 2002;18 Suppl 2:S241-8. doi: 10.1093/bioinformatics/18.suppl_2.s241.
8
Metabolomic correlation-network modules in Arabidopsis based on a graph-clustering approach.基于图聚类方法的拟南芥代谢组学相关网络模块
BMC Syst Biol. 2011 Jan 1;5:1. doi: 10.1186/1752-0509-5-1.
9
Enhancement of plant metabolite fingerprinting by machine learning.机器学习增强植物代谢产物指纹图谱分析。
Plant Physiol. 2010 Aug;153(4):1506-20. doi: 10.1104/pp.109.150524. Epub 2010 Jun 21.
10
Opening the Random Forest Black Box of the Metabolome by the Application of Surrogate Minimal Depth.通过应用替代最小深度打开代谢组学的随机森林黑箱
Metabolites. 2021 Dec 21;12(1):5. doi: 10.3390/metabo12010005.

引用本文的文献

1
A flow-injection mass spectrometry fingerprinting scaffold for feature selection and quantitation of Cordyceps and Ganoderma extracts in beverage: a predictive artificial neural network modelling strategy.一种用于特征选择和定量分析虫草和灵芝提取物的流动注射质谱指纹图谱支架:一种预测性人工神经网络建模策略。
AMB Express. 2012 Aug 13;2(1):43. doi: 10.1186/2191-0855-2-43.
2
Bioinformatics Tools for Mass Spectroscopy-Based Metabolomic Data Processing and Analysis.用于基于质谱的代谢组学数据处理与分析的生物信息学工具
Curr Bioinform. 2012 Mar;7(1):96-108. doi: 10.2174/157489312799304431.
3
Profiling the human response to physical exercise: a computational strategy for the identification and kinetic analysis of metabolic biomarkers.剖析人类对体育锻炼的反应:一种用于识别和动力学分析代谢生物标志物的计算策略。
J Clin Bioinforma. 2011 Dec 19;1(1):34. doi: 10.1186/2043-9113-1-34.
4
Bioinformatic-driven search for metabolic biomarkers in disease.基于生物信息学的疾病代谢生物标志物搜索
J Clin Bioinforma. 2011 Jan 20;1(1):2. doi: 10.1186/2043-9113-1-2.
5
Enhancement of plant metabolite fingerprinting by machine learning.机器学习增强植物代谢产物指纹图谱分析。
Plant Physiol. 2010 Aug;153(4):1506-20. doi: 10.1104/pp.109.150524. Epub 2010 Jun 21.
6
The role of mass spectrometry-based metabolomics in medical countermeasures against radiation.基于质谱的代谢组学在辐射医学对策中的作用。
Mass Spectrom Rev. 2010 May-Jun;29(3):503-21. doi: 10.1002/mas.20272.
7
Convergent Random Forest predictor: methodology for predicting drug response from genome-scale data applied to anti-TNF response.汇聚随机森林预测器:从基因组规模数据预测药物反应的方法,应用于抗 TNF 反应。
Genomics. 2009 Dec;94(6):423-32. doi: 10.1016/j.ygeno.2009.08.008. Epub 2009 Aug 20.
8
Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules'.使用MZedDB在精确质量代谢组学数据中进行代谢物信号识别,MZedDB是一种利用预测电离行为“规则”的交互式质荷比注释工具。
BMC Bioinformatics. 2009 Jul 21;10:227. doi: 10.1186/1471-2105-10-227.
9
Human urinary metabolomic profile of PPARalpha induced fatty acid beta-oxidation.过氧化物酶体增殖物激活受体α诱导脂肪酸β氧化的人尿代谢组学特征
J Proteome Res. 2009 Sep;8(9):4293-300. doi: 10.1021/pr9004103.
10
Prediction of high-responding peptides for targeted protein assays by mass spectrometry.通过质谱法预测用于靶向蛋白质分析的高反应性肽段。
Nat Biotechnol. 2009 Feb;27(2):190-8. doi: 10.1038/nbt.1524. Epub 2009 Jan 25.

本文引用的文献

1
Assessing the statistical significance of the achieved classification error of classifiers constructed using serum peptide profiles, and a prescription for random sampling repeated studies for massive high-throughput genomic and proteomic studies.评估使用血清肽谱构建的分类器所实现的分类误差的统计显著性,以及针对大规模高通量基因组和蛋白质组研究进行随机抽样重复研究的方案。
Cancer Inform. 2005;1(1):53-77.
2
The genetics of plant metabolism.植物代谢遗传学
Nat Genet. 2006 Jul;38(7):842-9. doi: 10.1038/ng1815. Epub 2006 Jun 4.
3
Thousands of samples are needed to generate a robust gene list for predicting outcome in cancer.需要数千个样本才能生成一个用于预测癌症预后的可靠基因列表。
Proc Natl Acad Sci U S A. 2006 Apr 11;103(15):5923-8. doi: 10.1073/pnas.0601231103. Epub 2006 Apr 3.
4
Large-scale human metabolomics studies: a strategy for data (pre-) processing and validation.大规模人类代谢组学研究:数据(预)处理与验证策略
Anal Chem. 2006 Jan 15;78(2):567-74. doi: 10.1021/ac051495j.
5
Gene selection and classification of microarray data using random forest.使用随机森林进行微阵列数据的基因选择与分类
BMC Bioinformatics. 2006 Jan 6;7:3. doi: 10.1186/1471-2105-7-3.
6
Hierarchical metabolomics demonstrates substantial compositional similarity between genetically modified and conventional potato crops.分层代谢组学表明转基因马铃薯作物和传统马铃薯作物之间存在显著的成分相似性。
Proc Natl Acad Sci U S A. 2005 Oct 4;102(40):14458-62. doi: 10.1073/pnas.0503955102. Epub 2005 Sep 26.
7
Measuring the metabolome: current analytical technologies.测量代谢组:当前的分析技术。
Analyst. 2005 May;130(5):606-25. doi: 10.1039/b418288j. Epub 2005 Mar 4.
8
Modelling of classification rules on metabolic patterns including machine learning and expert knowledge.基于代谢模式的分类规则建模,包括机器学习和专家知识。
J Biomed Inform. 2005 Apr;38(2):89-98. doi: 10.1016/j.jbi.2004.08.009.
9
Screening large-scale association study data: exploiting interactions using random forests.筛选大规模关联研究数据:利用随机森林探索相互作用
BMC Genet. 2004 Dec 10;5:32. doi: 10.1186/1471-2156-5-32.
10
Potential of metabolomics as a functional genomics tool.代谢组学作为功能基因组学工具的潜力。
Trends Plant Sci. 2004 Sep;9(9):418-25. doi: 10.1016/j.tplants.2004.07.004.

基于解释性信号的行为、假定身份和生物学相关性预测代谢组学模型的可解释性。

Predicting interpretability of metabolome models based on behavior, putative identity, and biological relevance of explanatory signals.

作者信息

Enot David P, Beckmann Manfred, Overy David, Draper John

机构信息

Institute of Biological Sciences, University of Wales, Aberystwyth SY23 3DA, United Kingdom.

出版信息

Proc Natl Acad Sci U S A. 2006 Oct 3;103(40):14865-70. doi: 10.1073/pnas.0605152103. Epub 2006 Sep 21.

DOI:10.1073/pnas.0605152103
PMID:16990432
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1595442/
Abstract

Powerful algorithms are required to deal with the dimensionality of metabolomics data. Although many achieve high classification accuracy, the models they generate have limited value unless it can be demonstrated that they are reproducible and statistically relevant to the biological problem under investigation. Random forest (RF) generates models, without any requirement for dimensionality reduction or feature selection, in which individual variables are ranked for significance and displayed in an explicit manner. In metabolome fingerprinting by mass spectrometry, each metabolite can be represented by signals at several m/z. Exploiting a prior understanding of expected biochemical differences between sample classes, we aimed to develop meaningful metrics relevant to the significance both of the overall RF model and individual, potentially explanatory, signals. Pair-wise comparison of related plant genotypes with strong phenotypic differences demonstrated that robust models are not only reproducible but also logically structured, highlighting correlated m/z derived from just a small number of explanatory metabolites reflecting the biological differences between sample classes. RF models were also generated by using groupings of samples known to be increasingly phenotypically similar. Although classification accuracy was often reasonable, we demonstrated reproducibly in both Arabidopsis and potato a performance threshold based on margin statistics beyond which such models showed little structure indicative of either generalizability or further biological interpretability. In a multiclass problem using 25 Arabidopsis genotypes, despite the complicating effects of ecotype background and secondary metabolome perturbations common to several mutations, the ranking of metabolome signals by RF provided scope for deeper interpretability.

摘要

需要强大的算法来处理代谢组学数据的维度。尽管许多算法能实现较高的分类准确率,但它们生成的模型价值有限,除非能证明其具有可重复性且与所研究的生物学问题具有统计学相关性。随机森林(RF)生成模型时无需进行降维或特征选择,其中各个变量会按重要性排序并以明确的方式显示。在通过质谱进行代谢组指纹分析时,每种代谢物可由几个质荷比处的信号表示。利用对样本类别之间预期生化差异的先验理解,我们旨在开发与整体RF模型以及个体潜在解释性信号的重要性相关的有意义指标。对具有强烈表型差异的相关植物基因型进行成对比较表明,稳健的模型不仅具有可重复性,而且结构合理,突出了仅来自少数解释性代谢物的相关质荷比,这些代谢物反映了样本类别之间的生物学差异。还通过使用已知表型越来越相似的样本分组来生成RF模型。尽管分类准确率通常较为合理,但我们在拟南芥和马铃薯中均反复证明了基于边际统计的性能阈值,超过该阈值,此类模型几乎没有显示出表明可推广性或进一步生物学可解释性的结构。在一个使用25种拟南芥基因型的多类问题中,尽管生态型背景和几种突变共有的次生代谢组扰动具有复杂影响,但RF对代谢组信号的排序为更深入的可解释性提供了空间。