Reference point insensitive molecular data analysis.

Affiliations

Statistical Bioinformatics, Institute of Functional Genomics, University of Regensburg, Regensburg, Germany.

Institute of Functional Genomics, University of Regensburg, Regensburg, Germany.

Publication information

Bioinformatics. 2017 Jan 15;33(2):219-226. doi: 10.1093/bioinformatics/btw598. Epub 2016 Sep 15.

DOI:10.1093/bioinformatics/btw598
PMID:27634945
Abstract

MOTIVATION

In biomedicine, every molecular measurement is relative to a reference point, like a fixed aliquot of RNA extracted from a tissue, a defined number of blood cells, or a defined volume of biofluid. Reference points are often chosen for practical reasons. For example, we might want to assess the metabolome of a diseased organ but can only measure metabolites in blood or urine. In this case, the observable data only indirectly reflects the disease state. The statistical implications of these discrepancies in reference points have not yet been discussed.

RESULTS

Here, we show that reference point discrepancies compromise the performance of regression models like the LASSO. As an alternative, we suggest zero-sum regression for a reference point insensitive analysis. We show that zero-sum regression is superior to the LASSO in case of a poor choice of reference point both in simulations and in an application that integrates intestinal microbiome analysis with metabolomics. Moreover, we describe a novel coordinate descent based algorithm to fit zero-sum elastic nets.
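The abstract does not spell out the zero-sum constraint, so the following is only a minimal sketch of the idea as we understand it: on log-transformed data, a change of reference point that rescales every raw feature of a sample by the same factor adds one constant to all of that sample's log-scale features, and a linear model whose coefficients sum to zero is unaffected by such a shift. The coefficient values, the `project_to_zero_sum` helper, and the shift size are hypothetical and chosen for illustration; the projection is just a convenient way to obtain a zero-sum vector and is not the coordinate descent algorithm described by the authors.

```python
import random


def predict(intercept, coefs, log_features):
    """Linear prediction from log-scale features."""
    return intercept + sum(b * x for b, x in zip(coefs, log_features))


def project_to_zero_sum(coefs):
    """Subtract the mean so the coefficients sum to zero.

    Illustrative only: this is NOT how zero-sum regression is fitted.
    """
    mean = sum(coefs) / len(coefs)
    return [b - mean for b in coefs]


random.seed(0)
log_x = [random.gauss(0.0, 1.0) for _ in range(5)]  # one sample, log scale

unconstrained = [0.8, -0.3, 0.5, 0.0, 0.1]  # hypothetical coefficients, sum = 1.1
zero_sum = project_to_zero_sum(unconstrained)

# A different reference point multiplies all raw features by one factor,
# i.e. adds the same constant to every log-scale feature of the sample.
shift = 2.0
log_x_shifted = [x + shift for x in log_x]

delta_unconstrained = (
    predict(0.0, unconstrained, log_x_shifted) - predict(0.0, unconstrained, log_x)
)
delta_zero_sum = (
    predict(0.0, zero_sum, log_x_shifted) - predict(0.0, zero_sum, log_x)
)

print("prediction change, unconstrained:", delta_unconstrained)
print("prediction change, zero-sum:     ", delta_zero_sum)
```

The unconstrained prediction moves by the shift times the sum of its coefficients, while the zero-sum prediction stays the same up to floating-point rounding. This is the sense in which a zero-sum model is insensitive to the choice of reference point.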

AVAILABILITY AND IMPLEMENTATION

The R package "zeroSum" can be downloaded at https://github.com/rehbergT/zeroSum. Moreover, we provide all R scripts and data used to produce the results of this manuscript as Supplementary Material.

CONTACT

Michael.Altenbuchinger@ukr.de, Thorsten.Rehberg@ukr.de and Rainer.Spang@ukr.de

SUPPLEMENTARY INFORMATION

Supplementary material is available at Bioinformatics online.
