• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从分子数据预测相关结果。

Predicting correlated outcomes from molecular data.

机构信息

Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 4362 Esch-sur-Alzette, Luxembourg.

出版信息

Bioinformatics. 2021 Nov 5;37(21):3889-3895. doi: 10.1093/bioinformatics/btab576.

DOI:10.1093/bioinformatics/btab576
PMID:34358294
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10186156/
Abstract

MOTIVATION

Multivariate (multi-target) regression has the potential to outperform univariate (single-target) regression at predicting correlated outcomes, which frequently occur in biomedical and clinical research. Here we implement multivariate lasso and ridge regression using stacked generalization.

RESULTS

Our flexible approach leads to predictive and interpretable models in high-dimensional settings, with a single estimate for each input-output effect. In the simulation, we compare the predictive performance of several state-of-the-art methods for multivariate regression. In the application, we use clinical and genomic data to predict multiple motor and non-motor symptoms in Parkinson's disease patients. We conclude that stacked multivariate regression, with our adaptations, is a competitive method for predicting correlated outcomes.

AVAILABILITY AND IMPLEMENTATION

The R package joinet is available on GitHub (https://github.com/rauschenberger/joinet) and cran (https://cran.r-project.org/package=joinet).

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

多变量(多目标)回归有可能比单变量(单目标)回归在预测相关结果方面表现更优,而相关结果在生物医学和临床研究中经常出现。在这里,我们使用堆叠泛化来实现多元lasso 和 ridge 回归。

结果

我们灵活的方法在高维环境中生成了具有预测能力和可解释性的模型,每个输入-输出效果只有一个单一的估计值。在模拟中,我们比较了多种用于多元回归的最先进方法的预测性能。在应用中,我们使用临床和基因组数据来预测帕金森病患者的多种运动和非运动症状。我们的结论是,经过我们的改编,堆叠多元回归是一种具有竞争力的预测相关结果的方法。

可用性和实现

R 包 joinet 可在 GitHub(https://github.com/rauschenberger/joinet)和 cran(https://cran.r-project.org/package=joinet)上获得。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e82/10186156/d2953db23d01/btab576f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e82/10186156/fb010b8f7b06/btab576f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e82/10186156/f380e8c0526a/btab576f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e82/10186156/51b708e682ff/btab576f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e82/10186156/d2953db23d01/btab576f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e82/10186156/fb010b8f7b06/btab576f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e82/10186156/f380e8c0526a/btab576f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e82/10186156/51b708e682ff/btab576f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e82/10186156/d2953db23d01/btab576f4.jpg

相似文献

1
Predicting correlated outcomes from molecular data.从分子数据预测相关结果。
Bioinformatics. 2021 Nov 5;37(21):3889-3895. doi: 10.1093/bioinformatics/btab576.
2
Predictive and interpretable models via the stacked elastic net.基于堆叠弹性网络的预测和可解释模型。
Bioinformatics. 2021 Aug 4;37(14):2012-2016. doi: 10.1093/bioinformatics/btaa535.
3
An R package VIGoR for joint estimation of multiple linear learners with variational Bayesian inference.一个用于使用变分贝叶斯推断联合估计多个线性学习者的 R 包 VIGoR。
Bioinformatics. 2022 Jun 13;38(12):3306-3309. doi: 10.1093/bioinformatics/btac328.
4
Optimized application of penalized regression methods to diverse genomic data.优化惩罚回归方法在多种基因组数据中的应用。
Bioinformatics. 2011 Dec 15;27(24):3399-406. doi: 10.1093/bioinformatics/btr591.
5
Predicting dichotomised outcomes from high-dimensional data in biomedicine.预测生物医学中高维数据的二分结果。
J Appl Stat. 2023 Jul 26;51(9):1756-1771. doi: 10.1080/02664763.2023.2233057. eCollection 2024.
6
Spathial: an R package for the evolutionary analysis of biological data.Spathial:用于生物数据进化分析的 R 包。
Bioinformatics. 2020 Nov 1;36(17):4664-4667. doi: 10.1093/bioinformatics/btaa273.
7
MorphoTools2: an R package for multivariate morphometric analysis.MorphoTools2:一个用于多元形态计量分析的 R 包。
Bioinformatics. 2022 May 13;38(10):2954-2955. doi: 10.1093/bioinformatics/btac173.
8
Overcoming the inadaptability of sparse group lasso for data with various group structures by stacking.通过堆叠克服具有各种群组结构的数据的稀疏群组套索的不适应性。
Bioinformatics. 2022 Mar 4;38(6):1542-1549. doi: 10.1093/bioinformatics/btab848.
9
RMTL: an R library for multi-task learning.RMTL:一个用于多任务学习的 R 库。
Bioinformatics. 2019 May 15;35(10):1797-1798. doi: 10.1093/bioinformatics/bty831.
10
Penalized regression with multiple sources of prior effects.带有多个先验效应来源的惩罚回归。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad680.

引用本文的文献

1
A decision-analytical perspective on incorporating multiple outcomes in the production of clinical prediction models: defining a taxonomy of risk estimands.关于在临床预测模型构建中纳入多种结局的决策分析视角:定义风险估计量的分类法。
BMC Med. 2025 Mar 6;23(1):142. doi: 10.1186/s12916-025-03978-3.
2
Predicting dichotomised outcomes from high-dimensional data in biomedicine.预测生物医学中高维数据的二分结果。
J Appl Stat. 2023 Jul 26;51(9):1756-1771. doi: 10.1080/02664763.2023.2233057. eCollection 2024.
3
Isoform-level transcriptome-wide association uncovers genetic risk mechanisms for neuropsychiatric disorders in the human brain.

本文引用的文献

1
From Hume to Wuhan: An Epistemological Journey on the Problem of Induction in COVID-19 Machine Learning Models and its Impact Upon Medical Research.从休谟到武汉:关于新冠机器学习模型中归纳问题及其对医学研究影响的认识论之旅
IEEE Access. 2021 Jul 6;9:97243-97250. doi: 10.1109/ACCESS.2021.3095222. eCollection 2021.
2
Clinical prediction models to predict the risk of multiple binary outcomes: a comparison of approaches.预测多种二元结局风险的临床预测模型:方法比较
Stat Med. 2021 Jan 30;40(2):498-517. doi: 10.1002/sim.8787. Epub 2020 Oct 26.
3
Criteria for evaluating risk prediction of multiple outcomes.
在人脑中,异构体水平转录组全基因组关联揭示了神经精神疾病的遗传风险机制。
Nat Genet. 2023 Dec;55(12):2117-2128. doi: 10.1038/s41588-023-01560-2. Epub 2023 Nov 30.
4
Penalized regression with multiple sources of prior effects.带有多个先验效应来源的惩罚回归。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad680.
5
Differential Phospho-Signatures in Blood Cells Identify LRRK2 G2019S Carriers in Parkinson's Disease.血液细胞中的差异磷酸化特征可识别帕金森病中的 LRRK2 G2019S 携带者。
Mov Disord. 2022 May;37(5):1004-1015. doi: 10.1002/mds.28927. Epub 2022 Jan 20.
评估多种结局风险预测的标准。
Stat Methods Med Res. 2020 Dec;29(12):3492-3510. doi: 10.1177/0962280220929039. Epub 2020 Jun 29.
4
Predictive and interpretable models via the stacked elastic net.基于堆叠弹性网络的预测和可解释模型。
Bioinformatics. 2021 Aug 4;37(14):2012-2016. doi: 10.1093/bioinformatics/btaa535.
5
Simultaneous prediction of multiple outcomes using revised stacking algorithms.使用修订后的堆叠算法同时预测多个结果。
Bioinformatics. 2020 Jan 1;36(1):65-72. doi: 10.1093/bioinformatics/btz531.
6
Dynamic multi-outcome prediction after injury: Applying adaptive machine learning for precision medicine in trauma.损伤后动态多结局预测:在创伤中应用自适应机器学习进行精准医学。
PLoS One. 2019 Apr 10;14(4):e0213836. doi: 10.1371/journal.pone.0213836. eCollection 2019.
7
Using simulation studies to evaluate statistical methods.运用模拟研究评估统计方法。
Stat Med. 2019 May 20;38(11):2074-2102. doi: 10.1002/sim.8086. Epub 2019 Jan 16.
8
Sample size considerations and predictive performance of multinomial logistic prediction models.多分类逻辑回归预测模型的样本量考虑因素和预测性能。
Stat Med. 2019 Apr 30;38(9):1601-1619. doi: 10.1002/sim.8063. Epub 2019 Jan 6.
9
RMTL: an R library for multi-task learning.RMTL:一个用于多任务学习的 R 库。
Bioinformatics. 2019 May 15;35(10):1797-1798. doi: 10.1093/bioinformatics/bty831.
10
Using self-report surveys at the beginning of service to develop multi-outcome risk models for new soldiers in the U.S. Army.利用新兵入伍初期的自我报告调查数据,为美国陆军新兵开发多结局风险预测模型。
Psychol Med. 2017 Oct;47(13):2275-2287. doi: 10.1017/S003329171700071X. Epub 2017 Apr 4.