• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于大数据的组学研究中的乐观偏差校正:模拟数据上惩罚方法的评估。

Optimism Bias Correction in Omics Studies with Big Data: Assessment of Penalized Methods on Simulated Data.

机构信息

1 Service de Biostatistique-Bioinformatique, Pôle Santé Publique, Hospices Civils de Lyon, Lyon, France.

2 Université de Lyon, Lyon, France.

出版信息

OMICS. 2019 Apr;23(4):207-213. doi: 10.1089/omi.2018.0191. Epub 2019 Feb 22.

DOI:10.1089/omi.2018.0191
PMID:30794050
Abstract

Big Data generated by omics technologies require simultaneous analyses of large numbers of variables. This leads to complex model selection and parameter estimates that show optimism bias. This study on simulated data sets examined optimism-bias correction by penalty regression methods in case-control studies that involve clinical and omics variables. Least absolute shrinkage and selection operator (LASSO)-based methods (LASSO-penalized logistic regression, adaptive LASSO, and regularized LASSO for selection + ridge regression) were evaluated using power, the false positive rate (FPR), false discovery rate (FDR), and by estimated versus theoretical parameter comparisons. The "ordinary" LASSO overcorrects the optimism bias. The adaptive LASSO with LASSO estimation of the weights was unable to provide a sufficient correction. Importantly, the adaptive LASSO with ridge estimation of the weights showed the best parameter estimation. The regularized LASSO selection showed a slight optimism bias that decreased with the increase in the training set size. The optimism bias decreased with the increase of the number of variables selected among truly differentially expressed variables; however, power, FPR, and FDR were correlated. A compromise between model selection and estimation accuracy should be found. These results might prove useful because Big Data analyses are becoming commonplace in omics/multiomics studies in integrative biology, precision medicine, and planetary health.

摘要

组学技术产生的大数据需要同时分析大量变量。这导致了复杂的模型选择和参数估计,表现出乐观偏差。本研究通过模拟数据集,在涉及临床和组学变量的病例对照研究中,检查了惩罚回归方法对乐观偏差的修正。基于最小绝对值收缩和选择算子(LASSO)的方法(LASSO 惩罚逻辑回归、自适应 LASSO 和用于选择的正则化 LASSO +岭回归)的评估指标包括功效、假阳性率(FPR)、假发现率(FDR)以及估计参数与理论参数的比较。“普通”的 LASSO 过度校正了乐观偏差。LASSO 权重的自适应 LASSO 无法提供足够的修正。重要的是,LASSO 权重的自适应 LASSO 与岭回归估计显示了最佳的参数估计。正则化 LASSO 选择表现出轻微的乐观偏差,随着训练集大小的增加而减小。随着真正差异表达变量中选择的变量数量的增加,乐观偏差减小;然而,功效、FPR 和 FDR 是相关的。应该在模型选择和估计准确性之间找到一个折衷。这些结果可能很有用,因为大数据分析在综合生物学、精准医学和行星健康等领域的组学/多组学研究中变得越来越普遍。

相似文献

1
Optimism Bias Correction in Omics Studies with Big Data: Assessment of Penalized Methods on Simulated Data.基于大数据的组学研究中的乐观偏差校正:模拟数据上惩罚方法的评估。
OMICS. 2019 Apr;23(4):207-213. doi: 10.1089/omi.2018.0191. Epub 2019 Feb 22.
2
Accounting for grouped predictor variables or pathways in high-dimensional penalized Cox regression models.在高维惩罚 Cox 回归模型中考虑分组预测变量或途径。
BMC Bioinformatics. 2020 Jul 2;21(1):277. doi: 10.1186/s12859-020-03618-y.
3
IPF-LASSO: Integrative -Penalized Regression with Penalty Factors for Prediction Based on Multi-Omics Data.IPF-LASSO:基于多组学数据的带惩罚因子的整合惩罚回归用于预测
Comput Math Methods Med. 2017;2017:7691937. doi: 10.1155/2017/7691937. Epub 2017 May 4.
4
High-dimensional Cox models: the choice of penalty as part of the model building process.高维Cox模型:作为模型构建过程一部分的惩罚项选择
Biom J. 2010 Feb;52(1):50-69. doi: 10.1002/bimj.200900064.
5
6
Re-evaluation of the comparative effectiveness of bootstrap-based optimism correction methods in the development of multivariable clinical prediction models.基于 Bootstrap 的校正方法在多变量临床预测模型构建中的校正效能再评价。
BMC Med Res Methodol. 2021 Jan 7;21(1):9. doi: 10.1186/s12874-020-01201-w.
7
Favoring the hierarchical constraint in penalized survival models for randomized trials in precision medicine.在精准医学中,为了随机试验的惩罚生存模型,赞成层次约束。
BMC Bioinformatics. 2023 Mar 16;24(1):96. doi: 10.1186/s12859-023-05162-x.
8
Empirical extensions of the lasso penalty to reduce the false discovery rate in high-dimensional Cox regression models.套索惩罚的经验性扩展以降低高维Cox回归模型中的错误发现率
Stat Med. 2016 Jul 10;35(15):2561-73. doi: 10.1002/sim.6927. Epub 2016 Mar 10.
9
Regularized estimation of large-scale gene association networks using graphical Gaussian models.基于图式高斯模型的大规模基因关联网络正则化估计
BMC Bioinformatics. 2009 Nov 24;10:384. doi: 10.1186/1471-2105-10-384.
10
Identification of clinically relevant features in hypertensive patients using penalized regression: a case study of cardiovascular events.使用惩罚回归识别高血压患者的临床相关特征:心血管事件的案例研究。
Med Biol Eng Comput. 2019 Sep;57(9):2011-2026. doi: 10.1007/s11517-019-02007-9. Epub 2019 Jul 25.

引用本文的文献

1
Predicting venous thromboembolism in hospitalized trauma patients: a combination of the Caprini score and data-driven machine learning model.预测住院创伤患者的静脉血栓栓塞症:卡普里尼评分与数据驱动的机器学习模型相结合。
BMC Emerg Med. 2021 May 10;21(1):60. doi: 10.1186/s12873-021-00447-x.