Structured sparsity regularization for analyzing high-dimensional omics data.

Affiliation

INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Lisboa, Portugal.

Publication information

Brief Bioinform. 2021 Jan 18;22(1):77-87. doi: 10.1093/bib/bbaa122.

DOI:10.1093/bib/bbaa122
PMID:32597465
Abstract

The development of new molecular and cell technologies is having a significant impact on the quantity of data generated nowadays. The growth of omics databases is creating a considerable potential for knowledge discovery and, concomitantly, is bringing new challenges to statistical learning and computational biology for health applications. Indeed, the high dimensionality of these data may hamper the use of traditional regression methods and parameter estimation algorithms due to the intrinsic non-identifiability of the inherent optimization problem. Regularized optimization has been rising as a promising and useful strategy to solve these ill-posed problems by imposing additional constraints in the solution parameter space. In particular, the field of statistical learning with sparsity has been significantly contributing to building accurate models that also bring interpretability to biological observations and phenomena. Beyond the now-classic elastic net, one of the best-known methods that combine lasso with ridge penalizations, we briefly overview recent literature on structured regularizers and penalty functions that have been applied in biomedical data to build parsimonious models in a variety of underlying contexts, from survival to generalized linear models. These methods include functions of $\ell _k$-norms and network-based penalties that take into account the inherent relationships between the features. The successful application to omics data illustrates the potential of sparse structured regularization for identifying disease's molecular signatures and for creating high-performance clinical decision support systems towards more personalized healthcare. Supplementary information: Supplementary data are available at Briefings in Bioinformatics online.
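The two penalty families named in the abstract can be illustrated with a minimal sketch. It assumes the common glmnet-style parametrization of the elastic net, $\lambda\,(\alpha\|\beta\|_1 + \tfrac{1-\alpha}{2}\|\beta\|_2^2)$, and a graph-Laplacian quadratic form as one simple example of a network-based penalty. The function names, the toy coefficient vector, and the toy adjacency matrix are hypothetical and are not taken from the article.

```python
import numpy as np


def elastic_net_penalty(beta, lam=1.0, alpha=0.5):
    """Elastic net penalty: lam * (alpha * ||beta||_1 + (1 - alpha) / 2 * ||beta||_2^2).

    alpha = 1 gives the lasso penalty, alpha = 0 gives the ridge penalty.
    (Assumed parametrization; other conventions exist.)
    """
    beta = np.asarray(beta, dtype=float)
    l1 = np.sum(np.abs(beta))
    l2_squared = np.sum(beta ** 2)
    return lam * (alpha * l1 + 0.5 * (1.0 - alpha) * l2_squared)


def graph_laplacian(adjacency):
    """Unnormalized graph Laplacian L = D - A for a symmetric adjacency matrix A."""
    adjacency = np.asarray(adjacency, dtype=float)
    degree = np.diag(adjacency.sum(axis=1))
    return degree - adjacency


def network_penalty(beta, adjacency, lam=1.0):
    """Laplacian penalty lam * beta^T L beta.

    For a symmetric adjacency matrix this equals
    lam * sum over edges (i, j) of a_ij * (beta_i - beta_j)^2,
    so connected features are encouraged to have similar coefficients.
    """
    beta = np.asarray(beta, dtype=float)
    return lam * float(beta @ graph_laplacian(adjacency) @ beta)


# Toy example: four features, where features 0-1 and 1-2 are linked in a
# hypothetical feature network and feature 3 is isolated.
beta = np.array([1.0, 1.0, 0.0, -2.0])
adjacency = np.array([
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
])

en_value = elastic_net_penalty(beta, lam=1.0, alpha=0.5)
net_value = network_penalty(beta, adjacency, lam=1.0)
print(en_value, net_value)
```

In this toy setting the network term only charges the edge between features 1 and 2, whose coefficients differ; the isolated feature 3 contributes nothing to it, however large its coefficient, while the elastic net term penalizes every nonzero coefficient regardless of the graph.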

Similar articles

1
Structured sparsity regularization for analyzing high-dimensional omics data.
Brief Bioinform. 2021 Jan 18;22(1):77-87. doi: 10.1093/bib/bbaa122.
2
IPF-LASSO: Integrative L1-Penalized Regression with Penalty Factors for Prediction Based on Multi-Omics Data.
Comput Math Methods Med. 2017;2017:7691937. doi: 10.1155/2017/7691937. Epub 2017 May 4.
3
eNetXplorer: an R package for the quantitative exploration of elastic net families for generalized linear models.
BMC Bioinformatics. 2019 Apr 16;20(1):189. doi: 10.1186/s12859-019-2778-5.
4
Network-Regularized Sparse Logistic Regression Models for Clinical Risk Prediction and Biomarker Discovery.
IEEE/ACM Trans Comput Biol Bioinform. 2018 May-Jun;15(3):944-953. doi: 10.1109/TCBB.2016.2640303. Epub 2016 Dec 15.
5
New Machine Learning Applications to Accelerate Personalized Medicine in Breast Cancer: Rise of the Support Vector Machines.
OMICS. 2020 May;24(5):241-246. doi: 10.1089/omi.2020.0001. Epub 2020 Mar 31.
6
Structured Sparse Principal Components Analysis With the TV-Elastic Net Penalty.
IEEE Trans Med Imaging. 2018 Feb;37(2):396-407. doi: 10.1109/TMI.2017.2749140. Epub 2017 Sep 4.
7
Continuation of Nesterov's Smoothing for Regression With Structured Sparsity in High-Dimensional Neuroimaging.
IEEE Trans Med Imaging. 2018 Nov;37(11):2403-2413. doi: 10.1109/TMI.2018.2829802. Epub 2018 Apr 24.
8
DegreeCox - a network-based regularization method for survival analysis.
BMC Bioinformatics. 2016 Dec 13;17(Suppl 16):449. doi: 10.1186/s12859-016-1310-4.
9
Regularized estimation of large-scale gene association networks using graphical Gaussian models.
BMC Bioinformatics. 2009 Nov 24;10:384. doi: 10.1186/1471-2105-10-384.
10
Spatio Temporal EEG Source Imaging with the Hierarchical Bayesian Elastic Net and Elitist Lasso Models.
Front Neurosci. 2017 Nov 16;11:635. doi: 10.3389/fnins.2017.00635. eCollection 2017.

Cited by

1
Leveraging external information by guided adaptive shrinkage to improve variable selection in high-dimensional regression settings.
Int J Biostat. 2025 Sep 8. doi: 10.1515/ijb-2024-0108.
2
Identification of MEG3 and MAPK3 as potential therapeutic targets for osteoarthritis through multiomics integration and machine learning.
Sci Rep. 2025 Jul 2;15(1):23240. doi: 10.1038/s41598-025-06175-7.
3
Clinical effectiveness of fecal microbial transplantation for metabolic syndrome: Advances in clinical efficacy and multi-omics research.
Curr Res Microb Sci. 2025 Jun 5;9:100415. doi: 10.1016/j.crmicr.2025.100415. eCollection 2025.
4
Machine learning and multi-omics integration: advancing cardiovascular translational research and clinical practice.
J Transl Med. 2025 Apr 2;23(1):388. doi: 10.1186/s12967-025-06425-2.
5
Enhanced fibrotic potential of COL1A1NR4A1 fibroblasts in ischemic heart revealed by transcriptional dynamics heterogeneity analysis at both bulk and single-cell levels.
Front Cardiovasc Med. 2025 Jan 6;11:1460813. doi: 10.3389/fcvm.2024.1460813. eCollection 2024.
6
Prognostic prediction models for postoperative patients with stage I to III colorectal cancer based on machine learning.
World J Gastrointest Oncol. 2024 Dec 15;16(12):4597-4613. doi: 10.4251/wjgo.v16.i12.4597.
7
Outcome prediction comparison of ischaemic areas' radiomics in acute anterior circulation non-lacunar infarction.
Brain Commun. 2024 Nov 15;6(6):fcae393. doi: 10.1093/braincomms/fcae393. eCollection 2024.
8
Inferring Diagnostic and Prognostic Gene Expression Signatures Across WHO Glioma Classifications: A Network-Based Approach.
Bioinform Biol Insights. 2024 Sep 15;18:11779322241271535. doi: 10.1177/11779322241271535. eCollection 2024.
9
Identifying novel circadian rhythm biomarkers for diagnosis and prognosis of melanoma by an integrated bioinformatics and machine learning approach.
Aging (Albany NY). 2024 Jun 20;16(16):11824-11842. doi: 10.18632/aging.205961.
10
Machine learning models predicts risk of proliferative lupus nephritis.
Front Immunol. 2024 Jun 11;15:1413569. doi: 10.3389/fimmu.2024.1413569. eCollection 2024.