• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MinLinMo:一种用于变量选择和线性模型预测的极简方法。

MinLinMo: a minimalist approach to variable selection and linear model prediction.

作者信息

Bohlin Jon, Håberg Siri E, Magnus Per, Gjessing Håkon K

机构信息

Department of Method Development and Analytics, Section for modeling and bioinformatics, Norwegian Institute of Public Health, Oslo, Norway.

Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway.

出版信息

BMC Bioinformatics. 2024 Dec 18;25(1):380. doi: 10.1186/s12859-024-06000-4.

DOI:10.1186/s12859-024-06000-4
PMID:39695947
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11654326/
Abstract

Generating prediction models from high dimensional data often result in large models with many predictors. Causal inference for such models can therefore be difficult or even impossible in practice. The stand-alone software package MinLinMo emphasizes small linear prediction models over highest possible predictability with a particular focus on including variables correlated with the outcome, minimal memory usage and speed. MinLinMo is demonstrated on large epigenetic datasets with prediction models for chronological age, gestational age, and birth weight comprising, respectively, 15, 14 and 10 predictors. The parsimonious MinLinMo models perform comparably to established prediction models requiring hundreds of predictors.

摘要

从高维数据生成预测模型通常会得到包含许多预测变量的大型模型。因此,对此类模型进行因果推断在实际中可能很困难甚至无法实现。独立软件包MinLinMo强调小型线性预测模型,而非追求尽可能高的可预测性,特别注重纳入与结果相关的变量、最小化内存使用和速度。在大型表观遗传数据集上展示了MinLinMo,其针对 chronological age(实足年龄)、gestational age(胎龄)和 birth weight(出生体重)的预测模型分别包含15、14和10个预测变量。简约的MinLinMo模型与需要数百个预测变量的既定预测模型表现相当。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9408/11654326/b4db4ff894f7/12859_2024_6000_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9408/11654326/567e03d113fc/12859_2024_6000_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9408/11654326/f623b01ef54f/12859_2024_6000_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9408/11654326/f64f975d2113/12859_2024_6000_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9408/11654326/dfe2572c6429/12859_2024_6000_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9408/11654326/b4db4ff894f7/12859_2024_6000_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9408/11654326/567e03d113fc/12859_2024_6000_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9408/11654326/f623b01ef54f/12859_2024_6000_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9408/11654326/f64f975d2113/12859_2024_6000_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9408/11654326/dfe2572c6429/12859_2024_6000_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9408/11654326/b4db4ff894f7/12859_2024_6000_Fig5_HTML.jpg

相似文献

1
MinLinMo: a minimalist approach to variable selection and linear model prediction.MinLinMo:一种用于变量选择和线性模型预测的极简方法。
BMC Bioinformatics. 2024 Dec 18;25(1):380. doi: 10.1186/s12859-024-06000-4.
2
ecpc: an R-package for generic co-data models for high-dimensional prediction.ecpc:用于高维预测的通用协数据模型的 R 包。
BMC Bioinformatics. 2023 Apr 26;24(1):172. doi: 10.1186/s12859-023-05289-x.
3
Constructing small for gestational age prediction models: A retrospective machine learning study.构建小于胎龄儿预测模型:一项回顾性机器学习研究。
Eur J Obstet Gynecol Reprod Biol. 2025 Feb;305:48-55. doi: 10.1016/j.ejogrb.2024.11.022. Epub 2024 Nov 26.
4
Predicting long-term neurocognitive outcome after pediatric intensive care unit admission for bronchiolitis-preliminary exploration of the potential of machine learning.预测儿童重症监护病房支气管肺炎患儿的长期神经认知结局:机器学习的潜在初步探索。
Eur J Pediatr. 2024 Jan;183(1):471-482. doi: 10.1007/s00431-023-05307-3. Epub 2023 Nov 6.
5
Simultaneous SNP selection and adjustment for population structure in high dimensional prediction models.高维预测模型中同时进行 SNP 选择和群体结构调整。
PLoS Genet. 2020 May 4;16(5):e1008766. doi: 10.1371/journal.pgen.1008766. eCollection 2020 May.
6
Random generalized linear model: a highly accurate and interpretable ensemble predictor.随机广义线性模型:一种高度准确且可解释的集成预测器。
BMC Bioinformatics. 2013 Jan 16;14:5. doi: 10.1186/1471-2105-14-5.
7
Deviance residuals-based sparse PLS and sparse kernel PLS regression for censored data.基于偏差残差的稀疏偏最小二乘和稀疏核偏最小二乘回归用于删失数据。
Bioinformatics. 2015 Feb 1;31(3):397-404. doi: 10.1093/bioinformatics/btu660. Epub 2014 Oct 6.
8
Minimalist ensemble algorithms for genome-wide protein localization prediction.基因组范围内蛋白质定位预测的简约集成算法。
BMC Bioinformatics. 2012 Jul 3;13:157. doi: 10.1186/1471-2105-13-157.
9
Evaluation of marker selection methods and statistical models for chronological age prediction based on DNA methylation.基于DNA甲基化的年龄预测中标记选择方法和统计模型的评估
Leg Med (Tokyo). 2020 Nov;47:101744. doi: 10.1016/j.legalmed.2020.101744. Epub 2020 Jul 1.
10
Development and validation of prediction models for fetal growth restriction and birthweight: an individual participant data meta-analysis.胎儿生长受限和出生体重预测模型的建立与验证:个体参与者数据的荟萃分析。
Health Technol Assess. 2024 Aug;28(47):1-119. doi: 10.3310/DABW4814.

本文引用的文献

1
DNA methylation rates scale with maximum lifespan across mammals.DNA 甲基化率与哺乳动物的最长寿命呈正相关。
Nat Aging. 2024 Jan;4(1):27-32. doi: 10.1038/s43587-023-00535-6. Epub 2023 Dec 4.
2
Accurate age prediction from blood using a small set of DNA methylation sites and a cohort-based machine learning algorithm.使用一小部分 DNA 甲基化位点和基于队列的机器学习算法从血液中准确预测年龄。
Cell Rep Methods. 2023 Sep 25;3(9):100567. doi: 10.1016/j.crmeth.2023.100567. Epub 2023 Aug 28.
3
Genetic architecture of adiposity measures among Asians: Findings from GWAS.
亚洲人肥胖指标的遗传结构:全基因组关联研究结果
Ann Hum Genet. 2023 Nov;87(6):255-273. doi: 10.1111/ahg.12526. Epub 2023 Sep 6.
4
Identification of ATP6V0A4 as a potential biomarker in renal cell carcinoma using integrated bioinformatics analysis.通过综合生物信息学分析鉴定ATP6V0A4作为肾细胞癌的潜在生物标志物
Oncol Lett. 2023 Jul 11;26(3):366. doi: 10.3892/ol.2023.13952. eCollection 2023 Sep.
5
Stability selection enhances feature selection and enables accurate prediction of gestational age using only five DNA methylation sites.稳定性选择增强了特征选择,并仅使用五个 DNA 甲基化位点就能准确预测胎龄。
Clin Epigenetics. 2023 Jul 13;15(1):114. doi: 10.1186/s13148-023-01528-3.
6
Central nervous system manifestations of LRBA deficiency: case report of two siblings and literature review.LRBA 缺乏症的中枢神经系统表现:两例同胞兄妹病例报告及文献复习。
BMC Pediatr. 2023 Jul 13;23(1):353. doi: 10.1186/s12887-023-04182-z.
7
CimpleG: finding simple CpG methylation signatures.CimpleG:寻找简单的 CpG 甲基化特征。
Genome Biol. 2023 Jul 10;24(1):161. doi: 10.1186/s13059-023-03000-0.
8
Getting the chronological age out of DNA: using insights of age-dependent DNA methylation for forensic DNA applications.从 DNA 中获取实际年龄:利用与年龄相关的 DNA 甲基化的见解应用于法医 DNA 分析。
Genes Genomics. 2023 Oct;45(10):1239-1261. doi: 10.1007/s13258-023-01392-8. Epub 2023 May 30.
9
Evidence that direct inhibition of transcription factor binding is the prevailing mode of gene and repeat repression by DNA methylation.DNA 甲基化通过直接抑制转录因子结合来普遍抑制基因和重复序列。
Nat Genet. 2022 Dec;54(12):1895-1906. doi: 10.1038/s41588-022-01241-6. Epub 2022 Dec 5.
10
Tagged actin mRNA dysregulation in IGF2BP1[Formula: see text] mice.IGF2BP1[Formula: see text] 小鼠中标记肌动蛋白 mRNA 失调。
Proc Natl Acad Sci U S A. 2022 Sep 13;119(37):e2208465119. doi: 10.1073/pnas.2208465119. Epub 2022 Sep 6.