• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用高维数据集进行有序响应预测的 L1 惩罚连续比模型。

L1 penalized continuation ratio models for ordinal response prediction using high-dimensional datasets.

机构信息

Department of Biostatistics, Virginia Commonwealth University, Richmond, VA, USA.

出版信息

Stat Med. 2012 Jun 30;31(14):1464-74. doi: 10.1002/sim.4484. Epub 2012 Feb 23.

DOI:10.1002/sim.4484
PMID:22359384
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3718008/
Abstract

Health status and outcomes are frequently measured on an ordinal scale. For high-throughput genomic datasets, the common approach to analyzing ordinal response data has been to break the problem into one or more dichotomous response analyses. This dichotomous response approach does not make use of all available data and therefore leads to loss of power and increases the number of type I errors. Herein we describe an innovative frequentist approach that combines two statistical techniques, L(1) penalization and continuation ratio models, for modeling an ordinal response using gene expression microarray data. We conducted a simulation study to assess the performance of two computational approaches and two model selection criteria for fitting frequentist L(1) penalized continuation ratio models. Moreover, we empirically compared the approaches using three application datasets, each of which seeks to classify an ordinal class using microarray gene expression data as the predictor variables. We conclude that the L(1) penalized constrained continuation ratio model is a useful approach for modeling an ordinal response for datasets where the number of covariates (p) exceeds the sample size (n) and the decision of whether to use Akaike Information Criterion (AIC) or Bayesian Information Criterion (BIC) for selecting the final model should depend upon the similarities between the pathologies underlying the disease states to be classified.

摘要

健康状况和结果通常在有序尺度上进行测量。对于高通量基因组数据集,分析有序响应数据的常见方法是将问题分解为一个或多个二项式响应分析。这种二项式响应方法没有利用所有可用的数据,因此会导致功率损失并增加 I 型错误的数量。在此,我们描述了一种创新的频率主义方法,该方法结合了两种统计技术,L(1)惩罚和连续比模型,用于使用基因表达微阵列数据对有序响应进行建模。我们进行了一项模拟研究,以评估两种计算方法和两种模型选择标准拟合频率主义 L(1)惩罚连续比模型的性能。此外,我们使用三个应用数据集经验比较了这些方法,每个数据集都试图使用微阵列基因表达数据作为预测变量对有序类别进行分类。我们得出结论,L(1)惩罚约束连续比模型是一种有用的方法,用于对数据集进行建模,其中协变量的数量(p)超过样本量(n),并且选择最终模型时使用 Akaike 信息准则(AIC)还是贝叶斯信息准则(BIC)的决策应取决于待分类疾病状态的病理学之间的相似性。

相似文献

1
L1 penalized continuation ratio models for ordinal response prediction using high-dimensional datasets.使用高维数据集进行有序响应预测的 L1 惩罚连续比模型。
Stat Med. 2012 Jun 30;31(14):1464-74. doi: 10.1002/sim.4484. Epub 2012 Feb 23.
2
Bayesian penalized cumulative logit model for high-dimensional data with an ordinal response.贝叶斯惩罚累积对数模型在有序响应的高维数据中的应用。
Stat Med. 2021 Mar 15;40(6):1453-1481. doi: 10.1002/sim.8851. Epub 2020 Dec 18.
3
Penalized Bayesian forward continuation ratio model with application to high-dimensional data with discrete survival outcomes.惩罚贝叶斯向前连续比模型及其在高维离散生存数据中的应用。
PLoS One. 2024 Mar 28;19(3):e0300638. doi: 10.1371/journal.pone.0300638. eCollection 2024.
4
Development of a clinical prediction model for an ordinal outcome: the World Health Organization Multicentre Study of Clinical Signs and Etiological agents of Pneumonia, Sepsis and Meningitis in Young Infants. WHO/ARI Young Infant Multicentre Study Group.一种针对有序结局的临床预测模型的开发:世界卫生组织关于小婴儿肺炎、败血症和脑膜炎临床体征及病原体的多中心研究。世界卫生组织/急性呼吸道感染小婴儿多中心研究小组
Stat Med. 1998 Apr 30;17(8):909-44. doi: 10.1002/(sici)1097-0258(19980430)17:8<909::aid-sim753>3.0.co;2-o.
5
High-dimensional variable selection for ordinal outcomes with error control.具有误差控制的有序结局的高维变量选择。
Brief Bioinform. 2021 Jan 18;22(1):334-345. doi: 10.1093/bib/bbaa007.
6
Analyzing large datasets with bootstrap penalization.使用自助法惩罚分析大型数据集。
Biom J. 2017 Mar;59(2):358-376. doi: 10.1002/bimj.201600052. Epub 2016 Nov 21.
7
Regularization method for predicting an ordinal response using longitudinal high-dimensional genomic data.使用纵向高维基因组数据预测有序响应的正则化方法。
Stat Appl Genet Mol Biol. 2015 Feb;14(1):93-111. doi: 10.1515/sagmb-2014-0004.
8
Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes.逻辑随机效应回归模型:用于二进制和有序结果的统计软件包的比较。
BMC Med Res Methodol. 2011 May 23;11:77. doi: 10.1186/1471-2288-11-77.
9
A new type of generalized information criterion for regularization parameter selection in penalized regression with application to treatment process data.一种新的广义信息准则,用于惩罚回归中正则化参数选择,应用于处理过程数据。
J Biopharm Stat. 2024 Jul 3;34(4):488-512. doi: 10.1080/10543406.2023.2228399. Epub 2023 Jul 17.
10
ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R.有序贝叶斯:使用R语言对高维数据拟合有序贝叶斯回归模型
Stats (Basel). 2022 Jun;5(2):371-384. doi: 10.3390/stats5020021. Epub 2022 Apr 15.

引用本文的文献

1
Association Between Racial Equity Plans and Political and Sociodemographic Factors in US Cities.美国城市种族平等计划与政治及社会人口因素之间的关联
J Racial Ethn Health Disparities. 2025 Jul 21. doi: 10.1007/s40615-025-02542-2.
2
Metabolic Disturbances in a Mouse Model of MPTP/Probenecid-Induced Parkinson's Disease: Evaluation Using Liquid Chromatography-Mass Spectrometry.MPTP/丙磺舒诱导的帕金森病小鼠模型中的代谢紊乱:使用液相色谱-质谱联用技术进行评估
Neuropsychiatr Dis Treat. 2024 Aug 27;20:1629-1639. doi: 10.2147/NDT.S471744. eCollection 2024.
3
Mechanistic insights into the interactions between cancer drivers and the tumour immune microenvironment.癌症驱动因子与肿瘤免疫微环境相互作用的机制见解。
Genome Med. 2023 Jun 5;15(1):40. doi: 10.1186/s13073-023-01197-0.
4
psupertime: supervised pseudotime analysis for time-series single-cell RNA-seq data.psupertime:用于时间序列单细胞 RNA-seq 数据的有监督伪时间分析。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i290-i298. doi: 10.1093/bioinformatics/btac227.
5
ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R.有序贝叶斯:使用R语言对高维数据拟合有序贝叶斯回归模型
Stats (Basel). 2022 Jun;5(2):371-384. doi: 10.3390/stats5020021. Epub 2022 Apr 15.
6
Machine learning-assisted decision-support models to better predict patients with calculous pyonephrosis.机器学习辅助决策支持模型以更好地预测结石性脓肾患者。
Transl Androl Urol. 2021 Feb;10(2):710-723. doi: 10.21037/tau-20-1208.
7
Bayesian penalized cumulative logit model for high-dimensional data with an ordinal response.贝叶斯惩罚累积对数模型在有序响应的高维数据中的应用。
Stat Med. 2021 Mar 15;40(6):1453-1481. doi: 10.1002/sim.8851. Epub 2020 Dec 18.
8
Respiratory Effects of Exposure to Aerosol From the Candidate Modified-Risk Tobacco Product THS 2.2 in an 18-Month Systems Toxicology Study With A/J Mice.在一项为期 18 个月的 A/J 小鼠系统毒理学研究中,评估候选改良风险烟草产品 THS 2.2 气溶胶暴露的呼吸效应。
Toxicol Sci. 2020 Nov 1;178(1):138-158. doi: 10.1093/toxsci/kfaa132.
9
Predicting Ewe Body Condition Score Using Lifetime Liveweight and Liveweight Change, and Previous Body Condition Score Record.利用终生体重、体重变化及先前体况评分记录预测母羊体况评分
Animals (Basel). 2020 Jul 13;10(7):1182. doi: 10.3390/ani10071182.
10
High-dimensional variable selection for ordinal outcomes with error control.具有误差控制的有序结局的高维变量选择。
Brief Bioinform. 2021 Jan 18;22(1):334-345. doi: 10.1093/bib/bbaa007.

本文引用的文献

1
Increased expression of lipocalin-type-prostaglandin D synthase in ulcerative colitis and exacerbating role in murine colitis.脂氧素 A4 合酶在溃疡性结肠炎中的表达增加及其在小鼠结肠炎中的加重作用。
Am J Physiol Gastrointest Liver Physiol. 2011 Mar;300(3):G401-8. doi: 10.1152/ajpgi.00351.2010. Epub 2010 Dec 16.
2
Regularization Paths for Generalized Linear Models via Coordinate Descent.基于坐标下降法的广义线性模型正则化路径
J Stat Softw. 2010;33(1):1-22.
3
Neutrophil gelatinase-associated lipocalin levels in patients with crohn disease undergoing treatment with infliximab.克罗恩病患者接受英夫利昔单抗治疗时中性粒细胞明胶酶相关载脂蛋白水平的变化。
J Investig Med. 2010 Mar;58(3):569-71. doi: 10.231/JIM.0b013e3181ccc20c.
4
A Crohn's disease-associated NOD2 mutation suppresses transcription of human IL10 by inhibiting activity of the nuclear ribonucleoprotein hnRNP-A1.一种与克罗恩病相关的NOD2突变通过抑制核糖核蛋白hnRNP - A1的活性来抑制人白细胞介素10的转录。
Nat Immunol. 2009 May;10(5):471-9. doi: 10.1038/ni.1722. Epub 2009 Apr 6.
5
Gene expression-based survival prediction in lung adenocarcinoma: a multi-site, blinded validation study.基于基因表达的肺腺癌生存预测:一项多中心、盲法验证研究。
Nat Med. 2008 Aug;14(8):822-7. doi: 10.1038/nm.1790. Epub 2008 Jul 20.
6
Nrf2 gene promoter polymorphism is associated with ulcerative colitis in a Japanese population.Nrf2基因启动子多态性与日本人群的溃疡性结肠炎相关。
Hepatogastroenterology. 2008 Mar-Apr;55(82-83):394-7.
7
A general approach to simultaneous model fitting and variable elimination in response models for biological data with many more variables than observations.一种用于生物数据响应模型的同时进行模型拟合和变量消除的通用方法,该生物数据的变量比观测值多得多。
BMC Bioinformatics. 2008 Apr 15;9:195. doi: 10.1186/1471-2105-9-195.
8
Liver biopsy assessment in chronic viral hepatitis: a personal, practical approach.慢性病毒性肝炎的肝活检评估:一种个人化的实用方法。
Mod Pathol. 2007 Feb;20 Suppl 1:S3-14. doi: 10.1038/modpathol.3800693.
9
Assessment of survival prediction models based on microarray data.基于微阵列数据的生存预测模型评估。
Bioinformatics. 2007 Jul 15;23(14):1768-74. doi: 10.1093/bioinformatics/btm232. Epub 2007 May 7.
10
Integrative molecular concept modeling of prostate cancer progression.前列腺癌进展的整合分子概念模型
Nat Genet. 2007 Jan;39(1):41-51. doi: 10.1038/ng1935. Epub 2006 Dec 17.