Outlier detection and robust variable selection via the penalized weighted LAD-LASSO method.

Authors

Jiang Yunlu, Wang Yan, Zhang Jiantao, Xie Baojian, Liao Jibiao, Liao Wenhui

Affiliations

Department of Statistics, College of Economics, Jinan University, Guangzhou, People's Republic of China.

College of Economics, Jinan University, Guangzhou, People's Republic of China.

Publication information

J Appl Stat. 2020 Feb 4;48(2):234-246. doi: 10.1080/02664763.2020.1722079. eCollection 2021.

DOI:10.1080/02664763.2020.1722079
PMID:35707691
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9041793/
Abstract

This paper studies outlier detection and robust variable selection in the linear regression model. The penalized weighted least absolute deviation (PWLAD) regression estimation method and the adaptive least absolute shrinkage and selection operator (LASSO) are combined to achieve outlier detection and robust variable selection simultaneously. An iterative algorithm is proposed to solve the resulting optimization problem. Monte Carlo studies are conducted to evaluate the finite-sample performance of the proposed methods. The results indicate that the proposed methods outperform existing methods in finite samples when there are leverage points or outliers in the response variable or explanatory variables. Finally, we apply the proposed methodology to analyze two real datasets.
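
To make the idea of a weighted LAD loss with an adaptive L1 penalty more concrete, the following is a minimal sketch and not the authors' iterative algorithm. It assumes the observation weights (`obs_w`) and the adaptive penalty weights (`pen_w`) are supplied by the caller, whereas the abstract indicates that the paper's method also produces the information needed for outlier detection. All function and variable names are hypothetical. The solver is plain coordinate descent with weighted-median updates; for a non-smooth loss like this one, coordinate descent never increases the objective but may stop short of the global minimum.

```python
def weighted_median(values, weights):
    """Return a minimizer of sum_k weights[k] * |values[k] - b| over b."""
    pairs = sorted((v, w) for v, w in zip(values, weights) if w > 0)
    half = sum(w for _, w in pairs) / 2.0
    acc = 0.0
    for v, w in pairs:
        acc += w
        if acc >= half:
            return v
    return 0.0  # all weights were zero


def objective(X, y, beta, obs_w, pen_w, lam):
    """Weighted absolute-deviation loss plus a weighted L1 penalty."""
    loss = sum(
        w * abs(yi - sum(xij * bj for xij, bj in zip(xi, beta)))
        for xi, yi, w in zip(X, y, obs_w)
    )
    return loss + lam * sum(p * abs(b) for p, b in zip(pen_w, beta))


def weighted_lad_lasso(X, y, obs_w, pen_w, lam, sweeps=50):
    """Coordinate descent: each update is an exact 1-D weighted median."""
    p = len(X[0])
    beta = [0.0] * p
    for _ in range(sweeps):
        for j in range(p):
            # The penalty lam * pen_w[j] * |b| acts as a pseudo-observation at 0.
            vals, wts = [0.0], [lam * pen_w[j]]
            for xi, yi, w in zip(X, y, obs_w):
                if xi[j] != 0:
                    # Partial residual with coordinate j left out.
                    r = yi - sum(xi[k] * beta[k] for k in range(p) if k != j)
                    vals.append(r / xi[j])
                    wts.append(w * abs(xi[j]))
            beta[j] = weighted_median(vals, wts)
    return beta


# Toy data: y = 2x for three points, plus one bad leverage point at x = 20.
X = [[1.0], [2.0], [3.0], [20.0]]
y = [2.0, 4.0, 6.0, 0.0]
pen_w = [1.0]   # assumed adaptive penalty weight (e.g. from an initial fit)
lam = 0.1

uniform_w = [1.0, 1.0, 1.0, 1.0]
down_w = [1.0, 1.0, 1.0, 0.05]  # assumed: leverage point is down-weighted

beta_uniform = weighted_lad_lasso(X, y, uniform_w, pen_w, lam)       # [0.0]
beta_downweighted = weighted_lad_lasso(X, y, down_w, pen_w, lam)     # [2.0]
```

With uniform weights the single leverage point pulls the slope to 0, while down-weighting it recovers the slope of 2 shared by the remaining points. This is the motivation for weighting observations in an LAD fit when the explanatory variables contain leverage points.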

