Robust Multiple Regression.

Author information

Scott David W, Wang Zhipeng

Affiliations

Department of Statistics, Rice University, MS-138, 6100 Main Street, Houston, TX 77005, USA.

Apple Corporation, Cupertino, CA 95014, USA.

Publication information

Entropy (Basel). 2021 Jan 9;23(1):88. doi: 10.3390/e23010088.

Abstract

As modern data analysis pushes the boundaries of classical statistics, it is timely to reexamine alternate approaches to dealing with outliers in multiple regression. As sample sizes and the number of predictors increase, interactive methodology becomes less effective. Likewise, with limited understanding of the underlying contamination process, diagnostics are likely to fail as well. In this article, we advocate for a non-likelihood procedure that attempts to quantify the fraction of bad data as a part of the estimation step. These ideas also allow for the selection of important predictors under some assumptions. As there are many robust algorithms available, running several and looking for interesting differences is a sensible strategy for understanding the nature of the outliers.
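The closing suggestion of the abstract, running more than one algorithm and looking at where the fits disagree, can be illustrated with a minimal sketch. This is not the authors' procedure: it swaps in ordinary least squares and the Theil-Sen estimator as two convenient stand-ins, uses a single predictor rather than a full multiple regression, and runs on synthetic data. The function names `ols_fit` and `theil_sen_fit`, the contamination pattern, and the choice to list the eight largest residuals are all assumptions made for illustration.

```python
import random
import statistics
from itertools import combinations


def ols_fit(x, y):
    """Ordinary least squares for y = a + b*x; returns (a, b)."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    b = sxy / sxx
    return my - b * mx, b


def theil_sen_fit(x, y):
    """Theil-Sen: median pairwise slope, then median intercept; returns (a, b)."""
    slopes = [(y[j] - y[i]) / (x[j] - x[i])
              for i, j in combinations(range(len(x)), 2) if x[i] != x[j]]
    b = statistics.median(slopes)
    a = statistics.median([yi - b * xi for xi, yi in zip(x, y)])
    return a, b


# Synthetic data (hypothetical): true line y = 1 + 2x with small bounded noise.
rng = random.Random(0)
x = [float(i) for i in range(40)]
y = [1.0 + 2.0 * xi + rng.uniform(-0.3, 0.3) for xi in x]

# Contaminate the last 8 points (20% of the sample) with a large upward shift.
bad = set(range(32, 40))
for i in bad:
    y[i] += 30.0

a_ols, b_ols = ols_fit(x, y)
a_ts, b_ts = theil_sen_fit(x, y)

# Rank points by absolute residual from the robust fit and keep the top 8.
resid_ts = [yi - (a_ts + b_ts * xi) for xi, yi in zip(x, y)]
suspects = sorted(range(len(x)), key=lambda i: abs(resid_ts[i]), reverse=True)[:8]

print(f"OLS slope:       {b_ols:.3f}")
print(f"Theil-Sen slope: {b_ts:.3f}")
print(f"Largest robust residuals at indices: {sorted(suspects)}")
```

In this toy setting the two slopes differ noticeably, and that gap is the kind of "interesting difference" worth investigating. Note that the sketch needs the number of suspect points to be supplied by hand, whereas the abstract describes a procedure that estimates the fraction of bad data as part of the fit itself.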

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cc2/7826993/ae945437f8b2/entropy-23-00088-g001.jpg
