

Outliers in nutrient intake data for U.S. adults: National Health and Nutrition Examination Survey 2017-2018.

Authors

Burcham Sara, Liu Yuki, Merianos Ashley L, Mendy Angelico

Affiliations

Division of Epidemiology, Department of Environmental and Public Health Sciences, University of Cincinnati College of Medicine, Cincinnati, OH, USA.

Intuitive Surgical, Inc., Global Health Economics and Outcomes Research, Sunnyvale, CA, USA.

Publication information

Epidemiol Methods. 2023 Nov 10;12(1):20230018. doi: 10.1515/em-2023-0018. eCollection 2023 Jan.

Abstract

OBJECTIVES

An important step in preparing data for statistical analysis is outlier detection and removal, yet no gold standard exists in the current literature. The objective of this study is to identify the ideal decision test using the National Health and Nutrition Examination Survey (NHANES) 2017-2018 dietary data.

METHODS

We conducted a secondary analysis of NHANES 24-h dietary recalls, accounting for the survey's multi-stage cluster design. Six outlier detection and removal strategies were assessed by evaluating each decision test's impact on Pearson's correlation coefficient among macronutrients. Furthermore, we assessed changes in the effect size estimates based on pre-defined sample sizes. The data were collected as part of the 2017-2018 24-h dietary recall among adult participants (N=4,893).

RESULTS

Across all decision tests, effect estimate changes for macronutrients ranged from 6.5 % for protein to 39.3 % for alcohol. The largest proportion of outliers removed was 4.0 %, in the large sample size, under the decision test of >2 standard deviations from the mean. Compared with applying no decision test, the smallest sample size was the most affected by the six decision tests, particularly for the alcohol analysis.
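
As an illustration only, the >2 standard deviations decision test and its effect on a correlation estimate can be sketched as follows. This is a minimal, unweighted sketch: it ignores the NHANES survey weights and multi-stage cluster design that the study accounted for, the intake values are hypothetical rather than NHANES data, and the percent-change formula is an assumption, since the abstract does not state how effect estimate changes were computed. The function names are likewise hypothetical.

```python
import math


def pearson_r(xs, ys):
    """Unweighted Pearson correlation coefficient of two equal-length lists."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def within_2sd(xs):
    """Flag each value that lies within 2 sample SDs of the mean."""
    m = sum(xs) / len(xs)
    sd = math.sqrt(sum((x - m) ** 2 for x in xs) / (len(xs) - 1))
    return [abs(x - m) <= 2 * sd for x in xs]


def evaluate_2sd_rule(xs, ys):
    """Drop pairs where either value is >2 SD from its mean, then compare r."""
    keep = [a and b for a, b in zip(within_2sd(xs), within_2sd(ys))]
    kept_x = [x for x, k in zip(xs, keep) if k]
    kept_y = [y for y, k in zip(ys, keep) if k]
    r_before = pearson_r(xs, ys)
    r_after = pearson_r(kept_x, kept_y)
    return {
        "pct_removed": 100 * (len(xs) - len(kept_x)) / len(xs),
        "r_before": r_before,
        "r_after": r_after,
        # Assumed definition of the relative change in the estimate.
        "pct_change_r": 100 * (r_after - r_before) / abs(r_before),
    }


# Hypothetical intakes in g/day (not NHANES values).
protein = [60, 72, 81, 95, 68, 77, 88, 70, 64, 300]
carbohydrate = [180, 210, 230, 260, 195, 220, 250, 200, 190, 240]
result = evaluate_2sd_rule(protein, carbohydrate)
print(result)
```

In this toy data a single extreme protein value is removed, and the correlation estimate changes substantially, which is the kind of sensitivity the study evaluates at scale.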

CONCLUSIONS

This study, the first to use 2017-2018 NHANES dietary data for outlier evaluation, emphasizes the importance of selecting an appropriate decision test considering factors such as statistical power, sample size, normality assumptions, the proportion of data removed, effect estimate changes, and the consistency of estimates across sample sizes. We recommend the use of non-parametric tests for non-normally distributed variables of interest.
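
The abstract does not name which non-parametric tests it recommends. One commonly used non-parametric rule, chosen here purely as an assumption for illustration, is Tukey's fences, which flag values more than 1.5 times the interquartile range beyond the quartiles. The sketch below uses hypothetical right-skewed intake values, not NHANES data, and does not apply survey weights.

```python
import statistics


def tukey_fences(values, k=1.5):
    """Return (lower, upper) fences at k * IQR beyond the first and third quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def flag_outliers(values, k=1.5):
    """Return the values that fall outside the Tukey fences."""
    lower, upper = tukey_fences(values, k)
    return [v for v in values if v < lower or v > upper]


# Hypothetical, right-skewed alcohol intakes in g/day (not NHANES values).
alcohol = [0, 0, 0, 0, 5, 10, 14, 14, 28, 120]
print(tukey_fences(alcohol))
print(flag_outliers(alcohol))
```

Because the fences depend on quartiles rather than on the mean and standard deviation, they are less influenced by the extreme values they are meant to detect, which is one reason such rules are often preferred for skewed variables.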


