• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

检测纵向儿童生长数据中异常测量值和轨迹的新方法和技术考虑因素。

New approaches and technical considerations in detecting outlier measurements and trajectories in longitudinal children growth data.

机构信息

Department of Nutritional Sciences, Faculty of Medicine, University of Toronto, Toronto, Canada.

Translational Medicine Program, Hospital for Sick Children, Toronto, Canada.

出版信息

BMC Med Res Methodol. 2023 Oct 13;23(1):232. doi: 10.1186/s12874-023-02045-w.

DOI:10.1186/s12874-023-02045-w
PMID:37833647
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10576311/
Abstract

BACKGROUND

Growth studies rely on longitudinal measurements, typically represented as trajectories. However, anthropometry is prone to errors that can generate outliers. While various methods are available for detecting outlier measurements, a gold standard has yet to be identified, and there is no established method for outlying trajectories. Thus, outlier types and their effects on growth pattern detection still need to be investigated. This work aimed to assess the performance of six methods at detecting different types of outliers, propose two novel methods for outlier trajectory detection and evaluate how outliers affect growth pattern detection.

METHODS

We included 393 healthy infants from The Applied Research Group for Kids (TARGet Kids!) cohort and 1651 children with severe malnutrition from the co-trimoxazole prophylaxis clinical trial. We injected outliers of three types and six intensities and applied four outlier detection methods for measurements (model-based and World Health Organization cut-offs-based) and two for trajectories. We also assessed growth pattern detection before and after outlier injection using time series clustering and latent class mixed models. Error type, intensity, and population affected method performance.

RESULTS

Model-based outlier detection methods performed best for measurements with precision between 5.72-99.89%, especially for low and moderate error intensities. The clustering-based outlier trajectory method had high precision of 14.93-99.12%. Combining methods improved the detection rate to 21.82% in outlier measurements. Finally, when comparing growth groups with and without outliers, the outliers were shown to alter group membership by 57.9 -79.04%.

CONCLUSIONS

World Health Organization cut-off-based techniques were shown to perform well in few very particular cases (extreme errors of high intensity), while model-based techniques performed well, especially for moderate errors of low intensity. Clustering-based outlier trajectory detection performed exceptionally well across all types and intensities of errors, indicating a potential strategic change in how outliers in growth data are viewed. Finally, the importance of detecting outliers was shown, given its impact on children growth studies, as demonstrated by comparing results of growth group detection.

摘要

背景

生长研究依赖于纵向测量,通常表现为轨迹。然而,人体测量学容易产生误差,从而产生异常值。虽然有多种方法可用于检测异常测量值,但尚未确定金标准,也没有用于异常轨迹的既定方法。因此,异常值的类型及其对生长模式检测的影响仍需研究。本研究旨在评估六种方法检测不同类型异常值的性能,提出两种新的异常轨迹检测方法,并评估异常值如何影响生长模式检测。

方法

我们纳入了来自 TARGet Kids!队列的 393 名健康婴儿和来自复方磺胺甲噁唑预防临床试验的 1651 名严重营养不良儿童。我们注射了三种类型和六种强度的异常值,并应用了四种测量异常值的检测方法(基于模型和世界卫生组织截断值的方法)和两种轨迹异常值的检测方法。我们还在注射异常值前后使用时间序列聚类和潜在类别混合模型评估生长模式检测。错误类型、强度和人群会影响方法的性能。

结果

对于测量值,基于模型的异常值检测方法的性能最佳,精度为 5.72-99.89%,尤其是对于低强度和中强度误差。基于聚类的异常轨迹方法的精度为 14.93-99.12%。结合方法可将异常值的检测率提高到 21.82%。最后,在比较有无异常值的生长组时,异常值使组归属发生了 57.9-79.04%的改变。

结论

基于世界卫生组织截断值的技术在少数非常特殊的情况下表现良好(高强度极端误差),而基于模型的技术表现良好,尤其是在低强度的中度误差情况下。基于聚类的异常轨迹检测在所有类型和强度的误差下表现出色,表明人们对生长数据中异常值的看法可能会发生战略变化。最后,通过比较生长组检测结果,显示了检测异常值的重要性,因为它会影响儿童生长研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8495/10576311/40919adfc87d/12874_2023_2045_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8495/10576311/4bf0d6433561/12874_2023_2045_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8495/10576311/7620a1e78785/12874_2023_2045_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8495/10576311/40919adfc87d/12874_2023_2045_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8495/10576311/4bf0d6433561/12874_2023_2045_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8495/10576311/7620a1e78785/12874_2023_2045_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8495/10576311/40919adfc87d/12874_2023_2045_Fig3_HTML.jpg

相似文献

1
New approaches and technical considerations in detecting outlier measurements and trajectories in longitudinal children growth data.检测纵向儿童生长数据中异常测量值和轨迹的新方法和技术考虑因素。
BMC Med Res Methodol. 2023 Oct 13;23(1):232. doi: 10.1186/s12874-023-02045-w.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Detecting EEG outliers for BCI on the Riemannian manifold using spectral clustering.使用谱聚类在黎曼流形上检测用于脑机接口的脑电图异常值。
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:438-441. doi: 10.1109/EMBC44109.2020.9175456.
4
Entropy-based grid approach for handling outliers: a case study to environmental monitoring data.基于熵的网格方法处理异常值:以环境监测数据为例。
Environ Sci Pollut Res Int. 2023 Dec;30(60):125138-125157. doi: 10.1007/s11356-023-26780-1. Epub 2023 Jun 12.
5
Outlier detection in spatial error models using modified thresholding-based iterative procedure for outlier detection approach.基于阈值迭代的空间误差模型异常值检测方法的改进。
BMC Med Res Methodol. 2024 Apr 15;24(1):89. doi: 10.1186/s12874-024-02208-3.
6
Identifying biologically implausible values in big longitudinal data: an example applied to child growth data from the Brazilian food and nutrition surveillance system.识别大型纵向数据中的生物学上不合理的值:应用于巴西食品和营养监测系统儿童生长数据的示例。
BMC Med Res Methodol. 2024 Feb 15;24(1):38. doi: 10.1186/s12874-024-02161-1.
7
Comparing Methods for Measurement Error Detection in Serial 24-h Hormonal Data.比较串联 24 小时激素数据中测量误差检测的方法。
J Biol Rhythms. 2019 Aug;34(4):347-363. doi: 10.1177/0748730419850917. Epub 2019 Jun 12.
8
"Outlier detection: Underused in sport science despite outliers' impact on inference and prediction".异常值检测:尽管异常值会影响推断和预测,但在体育科学中却未得到充分利用。
J Sports Sci. 2024 Dec;42(24):2495-2505. doi: 10.1080/02640414.2024.2443313. Epub 2024 Dec 24.
9
Trend-Residual Dual Modeling for Detection of Outliers in Low-Cost GPS Trajectories.用于检测低成本GPS轨迹中异常值的趋势-残差双重建模
Sensors (Basel). 2016 Dec 1;16(12):2036. doi: 10.3390/s16122036.
10
Data-driven evolution of water quality models: An in-depth investigation of innovative outlier detection approaches-A case study of Irish Water Quality Index (IEWQI) model.水质模型的数据驱动演变:创新异常值检测方法的深入研究——以爱尔兰水质指数(IEWQI)模型为例
Water Res. 2024 May 15;255:121499. doi: 10.1016/j.watres.2024.121499. Epub 2024 Mar 20.

引用本文的文献

1
An exploration of sub-national variability in institutional maternal mortality ratios in Kenya: a meta-analysis of the 2021 health facility data.肯尼亚各地区机构孕产妇死亡率差异的探索:对2021年医疗机构数据的荟萃分析。
Front Glob Womens Health. 2025 Mar 4;6:1481495. doi: 10.3389/fgwh.2025.1481495. eCollection 2025.
2
An efficient interpretable framework for unsupervised low, very low and extreme birth weight detection.一种用于无监督的低、极低和超低出生体重检测的高效可解释框架。
PLoS One. 2025 Jan 30;20(1):e0317843. doi: 10.1371/journal.pone.0317843. eCollection 2025.

本文引用的文献

1
Longitudinal assessments of child growth: A six-year follow-up of a cluster-randomized maternal education trial.儿童生长的纵向评估:一项针对母亲教育随机对照试验的六年随访研究。
Clin Nutr. 2021 Sep;40(9):5106-5113. doi: 10.1016/j.clnu.2021.08.007. Epub 2021 Aug 20.
2
Identifying longitudinal-growth patterns from infancy to childhood: a study comparing multiple clustering techniques.从婴儿期到儿童期的纵向生长模式识别:比较多种聚类技术的研究。
Int J Epidemiol. 2021 Jul 9;50(3):1000-1010. doi: 10.1093/ije/dyab021.
3
Is it time to stop sweeping data cleaning under the carpet? A novel algorithm for outlier management in growth data.
是否是时候停止将数据清理问题掩盖起来了?一种新的生长数据异常值管理算法。
PLoS One. 2020 Jan 24;15(1):e0228154. doi: 10.1371/journal.pone.0228154. eCollection 2020.
4
A Diagnostic Procedure for Detecting Outliers in Linear State-Space Models.一种用于检测线性状态空间模型中异常值的诊断程序。
Multivariate Behav Res. 2020 Mar-Apr;55(2):231-255. doi: 10.1080/00273171.2019.1627659. Epub 2019 Jul 2.
5
Breastfeeding duration, maternal body mass index, and birth weight are associated with differences in body mass index growth trajectories in early childhood.母乳喂养持续时间、产妇体重指数和出生体重与儿童早期体重指数增长轨迹的差异有关。
Am J Clin Nutr. 2018 Apr 1;107(4):584-592. doi: 10.1093/ajcn/nqx081.
6
New approach for the identification of implausible values and outliers in longitudinal childhood anthropometric data.用于鉴定纵向儿童人体测量数据中不合理值和离群值的新方法。
Ann Epidemiol. 2018 Mar;28(3):204-211.e3. doi: 10.1016/j.annepidem.2018.01.007. Epub 2018 Jan 11.
7
Daily co-trimoxazole prophylaxis to prevent mortality in children with complicated severe acute malnutrition: a multicentre, double-blind, randomised placebo-controlled trial.每日复方磺胺甲噁唑预防治疗复杂严重急性营养不良儿童死亡:一项多中心、双盲、随机安慰剂对照试验。
Lancet Glob Health. 2016 Jul;4(7):e464-73. doi: 10.1016/S2214-109X(16)30096-1. Epub 2016 Jun 2.
8
Cohort Profile: The Applied Research Group for Kids (TARGet Kids!).队列简介:儿童应用研究小组(目标儿童!)
Int J Epidemiol. 2015 Jun;44(3):776-88. doi: 10.1093/ije/dyu123. Epub 2014 Jun 30.
9
Shape outlier detection and visualization for functional data: the outliergram.功能数据的形状异常值检测与可视化:异常值图
Biostatistics. 2014 Oct;15(4):603-19. doi: 10.1093/biostatistics/kxu006. Epub 2014 Mar 11.
10
Measurement error in time-series analysis: a simulation study comparing modelled and monitored data.时间序列分析中的测量误差:比较模拟和监测数据的模拟研究。
BMC Med Res Methodol. 2013 Nov 13;13:136. doi: 10.1186/1471-2288-13-136.