Suppr超能文献

利用局部估计散点图平滑(LOESS)回归来估计乳腺癌患者纵向队列中缺失的权重。

Utilization of locally estimated scatterplot smoothing (LOESS) regression to estimate missing weights in a longitudinal cohort of breast cancer patients.

作者信息

Zimbalist Alexa, Radimer Kelly H, Ergas Isaac J, Roh Janise M, Quesenberry Charles P, Kwan Marilyn L, Kushi Lawrence H

机构信息

Division of Research, Kaiser Permanente Northern California, 4480 Hacienda Drive, Pleasanton, CA 94588, United States.

Division of Research, Kaiser Permanente Northern California, 4480 Hacienda Drive, Pleasanton, CA 94588, United States.

出版信息

Ann Epidemiol. 2025 Apr;104:55-60. doi: 10.1016/j.annepidem.2025.02.013. Epub 2025 Mar 4.

Abstract

INTRODUCTION

Traditional methods to handle missing data rely on making assumptions about missing data patterns. Locally estimated scatterplot smoothing (LOESS) regression models were explored as a data-driven option to minimize missing weight data in a longitudinal cohort of breast cancer patients.

METHODS

Outpatient weights from 2 years prior to breast cancer diagnosis to 10 years post were extracted from electronic health records for 10,778 women with invasive breast cancer diagnosed from 2005-2013 at Kaiser Permanente. LOESS regression models estimated weights at baseline (breast cancer diagnosis) and 6 follow-up time points (6, 12, 24, 48, 72, and 96 months post-baseline). The weights identified by the LOESS models were compared with those identified by the closest-available method, in which the weight measurement closest to each timepoint within a specified time window was selected.

RESULTS

Compared with the closest-available method, LOESS models identified fewer weights at baseline and 6 months post, but significantly more weights at later follow-up periods. At all timepoints, more than 80% of the weights identified by both approaches differed by 2.50 kilograms or less.

CONCLUSIONS

LOESS regression makes effective use of available longitudinal data and may be a beneficial tool to minimize missing longitudinal data in future EHR-based studies.

摘要

引言

处理缺失数据的传统方法依赖于对缺失数据模式做出假设。局部估计散点图平滑(LOESS)回归模型被作为一种数据驱动的选项进行探索,以尽量减少一组乳腺癌患者纵向队列中缺失的体重数据。

方法

从2005年至2013年在凯撒医疗集团被诊断为浸润性乳腺癌的10778名女性的电子健康记录中提取乳腺癌诊断前2年至诊断后10年的门诊体重数据。LOESS回归模型估计了基线(乳腺癌诊断时)和6个随访时间点(基线后6、12、24、48、72和96个月)的体重。将LOESS模型确定的体重与通过最近可用方法确定的体重进行比较,最近可用方法是在指定时间窗口内选择最接近每个时间点的体重测量值。

结果

与最近可用方法相比,LOESS模型在基线和6个月后确定的体重较少,但在后期随访期确定的体重明显更多。在所有时间点,两种方法确定的体重中超过80%相差2.50千克或更少。

结论

LOESS回归有效地利用了可用的纵向数据,可能是未来基于电子健康记录的研究中尽量减少纵向数据缺失的有益工具。

相似文献

6
Postdiagnosis Weight Change and Survival Following a Diagnosis of Early-Stage Breast Cancer.早期乳腺癌诊断后的诊断后体重变化与生存情况
Cancer Epidemiol Biomarkers Prev. 2017 Jan;26(1):44-50. doi: 10.1158/1055-9965.EPI-16-0150. Epub 2016 Aug 26.

本文引用的文献

1
Handling missing data in clinical research.临床研究中缺失数据的处理。
J Clin Epidemiol. 2022 Nov;151:185-188. doi: 10.1016/j.jclinepi.2022.08.016. Epub 2022 Sep 21.
2
Missing Data in Clinical Research: A Tutorial on Multiple Imputation.临床研究中的缺失数据:多重插补方法教程。
Can J Cardiol. 2021 Sep;37(9):1322-1331. doi: 10.1016/j.cjca.2020.11.010. Epub 2020 Dec 1.
8
Measuring bias in self-reported data.测量自我报告数据中的偏差。
Int J Behav Healthc Res. 2011 Oct;2(4):320-332. doi: 10.1504/IJBHR.2011.043414.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验