• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

R 中的样条函数过程综述。

A review of spline function procedures in R.

机构信息

Department of Mathematical Sciences, University of Essex, Colchester, UK.

Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany.

出版信息

BMC Med Res Methodol. 2019 Mar 6;19(1):46. doi: 10.1186/s12874-019-0666-3.

DOI:10.1186/s12874-019-0666-3
PMID:30841848
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6402144/
Abstract

BACKGROUND

With progress on both the theoretical and the computational fronts the use of spline modelling has become an established tool in statistical regression analysis. An important issue in spline modelling is the availability of user friendly, well documented software packages. Following the idea of the STRengthening Analytical Thinking for Observational Studies initiative to provide users with guidance documents on the application of statistical methods in observational research, the aim of this article is to provide an overview of the most widely used spline-based techniques and their implementation in R.

METHODS

In this work, we focus on the R Language for Statistical Computing which has become a hugely popular statistics software. We identified a set of packages that include functions for spline modelling within a regression framework. Using simulated and real data we provide an introduction to spline modelling and an overview of the most popular spline functions.

RESULTS

We present a series of simple scenarios of univariate data, where different basis functions are used to identify the correct functional form of an independent variable. Even in simple data, using routines from different packages would lead to different results.

CONCLUSIONS

This work illustrate challenges that an analyst faces when working with data. Most differences can be attributed to the choice of hyper-parameters rather than the basis used. In fact an experienced user will know how to obtain a reasonable outcome, regardless of the type of spline used. However, many analysts do not have sufficient knowledge to use these powerful tools adequately and will need more guidance.

摘要

背景

随着理论和计算方面的进展,样条建模已成为统计回归分析中一种既定的工具。样条建模中的一个重要问题是是否有用户友好、文档齐全的软件包。为了响应 STRengthening Analytical Thinking for Observational Studies 倡议,为用户提供有关在观察性研究中应用统计方法的指南文件的想法,本文旨在提供最广泛使用的基于样条的技术及其在 R 中的实现的概述。

方法

在这项工作中,我们专注于 R 语言统计计算,它已成为一种非常流行的统计软件。我们确定了一组包含在回归框架内进行样条建模的功能的软件包。使用模拟和真实数据,我们介绍了样条建模,并概述了最流行的样条函数。

结果

我们展示了一系列简单的单变量数据场景,其中使用不同的基函数来确定独立变量的正确函数形式。即使在简单的数据中,使用来自不同软件包的例程也会导致不同的结果。

结论

这项工作说明了分析师在处理数据时面临的挑战。大多数差异可以归因于超参数的选择,而不是所使用的基函数。实际上,有经验的用户将知道如何获得合理的结果,而不管使用哪种样条。但是,许多分析师没有足够的知识来充分利用这些强大的工具,因此需要更多的指导。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0533/6402144/86fdf2e82706/12874_2019_666_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0533/6402144/6102f5a3e499/12874_2019_666_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0533/6402144/e913abe4eaf1/12874_2019_666_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0533/6402144/74817279bbd8/12874_2019_666_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0533/6402144/3e5569732603/12874_2019_666_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0533/6402144/b998a03b2888/12874_2019_666_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0533/6402144/86fdf2e82706/12874_2019_666_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0533/6402144/6102f5a3e499/12874_2019_666_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0533/6402144/e913abe4eaf1/12874_2019_666_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0533/6402144/74817279bbd8/12874_2019_666_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0533/6402144/3e5569732603/12874_2019_666_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0533/6402144/b998a03b2888/12874_2019_666_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0533/6402144/86fdf2e82706/12874_2019_666_Fig6_HTML.jpg

相似文献

1
A review of spline function procedures in R.R 中的样条函数过程综述。
BMC Med Res Methodol. 2019 Mar 6;19(1):46. doi: 10.1186/s12874-019-0666-3.
2
STRengthening analytical thinking for observational studies: the STRATOS initiative.加强观察性研究的分析思维:STRATOS倡议。
Stat Med. 2014 Dec 30;33(30):5413-32. doi: 10.1002/sim.6265. Epub 2014 Jul 30.
3
A set of SAS macros for calculating and displaying adjusted odds ratios (with confidence intervals) for continuous covariates in logistic B-spline regression models.一组用于计算和显示逻辑B样条回归模型中连续协变量的调整比值比(及其置信区间)的SAS宏。
Comput Methods Programs Biomed. 2008 Oct;92(1):109-14. doi: 10.1016/j.cmpb.2008.05.004. Epub 2008 Jul 7.
4
Gaining more flexibility in Cox proportional hazards regression models with cubic spline functions.利用三次样条函数在Cox比例风险回归模型中获得更大灵活性。
Comput Methods Programs Biomed. 1997 Nov;54(3):201-8. doi: 10.1016/s0169-2607(97)00043-6.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Selection of important variables and determination of functional form for continuous predictors in multivariable model building.多变量模型构建中重要变量的选择以及连续预测变量函数形式的确定。
Stat Med. 2007 Dec 30;26(30):5512-28. doi: 10.1002/sim.3148.
7
Systematic review of education and practical guidance on regression modeling for medical researchers who lack a strong statistical background: Study protocol.缺乏强大统计学背景的医学研究人员的回归建模教育和实践指南的系统评价:研究方案。
PLoS One. 2020 Dec 21;15(12):e0241427. doi: 10.1371/journal.pone.0241427. eCollection 2020.
8
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
9
R/parallel--speeding up bioinformatics analysis with R.R/parallel——利用R加速生物信息学分析
BMC Bioinformatics. 2008 Sep 22;9:390. doi: 10.1186/1471-2105-9-390.
10
Nonlinear growth curve modeling using penalized spline models: A gentle introduction.使用惩罚样条模型进行非线性增长曲线建模:一个简要介绍。
Psychol Methods. 2019 Jun;24(3):269-290. doi: 10.1037/met0000193. Epub 2018 Aug 16.

引用本文的文献

1
The non-linear association of physical fatigue with depression and anxiety among mental health professionals who recovered from COVID-19 infection: a national survey in China.新冠病毒感染康复的心理健康专业人员中身体疲劳与抑郁和焦虑的非线性关联:一项中国全国性调查
Front Psychiatry. 2025 Aug 22;16:1610910. doi: 10.3389/fpsyt.2025.1610910. eCollection 2025.
2
The correlation between preoperative total lesion glycolysis and lymphovascular invasion based on F-FDG PET/CT and its predictive value in primary gastric cancer: a cross-sectional study.基于F-FDG PET/CT的原发性胃癌术前总病灶糖酵解与淋巴管侵犯的相关性及其预测价值:一项横断面研究
Quant Imaging Med Surg. 2025 Sep 1;15(9):7738-7748. doi: 10.21037/qims-2025-644. Epub 2025 Aug 13.
3

本文引用的文献

1
On stability issues in deriving multivariable regression models.关于推导多变量回归模型中的稳定性问题。
Biom J. 2015 Jul;57(4):531-55. doi: 10.1002/bimj.201300222. Epub 2014 Dec 15.
2
STRengthening analytical thinking for observational studies: the STRATOS initiative.加强观察性研究的分析思维:STRATOS倡议。
Stat Med. 2014 Dec 30;33(30):5413-32. doi: 10.1002/sim.6265. Epub 2014 Jul 30.
3
Resampling methods for model fitting and model selection.用于模型拟合和模型选择的重采样方法。
The potential causal relationship between fat mass in different body areas and low back pain: Findings from NHANES and Mendelian randomization studies.不同身体部位脂肪量与腰痛之间的潜在因果关系:来自美国国家健康与营养检查调查(NHANES)及孟德尔随机化研究的结果
Medicine (Baltimore). 2025 Aug 15;104(33):e43891. doi: 10.1097/MD.0000000000043891.
4
Developmental trajectories of the default mode, frontoparietal, and salience networks from the third trimester through the newborn period.从孕晚期到新生儿期默认模式网络、额顶叶网络和突显网络的发育轨迹。
Imaging Neurosci (Camb). 2024 Jul 8;2. doi: 10.1162/imag_a_00201. eCollection 2024.
5
The association between parental BMI and offspring adiposity: A genetically informed analysis of trios.父母体重指数与后代肥胖之间的关联:对三人组的遗传信息分析。
PLoS Genet. 2025 Aug 5;21(8):e1011775. doi: 10.1371/journal.pgen.1011775. eCollection 2025 Aug.
6
Impaired physical function in relation to later-life exposure to ambient fine particulate matter and ozone among Chinese middle-aged and older adults.中国中老年人晚年暴露于环境细颗粒物和臭氧与身体功能受损的关系。
BMC Public Health. 2025 Aug 2;25(1):2616. doi: 10.1186/s12889-025-23885-9.
7
Predictors and Correlates of Depression and Anxiety Symptom Trajectories in a Large Digital Mental Health Provider: Retrospective Analysis of Data From Rula Health.大型数字心理健康服务提供商中抑郁和焦虑症状轨迹的预测因素及相关因素:对鲁拉健康数据的回顾性分析
J Med Internet Res. 2025 Jul 25;27:e75750. doi: 10.2196/75750.
8
Time-series modeling of epidemics in complex populations: Detecting changes in incidence volatility over time.复杂人群中流行病的时间序列建模:检测发病率波动随时间的变化。
PLoS Comput Biol. 2025 Jul 11;21(7):e1012882. doi: 10.1371/journal.pcbi.1012882. eCollection 2025 Jul.
9
The impact of inter-infection time on antimicrobial resistance profiles in women with multiple urinary tract infections over time.随着时间推移,反复发生尿路感染的女性中,感染间隔时间对抗菌药物耐药性特征的影响。
J Antimicrob Chemother. 2025 Aug 1;80(8):2234-2240. doi: 10.1093/jac/dkaf194.
10
Gender differences in Leptospira exposure risk, perceptions of disease severity, and high-risk behaviours in Salvador, Brazil: A cross-sectional study.巴西萨尔瓦多钩端螺旋体暴露风险、疾病严重程度认知及高危行为的性别差异:一项横断面研究
PLOS Glob Public Health. 2025 Jun 27;5(6):e0004786. doi: 10.1371/journal.pgph.0004786. eCollection 2025.
J Biopharm Stat. 2011 Nov;21(6):1177-86. doi: 10.1080/10543406.2011.607749.
4
Selection of important variables and determination of functional form for continuous predictors in multivariable model building.多变量模型构建中重要变量的选择以及连续预测变量函数形式的确定。
Stat Med. 2007 Dec 30;26(30):5512-28. doi: 10.1002/sim.3148.
5
On the use of generalized additive models in time-series studies of air pollution and health.关于广义相加模型在空气污染与健康的时间序列研究中的应用
Am J Epidemiol. 2002 Aug 1;156(3):193-203. doi: 10.1093/aje/kwf062.
6
Generalized additive models for medical research.医学研究中的广义相加模型。
Stat Methods Med Res. 1995 Sep;4(3):187-96. doi: 10.1177/096228029500400302.
7
Smoothing reference centile curves: the LMS method and penalized likelihood.平滑参考百分位数曲线:LMS方法与惩罚似然法
Stat Med. 1992 Jul;11(10):1305-19. doi: 10.1002/sim.4780111005.