Clustering of trend data using joinpoint regression models.

Author information

Kim Hyune-Ju, Luo Jun, Kim Jeankyung, Chen Huann-Sheng, Feuer Eric J

Affiliation

Department of Mathematics, Syracuse University, Syracuse, NY, 13244, U.S.A.

Publication information

Stat Med. 2014 Oct 15;33(23):4087-103. doi: 10.1002/sim.6221. Epub 2014 Jun 3.

Abstract

In this paper, we propose methods to cluster groups of two-dimensional data whose mean functions are piecewise linear into several clusters with common characteristics such as the same slopes. To fit segmented line regression models with common features for each possible cluster, we use a restricted least squares method. In implementing the restricted least squares method, we estimate the maximum number of segments in each cluster by using both the permutation test method and the Bayes information criterion method and then propose to use the Bayes information criterion to determine the number of clusters. For a more effective implementation of the clustering algorithm, we propose a measure of the minimum distance worth detecting and illustrate its use in two examples. We summarize simulation results to study properties of the proposed methods and also prove the consistency of the cluster grouping estimated with a given number of clusters. The presentation and examples in this paper focus on the segmented line regression model with the ordered values of the independent variable, which has been the model of interest in cancer trend analysis, but the proposed method can be applied to a general model with design points either ordered or unordered.
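
The clustering idea in the abstract can be illustrated with a deliberately simplified sketch. It is not the authors' implementation. It assumes every group follows a single straight line (no joinpoints), that groups in the same cluster share a slope but keep their own intercepts, and that the Gaussian form BIC = N log(RSS/N) + p log(N) is used. The function names, the toy data, and the exhaustive search over partitions are all hypothetical choices made for illustration, and the permutation test step is not reproduced.

```python
import numpy as np

def set_partitions(items):
    """Yield every partition of the list `items` into non-empty clusters."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for part in set_partitions(rest):
        # Put `first` into each existing cluster in turn ...
        for i in range(len(part)):
            yield part[:i] + [[first] + part[i]] + part[i + 1:]
        # ... or open a new cluster containing only `first`.
        yield [[first]] + part

def cluster_rss(x, ys):
    """Restricted least squares for one cluster: a single shared slope
    and a separate intercept for each group. Returns the residual sum of squares."""
    g, n = len(ys), len(x)
    X = np.zeros((g * n, g + 1))
    for j in range(g):
        X[j * n:(j + 1) * n, j] = 1.0   # intercept column of group j
        X[j * n:(j + 1) * n, g] = x     # shared slope column
    y = np.concatenate(ys)
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    return float(resid @ resid)

def bic(x, data, partition):
    """BIC of a partition (assumed form: N log(RSS/N) + p log N)."""
    n_total = len(x) * len(data)
    rss = sum(cluster_rss(x, [data[g] for g in cluster]) for cluster in partition)
    n_params = len(data) + len(partition)  # one intercept per group, one slope per cluster
    return n_total * np.log(rss / n_total) + n_params * np.log(n_total)

def best_partition(x, data):
    """Exhaustive search over partitions; only feasible for a handful of groups."""
    return min(set_partitions(list(data)), key=lambda p: bic(x, data, p))

# Toy data (hypothetical): groups A and B rise, groups C and D fall.
rng = np.random.default_rng(0)
x = np.arange(20.0)
data = {
    "A": 1.0 + 1.0 * x + rng.normal(0, 0.1, x.size),
    "B": 3.0 + 1.0 * x + rng.normal(0, 0.1, x.size),
    "C": 9.0 - 0.5 * x + rng.normal(0, 0.1, x.size),
    "D": 5.0 - 0.5 * x + rng.normal(0, 0.1, x.size),
}
best = best_partition(x, data)
print(best)
```

Because the penalty term depends only on the number of clusters, minimizing BIC over all partitions is equivalent to finding the best grouping for each number of clusters and then comparing those by BIC. With slopes as well separated as in this toy data, groups with different true slopes should not end up in the same cluster, although BIC may occasionally split groups that share a slope. Extending the sketch to segmented lines would require a joinpoint search inside `cluster_rss`.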
