Mazumdar M, Glassman J R
Division of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York, NY 10021, USA.
Stat Med. 2000 Jan 15;19(1):113-32. doi: 10.1002/(sici)1097-0258(20000115)19:1<113::aid-sim245>3.0.co;2-o.
Categorizing prognostic variables is essential for their use in clinical decision-making. Often a single cutpoint that stratifies patients into high-risk and low-risk categories is sought. These categories may be used for making treatment recommendations, determining study eligibility, or to control for varying patient prognoses in the design of a clinical trial. Methods used to categorize variables include: biological determination (most desirable but often unavailable); arbitrary selection of a cutpoint at the median value; graphical examination of the data for a threshold effect; and exploration of all observed values for the one which best separates the risk groups according to a chi-squared test. The last method, called the minimum p-value approach, involves multiple testing which inflates the type I error rates. Several methods for adjusting the inflated p-values have been proposed but remain infrequently used. Exploratory methods for categorization and the minimum p-value approach with its various p-value corrections are reviewed, and code for their easy implementation is provided. The combined use of these methods is recommended, and demonstrated in the context of two cancer-related examples which highlight a variety of the issues involved in the categorization of prognostic variables.
对预后变量进行分类对于其在临床决策中的应用至关重要。通常会寻找一个单一的切点,将患者分为高风险和低风险类别。这些类别可用于提出治疗建议、确定研究资格,或在临床试验设计中控制不同患者的预后。用于对变量进行分类的方法包括:生物学判定(最理想但通常不可用);在中位数处任意选择一个切点;对数据进行图形检查以寻找阈值效应;以及根据卡方检验探索所有观察值,找出最能区分风险组的那个值。最后一种方法,称为最小p值法,涉及多重检验,这会使I型错误率膨胀。已经提出了几种调整膨胀p值的方法,但仍很少使用。本文回顾了分类的探索性方法以及带有各种p值校正的最小p值法,并提供了便于实施的代码。建议联合使用这些方法,并在两个与癌症相关的例子中进行了演示,这些例子突出了预后变量分类中涉及的各种问题。