Young Derek S, Mathew Thomas
Department of Statistics, University of Kentucky, Lexington, KY, USA.
Department of Mathematics and Statistics, University of Maryland Baltimore County, Baltimore, MD, USA.
Stat Methods Med Res. 2020 Dec;29(12):3569-3585. doi: 10.1177/0962280220933910. Epub 2020 Jun 29.
Reference regions are widely used in clinical chemistry and laboratory medicine to interpret the results of biochemical or physiological tests of patients. There are well-established methods in the literature for reference limits for univariate measurements; however, limited methods are available for the construction of multivariate reference regions, since traditional multivariate statistical regions (e.g. confidence, prediction, and tolerance regions) are not constructed based on a hyperrectangular geometry. The present work addresses this problem by developing multivariate hyperrectangular nonparametric tolerance regions for setting the reference regions. The approach utilizes statistical data depth to determine which points to trim and then the extremes of the trimmed dataset are used as the faces of the hyperrectangular region. Also presented is a strategy for determining the number of points to trim based on previously established asymptotic results. An extensive coverage study shows the favorable performance of the proposed procedure for moderate to large sample sizes. The procedure is applied to obtain reference regions for addressing two important clinical problems: (1) assessing kidney function in adolescents and (2) characterizing insulin-like growth factor concentrations in the serum of adults.
参考区间在临床化学和检验医学中被广泛用于解释患者生化或生理检测的结果。文献中有成熟的单变量测量参考限确定方法;然而,构建多变量参考区间的方法有限,因为传统的多变量统计区间(如置信区间、预测区间和容忍区间)并非基于超矩形几何构建。本研究通过开发用于设置参考区间的多变量超矩形非参数容忍区间来解决这一问题。该方法利用统计数据深度来确定要剔除哪些点,然后将剔除后数据集的极值用作超矩形区域的边界。还提出了一种基于先前确立的渐近结果来确定剔除点数的策略。一项广泛的覆盖研究表明,对于中等到大样本量,所提出的方法具有良好的性能。该方法被应用于获取参考区间,以解决两个重要的临床问题:(1)评估青少年的肾功能;(2)表征成年人血清中胰岛素样生长因子的浓度。