Weichle Thomas, Hynes Denise M, Durazo-Arvizu Ramon, Tarlov Elizabeth, Zhang Qiuying
Center of Innovation for Complex Chronic Healthcare, Edward Hines, Jr. VA Hospital, Hines, IL 60141 USA ; VA Information Resource Center, Edward Hines, Jr. VA Hospital, Hines, IL 60141 USA.
Springerplus. 2013 Nov 18;2:614. doi: 10.1186/2193-1801-2-614. eCollection 2013.
The distributions of medical costs are often skewed to the right because small numbers of patients use large amounts of health care resources. Using data from a study of colon cancer costs, we show, by example, the impact and magnitude of outliers and influential observations on health care costs and compared the effects of statistical costing methods for addressing the disproportionate influence of outliers and influential observations. We used data from a retrospective cohort study of 3,842 elderly veterans with colon cancer who were enrolled in and used health care from, both the Department of Veterans Affairs and Medicare in 1999-2004. After calculating the average colon cancer episode cost and distribution for the full cohort, we used box-plot methods, Winsorization, DFBETAs, and Cook's distance to identify and assess or adjust the outlying and/or influential observations. The number of observations identified as outlying and/or influential ranged from 13 when the predicted DFBETA measurement was greater than 0.15 and the observation was a qualified box-plot outlier to 384 cases using the Winsorization method at the 5th and 95th percentiles. Average costs of colon cancer episodes using these methods were similar. The method of choice from the results of this particular analysis can be conditionally based on whether the purpose is to control only for influential observations or to simultaneously control for outliers and influential observations. Understanding how estimates could change with each approach is important in assessing the impact of a particular method on the results.
医疗费用的分布往往向右偏斜,因为少数患者使用了大量的医疗资源。通过一项结肠癌成本研究的数据,我们举例说明了异常值和有影响的观测值对医疗费用的影响及其程度,并比较了统计成本核算方法在处理异常值和有影响的观测值的不成比例影响方面的效果。我们使用了一项回顾性队列研究的数据,该研究涉及1999 - 2004年期间登记并使用退伍军人事务部和医疗保险的3842名老年退伍军人结肠癌患者。在计算了整个队列的平均结肠癌发病成本和分布后,我们使用箱线图方法、缩尾法、DFBETAs和库克距离来识别、评估或调整异常和/或有影响的观测值。被确定为异常和/或有影响的观测值数量范围从预测的DFBETA测量值大于0.15且该观测值是合格的箱线图异常值时的13个,到使用第5和第95百分位数的缩尾法时的384例。使用这些方法的结肠癌发病平均成本相似。根据该特定分析结果选择的方法可以有条件地基于目的是仅控制有影响的观测值还是同时控制异常值和有影响的观测值。了解每种方法的估计如何变化对于评估特定方法对结果的影响很重要。