Division of Research and Optimal Patient Care, American College of Surgeons, Chicago, IL.
Division of Research and Optimal Patient Care, American College of Surgeons, Chicago, IL.
J Am Coll Surg. 2017 May;224(5):787-795e1. doi: 10.1016/j.jamcollsurg.2016.12.057. Epub 2017 Apr 4.
The American College of Surgeons NSQIP offers a Surgical Risk Calculator (SRC) that provides detailed, patient-level, risk assessments for many adverse outcomes to surgeons, patients, and the general public. The SRC calculator was designed to help guide discussion and decisions by providing generally applicable (not hospital-specific) information about surgical risk using easily understood and broadly available preoperative variables. Although large, internal evaluations have shown that the SRC has good accuracy (model discrimination and calibration), external validations have been inconsistent and tend to favor a conclusion of inadequate performance.
External studies, attempting to validate the SRC, were examined with respect to 3 design features: sample size (small samples reduce reliability), case-mix homogeneity (homogeneity reduces discrimination); and number of institutions providing data (few institutions reduces generalizability). The impact of each feature was then examined in several sets of simulation studies.
Each of the 3 design features has the potential to act as an artifactual cause for apparent SRC predictive failure. In addition, demonstrations that SRC estimates are inferior to those from models that use additional (sometimes operation-specific) predictor variables were seen as not relevant with respect to the SRC's intended scope.
The SRC predictive failures, reported by studies with the described design limitations, should not be misunderstood as disqualifying the SRC as an accurate and appropriate tool for its intended purpose of providing a general purpose risk calculator, applicable across many surgical domains, using easily understood and generally available predictive information.
美国外科医师学院 NSQIP 提供了一个外科风险计算器 (SRC),为外科医生、患者和公众提供了许多不良结果的详细、患者水平的风险评估。SRC 计算器旨在通过使用易于理解和广泛可用的术前变量提供关于手术风险的一般适用(非医院特定)信息,帮助指导讨论和决策。尽管进行了大量的内部评估,但 SRC 的准确性(模型区分度和校准度)良好,外部验证结果却不一致,往往倾向于认为其性能不足。
针对 SRC 的外部验证研究,从以下 3 个设计特征进行了检查:样本量(样本量小会降低可靠性)、病例组合同质性(同质性降低区分度)和提供数据的机构数量(机构数量少会降低通用性)。然后在几组模拟研究中检查了每个特征的影响。
这 3 个设计特征中的每一个都有可能成为 SRC 预测失败的人为原因。此外,认为 SRC 估计值不如使用额外(有时是特定于手术的)预测变量的模型的估计值,这与 SRC 的预期用途无关。
在具有描述性设计限制的研究中报告的 SRC 预测失败,不应被误解为 SRC 作为一种准确和适当的工具,用于提供一种通用目的的风险计算器,适用于许多外科领域,使用易于理解和普遍可用的预测信息。