Center for Health Research-Southeast, Kaiser Permanente, Atlanta, GA.
Health Serv Res. 2014 Feb;49(1):268-83. doi: 10.1111/1475-6773.12089. Epub 2013 Jul 16.
To validate classification of race/ethnicity based on the Bayesian Improved Surname Geocoding method (BISG) and assess variations in validity by gender and age.
DATA SOURCES/STUDY SETTING: Secondary data on members of Kaiser Permanente Georgia, an integrated managed care organization, through 2010.
For 191,494 members with self-reported race/ethnicity, probabilities for belonging to each of six race/ethnicity categories predicted from the BISG algorithm were used to assign individuals to a race/ethnicity category over a range of cutoffs greater than a probability of 0.50. Overall as well as gender- and age-stratified sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated. Receiver operating characteristic (ROC) curves were generated and used to identify optimal cutoffs for race/ethnicity assignment.
The overall cutoffs for assignment that optimized sensitivity and specificity ranged from 0.50 to 0.57 for the four main racial/ethnic categories (White, Black, Asian/Pacific Islander, Hispanic). Corresponding sensitivity, specificity, PPV, and NPV ranged from 64.4 to 81.4 percent, 80.8 to 99.7 percent, 75.0 to 91.6 percent, and 79.4 to 98.0 percent, respectively. Accuracy of assignment was better among males and individuals of 65 years or older.
BISG may be useful for classifying race/ethnicity of health plan members when needed for health care studies.
验证基于贝叶斯改进姓氏地理编码方法(BISG)的种族/民族分类,并评估性别和年龄对其有效性的影响。
数据来源/研究环境:2010 年通过凯撒永久医疗集团佐治亚州的综合管理式医疗组织成员的二级数据。
对于 191494 名自我报告种族/民族的成员,使用 BISG 算法预测的属于六个种族/民族类别的概率来分配个体的种族/民族类别,分类阈值范围大于 0.50 的概率。计算了总体以及性别和年龄分层的敏感性、特异性、阳性预测值(PPV)和阴性预测值(NPV)。生成了接收器操作特征(ROC)曲线,并用于确定种族/民族分配的最佳分类阈值。
优化敏感性和特异性的总体分配阈值范围为 0.50 至 0.57,适用于四个主要种族/民族类别(白种人、黑种人、亚裔/太平洋岛民、西班牙裔/拉丁裔)。相应的敏感性、特异性、PPV 和 NPV 范围分别为 64.4%至 81.4%、80.8%至 99.7%、75.0%至 91.6%和 79.4%至 98.0%。在男性和 65 岁及以上的个体中,分配的准确性更高。
BISG 可能有助于在需要进行医疗保健研究时对健康计划成员的种族/民族进行分类。