RAND Corporation, Pittsburgh, PA.
Kaiser Permanente Center for Effectiveness & Safety Research.
Med Care. 2022 Aug 1;60(8):556-562. doi: 10.1097/MLR.0000000000001732. Epub 2022 May 16.
Data on race-and-ethnicity that are needed to measure health equity are often limited or missing. The importance of first name and sex in predicting race-and-ethnicity is not well understood.
The objective of this study was to compare the contribution of first-name information to the accuracy of basic and more complex racial-and-ethnic imputations that incorporate surname information.
We imputed race-and-ethnicity in a sample of Medicare beneficiaries under 2 scenarios: (1) with only sparse predictors (name, address, sex) and (2) with a rich set (adding limited administrative race-and-ethnicity, demographics, and insurance).
A total of 284,627 Medicare beneficiaries who completed the 2014 Medicare Consumer Assessment of Healthcare Providers and Systems survey and reported race-and-ethnicity were included.
Hispanic, non-Hispanic Asian/Pacific Islander, and non-Hispanic White racial-and-ethnic imputations are more accurate for males than females under both sparse-predictor and rich-predictor scenarios; adding first-name information increases accuracy more for females than males. In contrast, imputations of non-Hispanic Black race-and-ethnicity are similarly accurate for females and males, and first names increase accuracy equally for each sex in both sparse-predictor and rich-predictor scenarios. For all 4 racial-and-ethnic groups, incorporating first-name information improves prediction accuracy more under the sparse-predictor scenario than under the rich-predictor scenario.
First-name information contributes more to the accuracy of racial-and-ethnic imputations in a sparse-predictor scenario than in a rich-predictor scenario and generally narrows sex gaps in accuracy of imputations.
衡量健康公平所需的种族和民族数据通常有限或缺失。名字和性别在预测种族和民族方面的重要性尚未得到充分理解。
本研究旨在比较名字信息对基本和更复杂种族和民族推断准确性的贡献,这些推断纳入了姓氏信息。
我们在两种情况下对医疗保险受益人样本进行了种族和民族推断:(1)仅使用稀疏预测因子(姓名、地址、性别),(2)使用丰富的预测因子(增加有限的行政种族和民族、人口统计学和保险信息)。
共有 284627 名完成了 2014 年医疗保险消费者评估医疗保健提供者和系统调查并报告了种族和民族的医疗保险受益人被纳入研究。
在稀疏预测因子和丰富预测因子两种情况下,男性的西班牙裔、非西班牙裔亚裔/太平洋岛民和非西班牙裔白人种族和民族推断准确性均高于女性;增加名字信息可提高女性的准确性,甚于男性。相比之下,非西班牙裔黑人种族和民族推断在女性和男性中具有相似的准确性,且名字信息在稀疏预测因子和丰富预测因子两种情况下都能同等程度地提高每个性别的准确性。对于所有 4 个种族和民族群体,在稀疏预测因子情况下,纳入名字信息可提高推断准确性,甚于在丰富预测因子情况下。
在稀疏预测因子情况下,名字信息对种族和民族推断的准确性贡献大于丰富预测因子情况,并且通常会缩小推断准确性方面的性别差距。