两种主观肤色量表的有效性及其对医疗保健模式公平性的影响。

Validity of two subjective skin tone scales and its implications on healthcare model fairness.

作者信息

Cu Cassandra W, Dundas Nicole E, Heintz Timothy, Sheikh Zahida A, Alonso-Bermudez Bianca, Walker Jasmine, Wooten Avery, Badathala Anusha, Chapman Allyson, Ehie Odinakachukwu, Raghunathan Karthik, Mills Hunter, Espejo Edie, Boscardin John, Wallace Arthur W, Cobert Julien

机构信息

School of Medicine, Tufts University School of Medicine, Boston, MA, USA.

UC Berkeley Department of Bioengineering, Berkeley, CA, USA.

出版信息

NPJ Digit Med. 2025 Oct 3;8(1):595. doi: 10.1038/s41746-025-01975-7.

DOI:10.1038/s41746-025-01975-7

PMID:41044148

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12494915/

Abstract

Skin tone assessments are critical for fairness evaluation in healthcare algorithms (e.g., pulse oximetry) but lack validation. Using prospectively collected facial images from 90 hospitalized adults at the San Francisco VA, three independent annotators rated facial regions in triplicate using Fitzpatrick (I-VI) and Monk (1-10) skin tone scales. Patients also self-identified their skin tone. Annotator confidence was recorded using 5-point Likert scales. Across 810 images in 90 patients (9 images each), within-rater agreement was high, but inter-annotator agreement was moderate to low. Annotators frequently rated patients as darker when patients self-identified as lighter, and lighter when patients self-identified as darker. In linear mixed-effects models controlling for facial region and annotator confidence, darker self-reported skin tones were associated with lighter annotator scores. These findings highlight challenges in consistent skin tone labeling and suggest that current methods for assessing representation in biosensor-based algorithm studies may be influenced by labeling bias.

摘要

肤色评估对于医疗保健算法（如脉搏血氧饱和度测定）中的公平性评估至关重要，但缺乏验证。利用从旧金山退伍军人事务部前瞻性收集的90名住院成年人的面部图像，三名独立注释者使用菲茨帕特里克（I-VI）和蒙克（1-10）肤色量表对面部区域进行了三次评分。患者也自行确定了自己的肤色。使用5点李克特量表记录注释者的信心。在90名患者的810张图像（每人9张）中，评分者内部一致性较高，但注释者之间的一致性为中度至低度。当患者自行确定肤色较浅时，注释者经常将其评为较深；而当患者自行确定肤色较深时，注释者则将其评为较浅。在控制面部区域和注释者信心的线性混合效应模型中，自我报告的较深肤色与注释者较低的评分相关。这些发现凸显了在一致的肤色标注方面的挑战，并表明当前基于生物传感器的算法研究中评估代表性的方法可能受到标注偏差的影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b3c/12494915/869ddec697fb/41746_2025_1975_Fig1_HTML.jpg

相似文献

Validity of two subjective skin tone scales and its implications on healthcare model fairness.两种主观肤色量表的有效性及其对医疗保健模式公平性的影响。

NPJ Digit Med. 2025 Oct 3;8(1):595. doi: 10.1038/s41746-025-01975-7.

Mid Forehead Brow Lift额中眉提升术

Prescription of Controlled Substances: Benefits and Risks管制药品的处方：益处与风险

Shoulder Arthrogram肩关节造影

Vesicoureteral Reflux膀胱输尿管反流

Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施：系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。

Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.

Healthcare workers' informal uses of mobile phones and other mobile devices to support their work: a qualitative evidence synthesis.医护人员非正规使用手机和其他移动设备来支持工作：定性证据综合评价。

Cochrane Database Syst Rev. 2024 Aug 27;8(8):CD015705. doi: 10.1002/14651858.CD015705.pub2.

Interventions for preventing falls in older people in care facilities.护理机构中预防老年人跌倒的干预措施。

Cochrane Database Syst Rev. 2025 Aug 20;8:CD016064. doi: 10.1002/14651858.CD016064.

Drugs for preventing postoperative nausea and vomiting in adults after general anaesthesia: a network meta-analysis.成人全身麻醉后预防术后恶心呕吐的药物：网状Meta分析

Cochrane Database Syst Rev. 2020 Oct 19;10(10):CD012859. doi: 10.1002/14651858.CD012859.pub2.

The agreement of phonetic transcriptions between paediatric speech and language therapists transcribing a disordered speech sample.儿科言语和语言治疗师转写语音样本的音标转录的一致性。

Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1981-1995. doi: 10.1111/1460-6984.13043. Epub 2024 Jun 8.

本文引用的文献

Preliminary Development and Validation of Automated Nociception Recognition Using Computer Vision in Perioperative Patients.

Anesthesiology. 2025 Apr 1;142(4):726-737. doi: 10.1097/ALN.0000000000005370. Epub 2025 Jan 13.

Adherence to FDA Guidance on Pulse Oximetry Testing Among Diverse Individuals, 1996-2024.1996 - 2024年不同人群对美国食品药品监督管理局脉搏血氧饱和度检测指南的遵循情况。

JAMA. 2025 Feb 18;333(7):631-632. doi: 10.1001/jama.2024.26473.

A survey of skin tone assessment in prospective research.前瞻性研究中肤色评估的调查。

NPJ Digit Med. 2024 Jul 17;7(1):191. doi: 10.1038/s41746-024-01176-8.

Considerations for the Use of Fitzpatrick Skin Type in Plastic Surgery Research.整形外科研究中使用菲茨帕特里克皮肤类型的考量因素。

Plast Reconstr Surg Glob Open. 2024 Jun 5;12(6):e5866. doi: 10.1097/GOX.0000000000005866. eCollection 2024 Jun.

Skin Pigmentation and Pulse Oximeter Accuracy in the Intensive Care Unit: A Pilot Prospective Study.重症监护病房中的皮肤色素沉着与脉搏血氧仪准确性：一项前瞻性试点研究。

Am J Respir Crit Care Med. 2024 Aug 1;210(3):355-358. doi: 10.1164/rccm.202401-0036LE.

Skin Tone Estimation under Diverse Lighting Conditions.不同光照条件下的肤色估计

J Imaging. 2024 Apr 30;10(5):109. doi: 10.3390/jimaging10050109.

Clinical Outcomes Associated With Overestimation of Oxygen Saturation by Pulse Oximetry in Patients Hospitalized With COVID-19.脉搏血氧饱和度测量值高估与 COVID-19 住院患者临床结局的相关性。

JAMA Netw Open. 2023 Aug 1;6(8):e2330856. doi: 10.1001/jamanetworkopen.2023.30856.

Race and Ethnic Categories: A Brief Review of Global Terms and Nomenclature.种族和族裔类别：全球术语和命名法简述

Cureus. 2023 Jul 1;15(7):e41253. doi: 10.7759/cureus.41253. eCollection 2023 Jul.

Diagnostic Errors, Health Disparities, and Artificial Intelligence: A Combination for Health or Harm?诊断错误、健康差异与人工智能：是健康助力还是危害之源？

JAMA Health Forum. 2021 Sep 3;2(9):e212430. doi: 10.1001/jamahealthforum.2021.2430.

Algorithmic fairness in computational medicine.计算医学中的算法公平性。

EBioMedicine. 2022 Oct;84:104250. doi: 10.1016/j.ebiom.2022.104250. Epub 2022 Sep 6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

两种主观肤色量表的有效性及其对医疗保健模式公平性的影响。

Validity of two subjective skin tone scales and its implications on healthcare model fairness.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献