• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过众包软目标标签训练情感计算机视觉模型。

Training Affective Computer Vision Models by Crowdsourcing Soft-Target Labels.

作者信息

Washington Peter, Kalantarian Haik, Kent Jack, Husic Arman, Kline Aaron, Leblanc Emilie, Hou Cathy, Mutlu Cezmi, Dunlap Kaitlyn, Penev Yordan, Stockham Nate, Chrisman Brianna, Paskov Kelley, Jung Jae-Yoon, Voss Catalin, Haber Nick, Wall Dennis P

机构信息

Department of Bioengineering, Stanford University.

Department of Pediatrics (Systems Medicine), Stanford University.

出版信息

Cognit Comput. 2021 Sep;13(5):1363-1373. doi: 10.1007/s12559-021-09936-4. Epub 2021 Sep 27.

DOI:10.1007/s12559-021-09936-4
PMID:35669554
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9165031/
Abstract

BACKGROUND/INTRODUCTION: Emotion detection classifiers traditionally predict discrete emotions. However, emotion expressions are often subjective, thus requiring a method to handle compound and ambiguous labels. We explore the feasibility of using crowdsourcing to acquire reliable soft-target labels and evaluate an emotion detection classifier trained with these labels. We hypothesize that training with labels that are representative of the diversity of human interpretation of an image will result in predictions that are similarly representative on a disjoint test set. We also hypothesize that crowdsourcing can generate distributions which mirror those generated in a lab setting.

METHODS

We center our study on the Child Affective Facial Expression (CAFE) dataset, a gold standard collection of images depicting pediatric facial expressions along with 100 human labels per image. To test the feasibility of crowdsourcing to generate these labels, we used Microworkers to acquire labels for 207 CAFE images. We evaluate both unfiltered workers as well as workers selected through a short crowd filtration process. We then train two versions of a ResNet-152 neural network on soft-target CAFE labels using the original 100 annotations provided with the dataset: (1) a classifier trained with traditional one-hot encoded labels, and (2) a classifier trained with vector labels representing the distribution of CAFE annotator responses. We compare the resulting softmax output distributions of the two classifiers with a 2-sample independent t-test of L1 distances between the classifier's output probability distribution and the distribution of human labels.

RESULTS

While agreement with CAFE is weak for unfiltered crowd workers, the filtered crowd agree with the CAFE labels 100% of the time for happy, neutral, sad and "fear + surprise", and 88.8% for "anger + disgust". While the F1-score for a one-hot encoded classifier is much higher (94.33% vs. 78.68%) with respect to the ground truth CAFE labels, the output probability vector of the crowd-trained classifier more closely resembles the distribution of human labels (t=3.2827, p=0.0014).

CONCLUSIONS

For many applications of affective computing, reporting an emotion probability distribution that accounts for the subjectivity of human interpretation can be more useful than an absolute label. Crowdsourcing, including a sufficient filtering mechanism for selecting reliable crowd workers, is a feasible solution for acquiring soft-target labels.

摘要

背景/引言:传统的情感检测分类器预测离散情感。然而,情感表达往往具有主观性,因此需要一种方法来处理复合和模糊的标签。我们探讨了使用众包来获取可靠的软目标标签的可行性,并评估了使用这些标签训练的情感检测分类器。我们假设,使用代表人类对图像解释多样性的标签进行训练,将在不相交的测试集上产生同样具有代表性的预测。我们还假设众包可以生成与实验室环境中生成的分布相似的分布。

方法

我们的研究以儿童情感面部表情(CAFE)数据集为中心,这是一个描绘儿童面部表情的图像的黄金标准集合,每个图像还有100个人类标签。为了测试众包生成这些标签的可行性,我们使用Microworkers为207张CAFE图像获取标签。我们评估了未经过滤的工人以及通过简短的人群过滤过程选择的工人。然后,我们使用数据集中提供的原始100个注释,在软目标CAFE标签上训练了两个版本的ResNet-152神经网络:(1)一个使用传统的独热编码标签训练的分类器,(2)一个使用表示CAFE注释者响应分布的向量标签训练的分类器。我们使用分类器输出概率分布与人类标签分布之间的L1距离的双样本独立t检验,比较了两个分类器的softmax输出分布。

结果

虽然未经过滤的人群工人与CAFE的一致性较弱,但经过过滤的人群在快乐、中性、悲伤和“恐惧+惊讶”方面100%与CAFE标签一致,在“愤怒+厌恶”方面88.8%一致。虽然相对于真实的CAFE标签,独热编码分类器的F1分数要高得多(94.33%对78.68%),但人群训练分类器的输出概率向量更接近人类标签的分布(t=3.2827,p=0.0014)。

结论

对于情感计算的许多应用,报告一个考虑人类解释主观性的情感概率分布可能比绝对标签更有用。众包,包括一个用于选择可靠人群工人的充分过滤机制,是获取软目标标签的可行解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5584/9165031/1a78bd8dd99e/nihms-1751015-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5584/9165031/cb1876a2e350/nihms-1751015-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5584/9165031/7fa129c6bdb8/nihms-1751015-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5584/9165031/40c028a71108/nihms-1751015-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5584/9165031/1a78bd8dd99e/nihms-1751015-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5584/9165031/cb1876a2e350/nihms-1751015-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5584/9165031/7fa129c6bdb8/nihms-1751015-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5584/9165031/40c028a71108/nihms-1751015-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5584/9165031/1a78bd8dd99e/nihms-1751015-f0004.jpg

相似文献

1
Training Affective Computer Vision Models by Crowdsourcing Soft-Target Labels.通过众包软目标标签训练情感计算机视觉模型。
Cognit Comput. 2021 Sep;13(5):1363-1373. doi: 10.1007/s12559-021-09936-4. Epub 2021 Sep 27.
2
Improved Digital Therapy for Developmental Pediatrics Using Domain-Specific Artificial Intelligence: Machine Learning Study.使用特定领域人工智能改进发育儿科学数字疗法:机器学习研究
JMIR Pediatr Parent. 2022 Apr 8;5(2):e26760. doi: 10.2196/26760.
3
Training and Profiling a Pediatric Facial Expression Classifier for Children on Mobile Devices: Machine Learning Study.在移动设备上为儿童训练和分析儿科面部表情分类器:机器学习研究
JMIR Form Res. 2023 Mar 21;7:e39917. doi: 10.2196/39917.
4
Gamified Crowdsourcing as a Novel Approach to Lung Ultrasound Data Set Labeling: Prospective Analysis.游戏化众包作为一种新型的肺部超声数据集标注方法:前瞻性分析。
J Med Internet Res. 2024 Jul 4;26:e51397. doi: 10.2196/51397.
5
Crowdsourcing for Machine Learning in Public Health Surveillance: Lessons Learned From Amazon Mechanical Turk.公共卫生监测中机器学习的众包:从亚马逊土耳其机器人学到的经验教训。
J Med Internet Res. 2022 Jan 18;24(1):e28749. doi: 10.2196/28749.
6
Active Crowdsourcing for Multilabel Annotation.用于多标签标注的主动众包
IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):3549-3559. doi: 10.1109/TNNLS.2022.3194022. Epub 2024 Feb 29.
7
A study of crowdsourced segment-level surgical skill assessment using pairwise rankings.一项使用成对排名进行众包的节段级手术技能评估研究。
Int J Comput Assist Radiol Surg. 2015 Sep;10(9):1435-47. doi: 10.1007/s11548-015-1238-6. Epub 2015 Jun 30.
8
CrowdTeacher: Robust Co-teaching with Noisy Answers and Sample-Specific Perturbations for Tabular Data.群体教师:针对表格数据,通过带噪声答案和特定样本扰动进行稳健协同教学
Adv Knowl Discov Data Min. 2021 May;12713:181-193. doi: 10.1007/978-3-030-75765-6_15. Epub 2021 May 8.
9
Reliability of crowdsourcing as a method for collecting emotions labels on pictures.众包作为一种在图片上收集情感标签的方法的可靠性。
BMC Res Notes. 2019 Oct 30;12(1):715. doi: 10.1186/s13104-019-4764-4.
10
Validity of Online Screening for Autism: Crowdsourcing Study Comparing Paid and Unpaid Diagnostic Tasks.自闭症在线筛查的有效性:比较付费和免费诊断任务的众包研究
J Med Internet Res. 2019 May 23;21(5):e13668. doi: 10.2196/13668.

引用本文的文献

1
Digitally Diagnosing Multiple Developmental Delays Using Crowdsourcing Fused With Machine Learning: Protocol for a Human-in-the-Loop Machine Learning Study.利用众包与机器学习相结合的方法进行数字诊断多种发育迟缓:人在回路机器学习研究方案
JMIR Res Protoc. 2024 Feb 8;13:e52205. doi: 10.2196/52205.
2
Classifying Autism From Crowdsourced Semistructured Speech Recordings: Machine Learning Model Comparison Study.从众包半结构化语音记录中分类自闭症:机器学习模型比较研究。
JMIR Pediatr Parent. 2022 Apr 14;5(2):e35406. doi: 10.2196/35406.

本文引用的文献

1
Crowdsourced privacy-preserved feature tagging of short home videos for machine learning ASD detection.用于机器学习自闭症谱系障碍检测的短家庭视频的众包隐私保护特征标记
Sci Rep. 2021 Apr 7;11(1):7620. doi: 10.1038/s41598-021-87059-4.
2
Selection of trustworthy crowd workers for telemedical diagnosis of pediatric autism spectrum disorder.选择可信赖的众包工人进行儿科自闭症谱系障碍的远程医疗诊断。
Pac Symp Biocomput. 2021;26:14-25.
3
Guess What?: Towards Understanding Autism from Structured Video Using Facial Affect.你猜怎么着?:利用面部表情从结构化视频理解自闭症。
J Healthc Inform Res. 2019;3(1):43-66. doi: 10.1007/s41666-018-0034-9. Epub 2018 Oct 2.
4
Feature replacement methods enable reliable home video analysis for machine learning detection of autism.特征替换方法可实现可靠的家庭视频分析,从而支持机器学习进行自闭症检测。
Sci Rep. 2020 Dec 4;10(1):21245. doi: 10.1038/s41598-020-76874-w.
5
Precision Telemedicine through Crowdsourced Machine Learning: Testing Variability of Crowd Workers for Video-Based Autism Feature Recognition.通过众包机器学习实现精准远程医疗:测试众包工作者在基于视频的自闭症特征识别中的变异性。
J Pers Med. 2020 Aug 13;10(3):86. doi: 10.3390/jpm10030086.
6
A Mobile Game for Automatic Emotion-Labeling of Images.一款用于图像自动情感标注的手机游戏。
IEEE Trans Games. 2020 Jun;12(2):213-218. doi: 10.1109/tg.2018.2877325. Epub 2018 Oct 22.
7
The Performance of Emotion Classifiers for Children With Parent-Reported Autism: Quantitative Feasibility Study.针对家长报告的自闭症儿童的情感分类器性能:定量可行性研究。
JMIR Ment Health. 2020 Apr 1;7(4):e13174. doi: 10.2196/13174.
8
Multi-modular AI Approach to Streamline Autism Diagnosis in Young Children.多模块人工智能方法简化幼儿自闭症诊断
Sci Rep. 2020 Mar 19;10(1):5014. doi: 10.1038/s41598-020-61213-w.
9
Data-Driven Diagnostics and the Potential of Mobile Artificial Intelligence for Digital Therapeutic Phenotyping in Computational Psychiatry.数据驱动的诊断与移动人工智能在计算精神病学中数字化治疗表型分析的潜力
Biol Psychiatry Cogn Neurosci Neuroimaging. 2020 Aug;5(8):759-769. doi: 10.1016/j.bpsc.2019.11.015. Epub 2019 Dec 13.
10
Feature Selection and Dimension Reduction of Social Autism Data.社会自闭症数据的特征选择与降维。
Pac Symp Biocomput. 2020;25:707-718.