基于自然语言处理的 YouTube 字幕在视频人体姿态分析中标记数据的自动生成

Automatic Generation of Labeled Data for Video-Based Human Pose Analysis via NLP applied to YouTube Subtitles.

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10340044.

DOI:10.1109/EMBC40787.2023.10340044

Abstract

With recent advancements in computer vision as well as machine learning (ML), video-based at-home exercise evaluation systems have become a popular topic of current research. However, performance depends heavily on the amount of available training data. Since labeled datasets specific to exercising are rare, we propose a method that makes use of the abundance of fitness videos available online. Specifically, we utilize the advantage that videos often not only show the exercises, but also provide language as an additional source of information. With push-ups as an example, we show that through the analysis of subtitle data using natural language processing (NLP), it is possible to create a labeled (irrelevant, relevant correct, relevant incorrect) dataset containing relevant information for pose analysis. In particular, we show that irrelevant clips (n = 332) have significantly different joint visibility values compared to relevant clips (n = 298). Inspecting cluster centroids also show different poses for the different classes.

摘要

随着计算机视觉和机器学习（ML）的最新进展，基于视频的家庭锻炼评估系统已成为当前研究的热门话题。然而，其性能在很大程度上取决于可用的训练数据量。由于针对锻炼的标记数据集很少，因此我们提出了一种利用大量在线健身视频的方法。具体来说，我们利用视频不仅通常显示锻炼，而且还提供语言作为额外信息源的优势。以俯卧撑为例，我们通过使用自然语言处理（NLP）分析字幕数据，展示了创建包含姿势分析相关信息的标记（不相关、相关正确、相关错误）数据集的可能性。特别是，我们表明，与相关剪辑（n = 298）相比，不相关剪辑（n = 332）的关节可见度值差异显著。检查聚类中心还表明，不同类别的姿势不同。

相似文献

Automatic Generation of Labeled Data for Video-Based Human Pose Analysis via NLP applied to YouTube Subtitles.基于自然语言处理的 YouTube 字幕在视频人体姿态分析中标记数据的自动生成

Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10340044.

Automatic Identification of Hate Speech - A Case-Study of alt-Right YouTube Videos.自动识别仇恨言论 - 以 alt-Right 油管视频为例的研究

F1000Res. 2024 Apr 23;13:328. doi: 10.12688/f1000research.147107.1. eCollection 2024.

Ensembles of natural language processing systems for portable phenotyping solutions.用于便携表型解决方案的自然语言处理系统集合。

J Biomed Inform. 2019 Dec;100:103318. doi: 10.1016/j.jbi.2019.103318. Epub 2019 Oct 23.

The reliability, functional quality, understandability, and actionability of fall prevention content in YouTube: an observational study.YouTube 中预防跌倒内容的可靠性、功能质量、可理解性和可操作性：一项观察性研究。

BMC Geriatr. 2022 Aug 9;22(1):654. doi: 10.1186/s12877-022-03330-x.

Quality of English-language videos available on YouTube as a source of information on osteoporosis.YouTube 上关于骨质疏松症的英文视频的质量。

Arch Osteoporos. 2022 Jan 20;17(1):19. doi: 10.1007/s11657-022-01064-2.

Consulting "Dr. YouTube": an objective evaluation of hypospadias videos on a popular video-sharing website.咨询“Dr. YouTube”：对热门视频分享网站上的尿道下裂视频的客观评估。

J Pediatr Urol. 2020 Feb;16(1):70.e1-70.e9. doi: 10.1016/j.jpurol.2019.11.011. Epub 2019 Dec 4.

YouTube Videos Related to the Fukushima Nuclear Disaster: Content Analysis.YouTube 上与福岛核灾难相关的视频：内容分析。

JMIR Public Health Surveill. 2021 Jun 7;7(6):e26481. doi: 10.2196/26481.

YouTube as a source of patient information for ankylosing spondylitis exercises.YouTube 作为强直性脊柱炎运动患者信息的来源。

Clin Rheumatol. 2019 Jun;38(6):1747-1751. doi: 10.1007/s10067-018-04413-0. Epub 2019 Jan 15.

The usefulness and validity of English-language videos on YouTube as an educational resource for spondyloarthritis.YouTube 上的英文视频作为脊柱关节炎教育资源的有用性和有效性。

Clin Rheumatol. 2021 Apr;40(4):1567-1573. doi: 10.1007/s10067-020-05377-w. Epub 2020 Sep 2.

A study on users' preference towards diabetes-related video clips on YouTube.YouTube 上糖尿病相关视频剪辑用户偏好的研究。

BMC Med Inform Decis Mak. 2020 Feb 28;20(1):43. doi: 10.1186/s12911-020-1035-1.

引用本文的文献

Accuracy Evaluation of 3D Pose Reconstruction Algorithms Through Stereo Camera Information Fusion for Physical Exercises with MediaPipe Pose.通过基于MediaPipe姿态的立体相机信息融合对三维姿态重建算法进行准确性评估以用于体育锻炼

Sensors (Basel). 2024 Dec 4;24(23):7772. doi: 10.3390/s24237772.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于自然语言处理的 YouTube 字幕在视频人体姿态分析中标记数据的自动生成

Automatic Generation of Labeled Data for Video-Based Human Pose Analysis via NLP applied to YouTube Subtitles.

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献