• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多模态深度学习的在线过程阶段检测

Online Process Phase Detection Using Multimodal Deep Learning.

作者信息

Li Xinyu, Zhang Yanyi, Li Mengzhu, Chen Shuhong, Austin Farneth R, Marsic Ivan, Burd Randall S

机构信息

Rutgers University, Piscataway, NJ, USA.

Children's National Medical Center, Washington, DC, USA.

出版信息

Ubiquitous Comput Electron Mob Commun Conf (UEMCON) IEEE Annu. 2016 Oct;2016. doi: 10.1109/UEMCON.2016.7777912. Epub 2016 Dec 12.

DOI:10.1109/UEMCON.2016.7777912
PMID:30357017
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6196033/
Abstract

We present a multimodal deep-learning structure that automatically predicts phases of the trauma resuscitation process in real-time. The system first pre-processes the audio and video streams captured by a Kinect's built-in microphone array and depth sensor. A multimodal deep learning structure then extracts video and audio features, which are later combined through a "slow fusion" model. The final decision is then made from the combined features through a modified softmax classification layer. The model was trained on 20 trauma resuscitation cases (>13 hours), and was tested on 5 other cases. Our results showed over 80% online detection accuracy with 0.7 F-Score, outperforming previous systems.

摘要

我们提出了一种多模态深度学习结构,可实时自动预测创伤复苏过程的阶段。该系统首先对由Kinect的内置麦克风阵列和深度传感器捕获的音频和视频流进行预处理。然后,多模态深度学习结构提取视频和音频特征,随后通过“慢融合”模型将这些特征进行组合。最后,通过修改后的softmax分类层根据组合后的特征做出最终决策。该模型在20个创伤复苏病例(超过13小时)上进行了训练,并在其他5个病例上进行了测试。我们的结果显示在线检测准确率超过80%,F值为0.7,优于先前的系统。

相似文献

1
Online Process Phase Detection Using Multimodal Deep Learning.基于多模态深度学习的在线过程阶段检测
Ubiquitous Comput Electron Mob Commun Conf (UEMCON) IEEE Annu. 2016 Oct;2016. doi: 10.1109/UEMCON.2016.7777912. Epub 2016 Dec 12.
2
Real-time medical phase recognition using long-term video understanding and progress gate method.基于长期视频理解和进度门方法的实时医学相位识别。
Med Image Anal. 2021 Dec;74:102224. doi: 10.1016/j.media.2021.102224. Epub 2021 Sep 3.
3
Language-Based Process Phase Detection in the Trauma Resuscitation.创伤复苏中基于语言的过程阶段检测
Proc (IEEE Int Conf Healthc Inform). 2017 Aug;2017:239-247. doi: 10.1109/ICHI.2017.50. Epub 2017 Sep 14.
4
Speech Intention Classification with Multimodal Deep Learning.基于多模态深度学习的语音意图分类
Adv Artif Intell. 2017 May;10233:260-271. doi: 10.1007/978-3-319-57351-9_30. Epub 2017 Apr 11.
5
Progress Estimation and Phase Detection for Sequential Processes.顺序过程的进度估计与阶段检测
Proc ACM Interact Mob Wearable Ubiquitous Technol. 2017 Sep;1(3). doi: 10.1145/3130936.
6
Poster Abstract: 3D Activity Localization With Multiple Sensors.海报摘要:利用多传感器进行三维活动定位
IPSN. 2017 Apr;2017:297-298. doi: 10.1145/3055031.3055057.
7
A Multimodal Feature Fusion-Based Deep Learning Method for Online Fault Diagnosis of Rotating Machinery.基于多模态特征融合的旋转机械在线故障诊断深度学习方法。
Sensors (Basel). 2018 Oct 18;18(10):3521. doi: 10.3390/s18103521.
8
Multimodal Attention Network for Trauma Activity Recognition from Spoken Language and Environmental Sound.用于从口语和环境声音中识别创伤活动的多模态注意力网络。
Proc (IEEE Int Conf Healthc Inform). 2019 Jun;2019. doi: 10.1109/ichi.2019.8904713. Epub 2019 Nov 21.
9
MildInt: Deep Learning-Based Multimodal Longitudinal Data Integration Framework.MildInt:基于深度学习的多模态纵向数据整合框架。
Front Genet. 2019 Jun 28;10:617. doi: 10.3389/fgene.2019.00617. eCollection 2019.
10
Fusion of Video and Inertial Sensing for Deep Learning-Based Human Action Recognition.视频与惯性传感器融合的深度学习人体动作识别
Sensors (Basel). 2019 Aug 24;19(17):3680. doi: 10.3390/s19173680.

引用本文的文献

1
Real-time Context-Aware Multimodal Network for Activity and Activity-Stage Recognition from Team Communication in Dynamic Clinical Settings.用于动态临床环境中团队沟通的活动及活动阶段识别的实时上下文感知多模态网络
Proc ACM Interact Mob Wearable Ubiquitous Technol. 2023 Mar;7(1). doi: 10.1145/3580798. Epub 2023 Mar 28.
2
State-of-the-art of situation recognition systems for intraoperative procedures.术中操作情境识别系统的最新技术。
Med Biol Eng Comput. 2022 Apr;60(4):921-939. doi: 10.1007/s11517-022-02520-4. Epub 2022 Feb 17.
3
Real-time medical phase recognition using long-term video understanding and progress gate method.基于长期视频理解和进度门方法的实时医学相位识别。
Med Image Anal. 2021 Dec;74:102224. doi: 10.1016/j.media.2021.102224. Epub 2021 Sep 3.
4
Multimodal Attention Network for Trauma Activity Recognition from Spoken Language and Environmental Sound.用于从口语和环境声音中识别创伤活动的多模态注意力网络。
Proc (IEEE Int Conf Healthc Inform). 2019 Jun;2019. doi: 10.1109/ichi.2019.8904713. Epub 2019 Nov 21.
5
Speech Intention Classification with Multimodal Deep Learning.基于多模态深度学习的语音意图分类
Adv Artif Intell. 2017 May;10233:260-271. doi: 10.1007/978-3-319-57351-9_30. Epub 2017 Apr 11.
6
Progress Estimation and Phase Detection for Sequential Processes.顺序过程的进度估计与阶段检测
Proc ACM Interact Mob Wearable Ubiquitous Technol. 2017 Sep;1(3). doi: 10.1145/3130936.
7
Poster Abstract: 3D Activity Localization With Multiple Sensors.海报摘要:利用多传感器进行三维活动定位
IPSN. 2017 Apr;2017:297-298. doi: 10.1145/3055031.3055057.
8
Region-based Activity Recognition Using Conditional GAN.基于条件生成对抗网络的区域活动识别
Proc ACM Int Conf Multimed. 2017 Oct;2017:1059-1067. doi: 10.1145/3123266.3123365.
9
Language-Based Process Phase Detection in the Trauma Resuscitation.创伤复苏中基于语言的过程阶段检测
Proc (IEEE Int Conf Healthc Inform). 2017 Aug;2017:239-247. doi: 10.1109/ICHI.2017.50. Epub 2017 Sep 14.

本文引用的文献

1
Activity Recognition for Medical Teamwork Based on Passive RFID.基于无源射频识别技术的医疗团队协作活动识别
IEEE Int Conf RFID. 2016 May;2016. doi: 10.1109/RFID.2016.7488002. Epub 2016 Jun 9.
2
Regionlets for Generic Object Detection.区域块用于通用目标检测。
IEEE Trans Pattern Anal Mach Intell. 2015 Oct;37(10):2071-84. doi: 10.1109/TPAMI.2015.2389830.
3
Automatic phase prediction from low-level surgical activities.基于低级别手术活动的自动阶段预测。
Int J Comput Assist Radiol Surg. 2015 Jun;10(6):833-41. doi: 10.1007/s11548-015-1195-0. Epub 2015 Apr 23.
4
Statistical modeling and recognition of surgical workflow.手术流程的统计建模与识别。
Med Image Anal. 2012 Apr;16(3):632-41. doi: 10.1016/j.media.2010.10.001. Epub 2010 Dec 8.
5
Modeling and online recognition of surgical phases using Hidden Markov Models.使用隐马尔可夫模型对外科手术阶段进行建模与在线识别。
Med Image Comput Comput Assist Interv. 2008;11(Pt 2):627-35. doi: 10.1007/978-3-540-85990-1_75.