• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

智能超越学习:一种用于视频理解的上下文感知人工智能系统。

Intelligence Is beyond Learning: A Context-Aware Artificial Intelligent System for Video Understanding.

作者信息

Ghozia Ahmed, Attiya Gamal, Adly Emad, El-Fishawy Nawal

机构信息

Computer Science and Engineering Department, Faculty of Electronic Engineering, Menoufia University, Shibin El Kom, Menofia Governorate, Egypt.

出版信息

Comput Intell Neurosci. 2020 Dec 23;2020:8813089. doi: 10.1155/2020/8813089. eCollection 2020.

DOI:10.1155/2020/8813089
PMID:33424960
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7775170/
Abstract

Understanding video files is a challenging task. While the current video understanding techniques rely on deep learning, the obtained results suffer from a lack of real trustful meaning. Deep learning recognizes patterns from big data, leading to deep feature abstraction, not deep understanding. Deep learning tries to understand multimedia production by analyzing its content. We cannot understand the semantics of a multimedia file by analyzing its content only. Events occurring in a scene earn their meanings from the context containing them. A screaming kid could be scared of a threat or surprised by a lovely gift or just playing in the backyard. Artificial intelligence is a heterogeneous process that goes beyond learning. In this article, we discuss the heterogeneity of AI as a process that includes innate knowledge, approximations, and context awareness. We present a context-aware video understanding technique that makes the machine intelligent enough to understand the message behind the video stream. The main purpose is to understand the video stream by extracting real meaningful concepts, emotions, temporal data, and spatial data from the video context. The diffusion of heterogeneous data patterns from the video context leads to accurate decision-making about the video message and outperforms systems that rely on deep learning. Objective and subjective comparisons prove the accuracy of the concepts extracted by the proposed context-aware technique in comparison with the current deep learning video understanding techniques. Both systems are compared in terms of retrieval time, computing time, data size consumption, and complexity analysis. Comparisons show a significant efficient resource usage of the proposed context-aware system, which makes it a suitable solution for real-time scenarios. Moreover, we discuss the pros and cons of deep learning architectures.

摘要

理解视频文件是一项具有挑战性的任务。虽然当前的视频理解技术依赖于深度学习,但所获得的结果缺乏真正可信的意义。深度学习从大数据中识别模式,导致深度特征抽象,而非深度理解。深度学习试图通过分析多媒体内容来理解多媒体制作。仅通过分析多媒体文件的内容,我们无法理解其语义。场景中发生的事件从包含它们的上下文环境中获得其意义。一个尖叫的孩子可能是受到了威胁的惊吓,或者是被一份可爱的礼物惊喜到,又或者只是在后院玩耍。人工智能是一个超越学习的异构过程。在本文中,我们将讨论人工智能作为一个包含先天知识、近似值和上下文感知的过程的异构性。我们提出了一种上下文感知视频理解技术,使机器足够智能,能够理解视频流背后的信息。主要目的是通过从视频上下文中提取真正有意义的概念、情感、时间数据和空间数据来理解视频流。来自视频上下文的异构数据模式的扩散导致对视频信息的准确决策,并且优于依赖深度学习的系统。客观和主观比较证明了与当前深度学习视频理解技术相比,所提出的上下文感知技术提取的概念的准确性。在检索时间、计算时间、数据大小消耗和复杂性分析方面对这两种系统进行了比较。比较结果表明,所提出的上下文感知系统具有显著高效的资源使用情况,这使其成为实时场景的合适解决方案。此外,我们还讨论了深度学习架构的优缺点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/ad86f787635c/CIN2020-8813089.alg.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/65dc9c340b72/CIN2020-8813089.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/589f2d9d7f72/CIN2020-8813089.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/e1d4532017b0/CIN2020-8813089.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/ae398e471f43/CIN2020-8813089.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/51930d2dfac1/CIN2020-8813089.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/a21d20665056/CIN2020-8813089.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/5defd563cd59/CIN2020-8813089.alg.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/3321cdea8726/CIN2020-8813089.alg.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/ad86f787635c/CIN2020-8813089.alg.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/65dc9c340b72/CIN2020-8813089.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/589f2d9d7f72/CIN2020-8813089.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/e1d4532017b0/CIN2020-8813089.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/ae398e471f43/CIN2020-8813089.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/51930d2dfac1/CIN2020-8813089.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/a21d20665056/CIN2020-8813089.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/5defd563cd59/CIN2020-8813089.alg.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/3321cdea8726/CIN2020-8813089.alg.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc53/7775170/ad86f787635c/CIN2020-8813089.alg.004.jpg

相似文献

1
Intelligence Is beyond Learning: A Context-Aware Artificial Intelligent System for Video Understanding.智能超越学习:一种用于视频理解的上下文感知人工智能系统。
Comput Intell Neurosci. 2020 Dec 23;2020:8813089. doi: 10.1155/2020/8813089. eCollection 2020.
2
Interpretable Artificial Intelligence: Why and When.可解释人工智能:为何与何时。
AJR Am J Roentgenol. 2020 May;214(5):1137-1138. doi: 10.2214/AJR.19.22145. Epub 2020 Mar 4.
3
Big Data-Artificial Intelligence Fusion Technology in Education in the Context of the New Crown Epidemic.新冠疫情背景下教育领域的大数据-人工智能融合技术。
Big Data. 2022 Jun;10(3):262-276. doi: 10.1089/big.2021.0245. Epub 2022 May 23.
4
Artificial Intelligence, Machine Learning, Deep Learning, and Cognitive Computing: What Do These Terms Mean and How Will They Impact Health Care?人工智能、机器学习、深度学习和认知计算:这些术语是什么意思,它们将如何影响医疗保健?
J Arthroplasty. 2018 Aug;33(8):2358-2361. doi: 10.1016/j.arth.2018.02.067. Epub 2018 Feb 27.
5
A Hybrid Stacked CNN and Residual Feedback GMDH-LSTM Deep Learning Model for Stroke Prediction Applied on Mobile AI Smart Hospital Platform.基于移动 AI 智能医院平台的应用,采用混合堆叠 CNN 和残差反馈 GMDH-LSTM 深度学习模型进行中风预测。
Sensors (Basel). 2023 Mar 27;23(7):3500. doi: 10.3390/s23073500.
6
Intelligent health data analytics: A convergence of artificial intelligence and big data.智能健康数据分析:人工智能与大数据的融合
Healthc Manage Forum. 2019 Jul;32(4):178-182. doi: 10.1177/0840470419846134. Epub 2019 May 22.
7
Shallow and deep learning classifiers in medical image analysis.医学图像分析中的浅层和深度学习分类器。
Eur Radiol Exp. 2024 Mar 5;8(1):26. doi: 10.1186/s41747-024-00428-2.
8
Design and Implementation of Interactive Platform for Operation and Maintenance of Multimedia Information System Based on Artificial Intelligence and Big Data.基于人工智能和大数据的多媒体信息系统运维交互平台的设计与实现。
Comput Intell Neurosci. 2022 May 4;2022:4620930. doi: 10.1155/2022/4620930. eCollection 2022.
9
The Role and Promise of Artificial Intelligence in Medical Toxicology.人工智能在医学毒理学中的作用和前景。
J Med Toxicol. 2020 Oct;16(4):458-464. doi: 10.1007/s13181-020-00769-5. Epub 2020 Mar 25.
10
Intelligent Integrated System for Fruit Detection Using Multi-UAV Imaging and Deep Learning.基于多无人机成像和深度学习的水果智能检测集成系统。
Sensors (Basel). 2024 Mar 16;24(6):1913. doi: 10.3390/s24061913.

引用本文的文献

1
Nontechnical Skills for Intraoperative Team Members.手术团队成员的非技术技能。
Anesthesiol Clin. 2023 Dec;41(4):803-818. doi: 10.1016/j.anclin.2023.03.013. Epub 2023 Apr 18.

本文引用的文献

1
Generalised Dice Overlap as a Deep Learning Loss Function for Highly Unbalanced Segmentations.广义骰子重叠作为高度不平衡分割的深度学习损失函数
Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2017). 2017;2017:240-248. doi: 10.1007/978-3-319-67558-9_28. Epub 2017 Sep 9.
2
Moments in Time Dataset: One Million Videos for Event Understanding.时间点数据集:用于事件理解的一百万段视频。
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):502-508. doi: 10.1109/TPAMI.2019.2901464. Epub 2019 Feb 25.
3
Deep learning and process understanding for data-driven Earth system science.
深度学习与过程理解在数据驱动的地球系统科学中的应用。
Nature. 2019 Feb;566(7743):195-204. doi: 10.1038/s41586-019-0912-1. Epub 2019 Feb 13.
4
Scalable training of artificial neural networks with adaptive sparse connectivity inspired by network science.基于网络科学的自适应稀疏连接启发的人工神经网络的可扩展训练。
Nat Commun. 2018 Jun 19;9(1):2383. doi: 10.1038/s41467-018-04316-3.
5
Improving machine learning reproducibility in genetic association studies with proportional instance cross validation (PICV).通过比例实例交叉验证(PICV)提高遗传关联研究中机器学习的可重复性。
BioData Min. 2018 Apr 19;11:6. doi: 10.1186/s13040-018-0167-7. eCollection 2018.
6
Deep Learning in Neuroradiology.深度学习在神经影像学中的应用。
AJNR Am J Neuroradiol. 2018 Oct;39(10):1776-1784. doi: 10.3174/ajnr.A5543. Epub 2018 Feb 1.
7
Deep learning for healthcare: review, opportunities and challenges.深度学习在医疗保健领域的应用:综述、机遇与挑战。
Brief Bioinform. 2018 Nov 27;19(6):1236-1246. doi: 10.1093/bib/bbx044.
8
Building machines that learn and think like people.建造像人一样学习和思考的机器。
Behav Brain Sci. 2017 Jan;40:e253. doi: 10.1017/S0140525X16001837. Epub 2016 Nov 24.
9
Long-Term Recurrent Convolutional Networks for Visual Recognition and Description.长期递归卷积网络的视觉识别与描述。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):677-691. doi: 10.1109/TPAMI.2016.2599174. Epub 2016 Sep 1.
10
Mastering the game of Go with deep neural networks and tree search.用深度神经网络和树搜索掌握围棋游戏。
Nature. 2016 Jan 28;529(7587):484-9. doi: 10.1038/nature16961.