面向实时高分辨率高光谱视频理解的硬件加速集成光电平台。

Hardware-accelerated integrated optoelectronic platform towards real-time high-resolution hyperspectral video understanding.

作者信息

Makarenko Maksim, Burguete-Lopez Arturo, Wang Qizhou, Giancola Silvio, Ghanem Bernard, Passone Luca, Fratalocchi Andrea

机构信息

PRIMALIGHT, Faculty of Electrical Engineering; Applied Mathematics and Computational Science, King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia.

AI & Advanced Computing Lab, EXPEC ARC, Saudi Aramco, 4143 Dhahran Blvd, Gharb Al Dhahran, Dhahran, 34466, Saudi Arabia.

出版信息

Nat Commun. 2024 Aug 15;15(1):7051. doi: 10.1038/s41467-024-51406-6.

DOI:10.1038/s41467-024-51406-6

PMID:39147787

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11327253/

Abstract

Recent advancements in artificial intelligence have significantly expanded capabilities in processing language and images. However, the challenge of comprehensively understanding video content still needs to be solved. The main problem is the requirement to process real-time multidimensional video information at data rates exceeding 1 Tb/s, a demand that current hardware technologies cannot meet. This work introduces a hardware-accelerated integrated optoelectronic platform specifically designed for the real-time analysis of multidimensional video. By leveraging optical information processing within artificial intelligence hardware and combining it with advanced machine vision networks, the platform achieves data processing speeds of 1.2 Tb/s. This capability supports the analysis of hundreds of frequency bands with megapixel spatial resolution at video frame rates, significantly outperforming existing technologies in speed by three to four orders of magnitude. The platform demonstrates effectiveness for AI-driven tasks, such as video semantic segmentation and object understanding, across indoor and aerial scenarios. By overcoming the current data processing speed limitations, the platform shows promise in real-time AI video understanding, with potential implications for enhancing human-machine interactions and advancing cognitive processing technologies.

摘要

人工智能领域的最新进展显著扩展了语言和图像处理能力。然而，全面理解视频内容的挑战仍有待解决。主要问题在于需要以超过1 Tb/s的数据速率处理实时多维视频信息，这是当前硬件技术无法满足的需求。这项工作介绍了一种专门为多维视频实时分析设计的硬件加速集成光电平台。通过在人工智能硬件中利用光学信息处理，并将其与先进的机器视觉网络相结合，该平台实现了1.2 Tb/s的数据处理速度。这种能力支持在视频帧率下以百万像素空间分辨率分析数百个频段，在速度上比现有技术显著高出三到四个数量级。该平台在室内和空中场景中展示了对人工智能驱动任务（如视频语义分割和对象理解）的有效性。通过克服当前的数据处理速度限制，该平台在实时人工智能视频理解方面展现出潜力，对增强人机交互和推进认知处理技术具有潜在影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a05/11327253/23533fe1dad0/41467_2024_51406_Fig1_HTML.jpg

相似文献

Hardware-accelerated integrated optoelectronic platform towards real-time high-resolution hyperspectral video understanding.面向实时高分辨率高光谱视频理解的硬件加速集成光电平台。

Nat Commun. 2024 Aug 15;15(1):7051. doi: 10.1038/s41467-024-51406-6.

Developing the surgeon-machine interface: using a novel instance-segmentation framework for intraoperative landmark labelling.开发外科医生-机器接口：使用新颖的实例分割框架进行术中地标标记。

Front Surg. 2023 Oct 23;10:1259756. doi: 10.3389/fsurg.2023.1259756. eCollection 2023.

Recording human electrocorticographic (ECoG) signals for neuroscientific research and real-time functional cortical mapping.记录用于神经科学研究和实时功能性皮层图谱绘制的人类皮层脑电图（ECoG）信号。

J Vis Exp. 2012 Jun 26(64):3993. doi: 10.3791/3993.

Advancements in Microprocessor Architecture for Ubiquitous AI-An Overview on History, Evolution, and Upcoming Challenges in AI Implementation.用于普适人工智能的微处理器架构进展——人工智能实施的历史、演进及未来挑战概述

Micromachines (Basel). 2021 Jun 6;12(6):665. doi: 10.3390/mi12060665.

High-Speed Hyperspectral Video Acquisition By Combining Nyquist and Compressive Sampling.结合奈奎斯特采样和压缩采样的高速高光谱视频采集

IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):857-870. doi: 10.1109/TPAMI.2018.2817496. Epub 2018 Mar 20.

Artificial Intelligence in the Detection of Barrett's Esophagus: A Systematic Review.人工智能在巴雷特食管检测中的应用：一项系统综述。

Cureus. 2023 Oct 26;15(10):e47755. doi: 10.7759/cureus.47755. eCollection 2023 Oct.

A neuromorphic system for video object recognition.一种用于视频目标识别的神经形态系统。

Front Comput Neurosci. 2014 Nov 28;8:147. doi: 10.3389/fncom.2014.00147. eCollection 2014.

Parallel convolutional processing using an integrated photonic tensor core.基于集成光子张量核的并行卷积处理。

Nature. 2021 Jan;589(7840):52-58. doi: 10.1038/s41586-020-03070-1. Epub 2021 Jan 6.

Feasibility of video-based real-time nystagmus tracking: a lightweight deep learning model approach using ocular object segmentation.基于视频的实时眼球震颤追踪的可行性：一种使用眼部目标分割的轻量级深度学习模型方法

Front Neurol. 2024 Feb 21;15:1342108. doi: 10.3389/fneur.2024.1342108. eCollection 2024.

mIoT: Metamorphic IoT Platform for On-Demand Hardware Replacement in Large-Scaled IoT Applications.mIoT：用于大规模物联网应用中按需硬件更换的变形物联网平台。

Sensors (Basel). 2020 Jun 12;20(12):3337. doi: 10.3390/s20123337.

引用本文的文献

From spectrum to yield: advances in crop photosynthesis with hyperspectral imaging.从光谱到产量：利用高光谱成像技术实现作物光合作用的进展

Photosynthetica. 2025 Jul 8;63(2):196-233. doi: 10.32615/ps.2025.012. eCollection 2025.

本文引用的文献

Machine learning can guide food security efforts when primary data are not available.机器学习可以在无法获得原始数据时指导食品安全工作。

Nat Food. 2022 Sep;3(9):716-728. doi: 10.1038/s43016-022-00587-8. Epub 2022 Sep 15.

Metasurfaces-Driven Hyperspectral Imaging via Multiplexed Plasmonic Resonance Energy Transfer.基于超表面的高光谱成像技术通过复用等离子体激元共振能量转移实现。

Adv Mater. 2023 Aug;35(32):e2300229. doi: 10.1002/adma.202300229. Epub 2023 Jun 11.

Foundation models for generalist medical artificial intelligence.通用型医学人工智能的基础模型。

Nature. 2023 Apr;616(7956):259-265. doi: 10.1038/s41586-023-05881-4. Epub 2023 Apr 12.

Snapshot multispectral imaging using a diffractive optical network.使用衍射光学网络的快照多光谱成像。

Light Sci Appl. 2023 Apr 6;12(1):86. doi: 10.1038/s41377-023-01135-0.

A federated graph neural network framework for privacy-preserving personalization.联邦图神经网络框架用于保护隐私的个性化服务。

Nat Commun. 2022 Jun 2;13(1):3091. doi: 10.1038/s41467-022-30714-9.

Communication-efficient federated learning via knowledge distillation.基于知识蒸馏的高效通信联邦学习。

Nat Commun. 2022 Apr 19;13(1):2032. doi: 10.1038/s41467-022-29763-x.

Federated learning for predicting clinical outcomes in patients with COVID-19.基于联邦学习的 COVID-19 患者临床结局预测

Nat Med. 2021 Oct;27(10):1735-1743. doi: 10.1038/s41591-021-01506-3. Epub 2021 Sep 15.

Spectrally encoded single-pixel machine vision using diffractive networks.使用衍射网络的光谱编码单像素机器视觉。

Sci Adv. 2021 Mar 26;7(13). doi: 10.1126/sciadv.abd7690. Print 2021 Mar.

Broadband vectorial ultrathin optics with experimental efficiency up to 99% in the visible region via universal approximators.通过通用逼近器实现的宽带矢量超薄光学，在可见光区域实验效率高达99%。

Light Sci Appl. 2021 Mar 4;10(1):47. doi: 10.1038/s41377-021-00489-7.

Using deep learning for dermatologist-level detection of suspicious pigmented skin lesions from wide-field images.利用深度学习技术，从宽场图像中检测可疑色素性皮肤病变，达到皮肤科医生的水平。

Sci Transl Med. 2021 Feb 17;13(581). doi: 10.1126/scitranslmed.abb3652.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

面向实时高分辨率高光谱视频理解的硬件加速集成光电平台。

Hardware-accelerated integrated optoelectronic platform towards real-time high-resolution hyperspectral video understanding.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献