• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个支持混合数据探索的异构多模态医学数据融合框架。

A heterogeneous multi-modal medical data fusion framework supporting hybrid data exploration.

作者信息

Zhang Yong, Sheng Ming, Liu Xingyue, Wang Ruoyu, Lin Weihang, Ren Peng, Wang Xia, Zhao Enlai, Song Wenchao

机构信息

BNRist, DCST, RIIT, Tsinghua University, Beijing, 100084 China.

Beihang University, Beijing, 102206 China.

出版信息

Health Inf Sci Syst. 2022 Aug 26;10(1):22. doi: 10.1007/s13755-022-00183-x. eCollection 2022 Dec.

DOI:10.1007/s13755-022-00183-x
PMID:36039096
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9417071/
Abstract

Industry 4.0 era has witnessed that more and more high-tech and precise devices are applied into medical field to provide better services. Besides EMRs, medical data include a large amount of unstructured data such as X-rays, MRI scans, CT scans and PET scans, which is still continually increasing. These massive, heterogeneous multi-modal data bring the big challenge to finding valuable data sets for healthcare researchers and other users. The traditional data warehouses are able to integrate the data and support interactive data exploration through ETL process. However, they have high cost and are not real-time. Furthermore, they lack of the ability to deal with multi-modal data in two phases-data fusion and data exploration. In the data fusion phase, it is difficult to unify the multi-modal data under one data model. In the data exploration phase, it is challenging to explore the multi-modal data at the same time, which impedes the process of extracting the diverse information underlying multi-modal data. Therefore, in order to solve these problems, we propose a highly efficient data fusion framework supporting data exploration for heterogeneous multi-modal medical data based on data lake. This framework provides a novel and efficient method to fuse the fragmented multi-modal medical data and store their metadata in the data lake. It offers a user-friendly interface supporting hybrid graph queries to explore multi-modal data. Indexes are created to accelerate the hybrid data exploration. One prototype has been implemented and tested in a hospital, which demonstrates the effectiveness of our framework.

摘要

工业4.0时代见证了越来越多的高科技精密设备应用于医疗领域以提供更好的服务。除了电子病历,医疗数据还包括大量非结构化数据,如X光、核磁共振成像扫描、计算机断层扫描和正电子发射断层扫描,并且其数量仍在持续增长。这些海量、异构的多模态数据给医疗保健研究人员和其他用户寻找有价值的数据集带来了巨大挑战。传统数据仓库能够通过ETL过程集成数据并支持交互式数据探索。然而,它们成本高昂且不实时。此外,它们缺乏在数据融合和数据探索两个阶段处理多模态数据的能力。在数据融合阶段,难以将多模态数据统一在一个数据模型之下。在数据探索阶段,同时探索多模态数据具有挑战性,这阻碍了提取多模态数据背后各种信息的过程。因此,为了解决这些问题,我们提出了一种基于数据湖的高效数据融合框架,用于支持对异构多模态医疗数据进行数据探索。该框架提供了一种新颖且高效的方法来融合碎片化的多模态医疗数据,并将其元数据存储在数据湖中。它提供了一个支持混合图查询的用户友好界面来探索多模态数据。通过创建索引来加速混合数据探索。我们已经在一家医院实现并测试了一个原型,这证明了我们框架的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63b2/9418392/820994ff7ce6/13755_2022_183_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63b2/9418392/1bd560a8aabb/13755_2022_183_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63b2/9418392/16cdd48981a8/13755_2022_183_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63b2/9418392/b9800ab57285/13755_2022_183_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63b2/9418392/c28011b52368/13755_2022_183_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63b2/9418392/05e0ebabb92c/13755_2022_183_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63b2/9418392/820994ff7ce6/13755_2022_183_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63b2/9418392/1bd560a8aabb/13755_2022_183_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63b2/9418392/16cdd48981a8/13755_2022_183_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63b2/9418392/b9800ab57285/13755_2022_183_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63b2/9418392/c28011b52368/13755_2022_183_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63b2/9418392/05e0ebabb92c/13755_2022_183_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63b2/9418392/820994ff7ce6/13755_2022_183_Fig6_HTML.jpg

相似文献

1
A heterogeneous multi-modal medical data fusion framework supporting hybrid data exploration.一个支持混合数据探索的异构多模态医学数据融合框架。
Health Inf Sci Syst. 2022 Aug 26;10(1):22. doi: 10.1007/s13755-022-00183-x. eCollection 2022 Dec.
2
Transformer-Based Multi-Modal Data Fusion Method for COPD Classification and Physiological and Biochemical Indicators Identification.基于Transformer的慢性阻塞性肺疾病分类及生理生化指标识别的多模态数据融合方法
Biomolecules. 2023 Sep 15;13(9):1391. doi: 10.3390/biom13091391.
3
A multi-source heterogeneous medical data enhancement framework based on lakehouse.一种基于湖仓架构的多源异构医学数据增强框架。
Health Inf Sci Syst. 2024 Jul 5;12(1):37. doi: 10.1007/s13755-024-00295-6. eCollection 2024 Dec.
4
BPGAN: Brain PET synthesis from MRI using generative adversarial network for multi-modal Alzheimer's disease diagnosis.基于生成对抗网络的脑 PET 从 MRI 合成用于多模态阿尔茨海默病诊断
Comput Methods Programs Biomed. 2022 Apr;217:106676. doi: 10.1016/j.cmpb.2022.106676. Epub 2022 Feb 1.
5
Multi-modal neuroimaging feature fusion for diagnosis of Alzheimer's disease.用于阿尔茨海默病诊断的多模态神经影像特征融合
J Neurosci Methods. 2020 Jul 15;341:108795. doi: 10.1016/j.jneumeth.2020.108795. Epub 2020 May 22.
6
Hahn-PCNN-CNN: an end-to-end multi-modal brain medical image fusion framework useful for clinical diagnosis. Hahn-PCNN-CNN:一种端到端的多模态脑医学影像融合框架,有助于临床诊断。
BMC Med Imaging. 2021 Jul 14;21(1):111. doi: 10.1186/s12880-021-00642-z.
7
Adaptive Multi-Modal Fusion Framework for Activity Monitoring of People With Mobility Disability.用于行动不便者活动监测的自适应多模态融合框架
IEEE J Biomed Health Inform. 2022 Aug;26(8):4314-4324. doi: 10.1109/JBHI.2022.3168004. Epub 2022 Aug 11.
8
NeuroPycon: An open-source python toolbox for fast multi-modal and reproducible brain connectivity pipelines.NeuroPycon:一个开源的 Python 工具包,用于快速进行多模态和可重复的脑连接管道。
Neuroimage. 2020 Oct 1;219:117020. doi: 10.1016/j.neuroimage.2020.117020. Epub 2020 Jun 6.
9
A multi-modal fusion framework based on multi-task correlation learning for cancer prognosis prediction.一种基于多任务关联学习的多模态融合框架用于癌症预后预测。
Artif Intell Med. 2022 Apr;126:102260. doi: 10.1016/j.artmed.2022.102260. Epub 2022 Feb 24.
10
Hi-Net: Hybrid-Fusion Network for Multi-Modal MR Image Synthesis.Hi-Net:用于多模态磁共振图像合成的混合融合网络。
IEEE Trans Med Imaging. 2020 Sep;39(9):2772-2781. doi: 10.1109/TMI.2020.2975344. Epub 2020 Feb 20.

引用本文的文献

1
Methodological development study: Dynamic mask attention graph neural network for mechanical ventilation in elderly intensive care unit patients.方法学发展研究:用于老年重症监护病房患者机械通气的动态掩码注意力图神经网络
Digit Health. 2025 Jul 30;11:20552076251361680. doi: 10.1177/20552076251361680. eCollection 2025 Jan-Dec.
2
The Social Construction of Categorical Data: Mixed Methods Approach to Assessing Data Features in Publicly Available Datasets.分类数据的社会建构:评估公开可用数据集中数据特征的混合方法
JMIR Med Inform. 2025 Jan 28;13:e59452. doi: 10.2196/59452.
3
Eight quick tips for biologically and medically informed machine learning.

本文引用的文献

1
HFIP: an integrated multi-omics data and knowledge platform for the precision medicine of heart failure.HFIP:心力衰竭精准医学的一体化多组学数据和知识平台。
Database (Oxford). 2021 Nov 13;2021(2021). doi: 10.1093/database/baab076.
2
Image Preprocessing in Classification and Identification of Diabetic Eye Diseases.糖尿病眼病分类与识别中的图像预处理
Data Sci Eng. 2021;6(4):455-471. doi: 10.1007/s41019-021-00167-z. Epub 2021 Aug 17.
3
Fusion of whole and part features for the classification of histopathological image of breast tissue.
生物医学知识辅助机器学习的八点快速提示。
PLoS Comput Biol. 2025 Jan 9;21(1):e1012711. doi: 10.1371/journal.pcbi.1012711. eCollection 2025 Jan.
4
Applications of Multimodal Artificial Intelligence in Non-Hodgkin Lymphoma B Cells.多模态人工智能在非霍奇金淋巴瘤B细胞中的应用
Biomedicines. 2024 Aug 5;12(8):1753. doi: 10.3390/biomedicines12081753.
5
A multi-source heterogeneous medical data enhancement framework based on lakehouse.一种基于湖仓架构的多源异构医学数据增强框架。
Health Inf Sci Syst. 2024 Jul 5;12(1):37. doi: 10.1007/s13755-024-00295-6. eCollection 2024 Dec.
6
Interrelated feature selection from health surveys using domain knowledge graph.使用领域知识图谱从健康调查中进行相关特征选择。
Health Inf Sci Syst. 2023 Nov 16;11(1):54. doi: 10.1007/s13755-023-00254-7. eCollection 2023 Dec.
7
Intelligent Warehouse in Industry 4.0-Systematic Literature Review.工业 4.0 中的智能仓库——系统文献综述。
Sensors (Basel). 2023 Apr 19;23(8):4105. doi: 10.3390/s23084105.
8
Intelligent antepartum fetal monitoring via deep learning and fusion of cardiotocographic signals and clinical data.通过深度学习以及胎心监护信号与临床数据融合实现智能产前胎儿监测。
Health Inf Sci Syst. 2023 Mar 19;11(1):16. doi: 10.1007/s13755-023-00219-w. eCollection 2023 Dec.
9
Design and development of a disease-specific clinical database system to increase the availability of hospital data in China.设计并开发一个针对特定疾病的临床数据库系统,以提高中国医院数据的可用性。
Health Inf Sci Syst. 2023 Jan 30;11(1):11. doi: 10.1007/s13755-023-00211-4. eCollection 2023 Dec.
融合整体与局部特征用于乳腺组织病理图像分类
Health Inf Sci Syst. 2020 Nov 4;8(1):38. doi: 10.1007/s13755-020-00131-7. eCollection 2020 Dec.
4
The technology behind TB DEPOT: a novel public analytics platform integrating tuberculosis clinical, genomic, and radiological data for visual and statistical exploration.TB DEPOT 背后的技术:一个新型的公共分析平台,整合了结核病临床、基因组和影像学数据,用于可视化和统计探索。
J Am Med Inform Assoc. 2021 Jan 15;28(1):71-79. doi: 10.1093/jamia/ocaa228.
5
Automated epilepsy detection techniques from electroencephalogram signals: a review study.基于脑电图信号的自动癫痫检测技术:一项综述研究
Health Inf Sci Syst. 2020 Oct 12;8(1):33. doi: 10.1007/s13755-020-00129-1. eCollection 2020 Dec.
6
Automated detection of mild and multi-class diabetic eye diseases using deep learning.使用深度学习自动检测轻度和多类糖尿病眼病。
Health Inf Sci Syst. 2020 Oct 8;8(1):32. doi: 10.1007/s13755-020-00125-5. eCollection 2020 Dec.
7
INSMA: An integrated system for multimodal data acquisition and analysis in the intensive care unit.INSMA:一种用于重症监护病房多模态数据采集与分析的集成系统。
J Biomed Inform. 2020 Jun;106:103434. doi: 10.1016/j.jbi.2020.103434. Epub 2020 Apr 28.
8
Keyword extraction and structuralization of medical reports.医学报告的关键词提取与结构化
Health Inf Sci Syst. 2020 Apr 3;8(1):18. doi: 10.1007/s13755-020-00108-6. eCollection 2020 Dec.
9
A Survey on Deep Learning for Multimodal Data Fusion.深度学习在多模态数据融合中的研究综述。
Neural Comput. 2020 May;32(5):829-864. doi: 10.1162/neco_a_01273. Epub 2020 Mar 18.
10
Integration of mathematical model predictions into routine workflows to support clinical decision making in haematology.将数学模型预测整合到常规工作流程中,以支持血液学中的临床决策。
BMC Med Inform Decis Mak. 2020 Feb 10;20(1):28. doi: 10.1186/s12911-020-1039-x.