A Distributed Computing Platform for fMRI Big Data Analytics.

Author information

Makkie Milad, Li Xiang, Quinn Shannon, Lin Binbin, Ye Jieping, Mon Geoffrey, Liu Tianming

Affiliations

Department of Computer Science, University of Georgia, Athens, GA 30602.

Clinical Data Science Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114.

Publication information

IEEE Trans Big Data. 2019 Jun;5(2):109-119. doi: 10.1109/TBDATA.2018.2811508. Epub 2018 Mar 6.

DOI:10.1109/TBDATA.2018.2811508

PMID:31240237

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6592627/

Abstract

Since the BRAIN Initiative and the Human Brain Project began, a few efforts have been made to address the computational challenges of neuroscience Big Data. Both projects promised to model the complex interaction of brain and behavior, and to understand and diagnose brain diseases, by collecting and analyzing large quantities of data. Archiving, analyzing, and sharing the growing neuroimaging datasets pose major challenges. New computational methods and technologies have emerged in the Big Data domain but have not been fully adapted for use in neuroimaging. In this work, we introduce the current challenges of neuroimaging in a Big Data context. We review our efforts toward creating a data management system to organize large-scale fMRI datasets, and present our novel algorithms and methods for distributed fMRI data processing that employ Hadoop and Spark. Finally, we demonstrate the significant performance gains of these algorithms and methods in performing distributed dictionary learning.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a45/6592627/e7c7d08e1d21/nihms-1530627-f0001.jpg

Similar articles

Big Data Approaches for the Analysis of Large-Scale fMRI Data Using Apache Spark and GPU Processing: A Demonstration on Resting-State fMRI Data from the Human Connectome Project.

Front Neurosci. 2016 Jan 6;9:492. doi: 10.3389/fnins.2015.00492. eCollection 2015.

Fast and scalable distributed deep convolutional autoencoder for fMRI big data analytics.

Neurocomputing (Amst). 2019 Jan 24;325:20-30. doi: 10.1016/j.neucom.2018.09.066. Epub 2018 Oct 9.

Applications of the MapReduce programming framework to clinical big data analysis: current landscape and future trends.

BioData Min. 2014 Oct 29;7:22. doi: 10.1186/1756-0381-7-22. eCollection 2014.

A distributed computing model for big data anonymization in the networks.

PLoS One. 2023 Apr 28;18(4):e0285212. doi: 10.1371/journal.pone.0285212. eCollection 2023.

Next generation distributed computing for cancer research.

Cancer Inform. 2015 Apr 27;13(Suppl 7):97-109. doi: 10.4137/CIN.S16344. eCollection 2014.

Analyzing big datasets of genomic sequences: fast and scalable collection of k-mer statistics.

BMC Bioinformatics. 2019 Apr 18;20(Suppl 4):138. doi: 10.1186/s12859-019-2694-8.

PySpark and RDKit: Moving towards Big Data in Cheminformatics.

Mol Inform. 2019 Jun;38(6):e1800082. doi: 10.1002/minf.201800082. Epub 2019 Mar 7.

Framing Apache Spark in life sciences.

Heliyon. 2023 Feb 9;9(2):e13368. doi: 10.1016/j.heliyon.2023.e13368. eCollection 2023 Feb.

Distributed Fast Self-Organized Maps for Massive Spectrophotometric Data Analysis.

Sensors (Basel). 2018 May 3;18(5):1419. doi: 10.3390/s18051419.

Cited by

Anesthesia decision analysis using a cloud-based big data platform.

Eur J Med Res. 2024 Mar 25;29(1):201. doi: 10.1186/s40001-024-01764-0.

Current methods and new directions in resting state fMRI.

Clin Imaging. 2020 Sep;65:47-53. doi: 10.1016/j.clinimag.2020.04.004. Epub 2020 Apr 12.

Functional Neuroimaging in the New Era of Big Data.

Genomics Proteomics Bioinformatics. 2019 Aug;17(4):393-401. doi: 10.1016/j.gpb.2018.11.005. Epub 2019 Dec 4.
