Visualization and unsupervised predictive clustering of high-dimensional multimodal neuroimaging data.

Author information

Mwangi Benson, Soares Jair C, Hasan Khader M

Affiliation

UT Center of Excellence on Mood Disorders, Department of Psychiatry and Behavioral Sciences, UT Houston Medical School, Houston, TX, USA.

Publication information

J Neurosci Methods. 2014 Oct 30;236:19-25. doi: 10.1016/j.jneumeth.2014.08.001. Epub 2014 Aug 10.

DOI: 10.1016/j.jneumeth.2014.08.001

PMID: 25117552

Abstract

BACKGROUND

Neuroimaging machine learning studies have largely used supervised algorithms, which require both neuroimaging scan data and corresponding target variables (e.g. healthy vs. diseased) to be 'trained' for a prediction task. This approach may be suboptimal or infeasible when the global structure of the data is not well known and the researcher has no a priori model to fit to the data.

NEW METHOD

We set out to investigate the utility of an unsupervised machine learning technique, t-distributed stochastic neighbour embedding (t-SNE), for identifying 'unseen' sample population patterns that may exist in high-dimensional neuroimaging data. Multimodal neuroimaging scans from 92 healthy subjects were pre-processed using atlas-based methods, integrated, and input into the t-SNE algorithm. Patterns and clusters discovered by the algorithm were visualized in a 2D scatter plot and further analyzed using the K-means clustering algorithm.
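
For readers unfamiliar with t-SNE, the standard formulation of its objective is sketched below. The abstract does not give these equations; the symbols are assumptions taken from the usual presentation of the algorithm: x_i is the integrated feature vector of subject i, y_i is its 2D map point, N is the number of subjects (92 here), and each bandwidth sigma_i is set indirectly through the perplexity parameter.

```latex
% High-dimensional similarities: Gaussian kernel, then symmetrized
p_{j|i} = \frac{\exp\left(-\lVert x_i - x_j \rVert^2 / 2\sigma_i^2\right)}
               {\sum_{k \neq i} \exp\left(-\lVert x_i - x_k \rVert^2 / 2\sigma_i^2\right)},
\qquad
p_{ij} = \frac{p_{j|i} + p_{i|j}}{2N}

% Low-dimensional similarities: Student t kernel with one degree of freedom
q_{ij} = \frac{\left(1 + \lVert y_i - y_j \rVert^2\right)^{-1}}
              {\sum_{k \neq l} \left(1 + \lVert y_k - y_l \rVert^2\right)^{-1}}

% Cost minimized by gradient descent over the map points y_i
C = \mathrm{KL}(P \,\Vert\, Q) = \sum_{i \neq j} p_{ij} \log \frac{p_{ij}}{q_{ij}}
```

The heavy-tailed kernel in the map space lets moderately dissimilar subjects sit far apart in 2D, which is one reason t-SNE scatter plots tend to show visibly separated groups.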

COMPARISON WITH EXISTING METHODS

t-SNE was evaluated against classical principal component analysis.
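
For context, classical PCA projects mean-centered data onto the directions of greatest variance. The sketch below is a minimal pure-Python illustration of that projection, not the authors' implementation. It assumes a small dense feature matrix, and power iteration with deflation stands in for the eigendecomposition that a numerical library would normally perform. All function names and the toy data are hypothetical.

```python
import math
import random


def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))


def center(rows):
    """Subtract the column means so each feature has zero mean."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - means[j] for j in range(d)] for r in rows]


def covariance(centered):
    """Sample covariance matrix of already-centered rows."""
    n, d = len(centered), len(centered[0])
    return [[sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(d)]
            for i in range(d)]


def leading_eigenpair(matrix, iterations=500, seed=0):
    """Power iteration: (eigenvalue, unit eigenvector) for the largest eigenvalue."""
    rng = random.Random(seed)
    v = [rng.uniform(-1.0, 1.0) for _ in matrix]
    for _ in range(iterations):
        w = [dot(row, v) for row in matrix]
        norm = math.sqrt(dot(w, w))
        v = [x / norm for x in w]
    return dot(v, [dot(row, v) for row in matrix]), v


def pca_project(rows, n_components=2):
    """Return (scores, components) for the top principal components."""
    centered = center(rows)
    cov = covariance(centered)
    d = len(cov)
    components = []
    for _ in range(n_components):
        value, vector = leading_eigenpair(cov)
        components.append(vector)
        # Deflation: subtract the component just found so the next
        # power iteration converges to the following eigenvector.
        cov = [[cov[i][j] - value * vector[i] * vector[j] for j in range(d)]
               for i in range(d)]
    scores = [[dot(r, c) for c in components] for r in centered]
    return scores, components


# Toy data (hypothetical): 5 "subjects" with 3 features each.
toy = [[1, 2, 0.0], [2, 4, 0.1], [3, 6, -0.1], [4, 8, 0.05], [5, 10, -0.05]]
toy_scores, toy_components = pca_project(toy)
for row in toy_scores:
    print([round(x, 3) for x in row])
```

Unlike t-SNE, this projection is linear, so it preserves large-scale variance but may not separate groups that differ along a nonlinear structure.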

CONCLUSION

Remarkably, based on unlabelled multimodal scan data, t-SNE separated study subjects into two very distinct clusters that corresponded to subjects' gender labels (cluster silhouette index value = 0.79). The resulting clusters were used to develop an unsupervised minimum distance clustering model, which correctly identified the gender of 93.5% of subjects. From a neuropsychiatric perspective, this method may allow discovery of data-driven disease phenotypes or sub-types of treatment responders.
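
The two quantities reported above can be illustrated with a short sketch. It assumes that the "minimum distance" model assigns a subject to the cluster whose centroid is nearest in the 2D embedding; the abstract does not spell this out, so treat that reading as an assumption. The coordinates, labels, and function names are hypothetical and are not the study data.

```python
import math


def silhouette_index(points, labels):
    """Mean silhouette value over all points (needs at least two clusters)."""
    scores = []
    for i, p in enumerate(points):
        # Distances from p to every other point, grouped by cluster label.
        by_cluster = {}
        for j, q in enumerate(points):
            if i != j:
                by_cluster.setdefault(labels[j], []).append(math.dist(p, q))
        own = by_cluster.get(labels[i])
        if not own:
            # Singleton cluster: silhouette is defined as 0.
            scores.append(0.0)
            continue
        a = sum(own) / len(own)  # mean distance within own cluster
        b = min(sum(d) / len(d)  # mean distance to nearest other cluster
                for c, d in by_cluster.items() if c != labels[i])
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)


def cluster_centroids(points, labels):
    """Mean coordinate of each cluster, keyed by cluster label."""
    grouped = {}
    for p, c in zip(points, labels):
        grouped.setdefault(c, []).append(p)
    return {c: [sum(col) / len(members) for col in zip(*members)]
            for c, members in grouped.items()}


def assign_min_distance(point, centroids):
    """Label of the centroid closest to the given point."""
    return min(centroids, key=lambda c: math.dist(point, centroids[c]))


# Hypothetical 2D embedding with two well-separated groups.
embedding = [(0, 0), (0, 1), (1, 0), (1, 1),
             (10, 10), (10, 11), (11, 10), (11, 11)]
cluster_labels = [0, 0, 0, 0, 1, 1, 1, 1]

print("silhouette:", round(silhouette_index(embedding, cluster_labels), 3))
centers = cluster_centroids(embedding, cluster_labels)
print("new subject assigned to cluster:", assign_min_distance((9.0, 12.0), centers))
```

The silhouette index ranges from -1 to 1, with values near 1 indicating compact, well-separated clusters, so the reported 0.79 is consistent with the description of two very distinct groups.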

Similar articles

Identifying neuroanatomical signatures of anorexia nervosa: a multivariate machine learning approach.

Psychol Med. 2015 Oct;45(13):2805-12. doi: 10.1017/S0033291715000768. Epub 2015 May 20.

Identification and individualized prediction of clinical phenotypes in bipolar disorders using neurocognitive data, neuroimaging scans and machine learning.

Neuroimage. 2017 Jan 15;145(Pt B):254-264. doi: 10.1016/j.neuroimage.2016.02.016. Epub 2016 Feb 13.

Multimodal active subspace analysis for computing assessment oriented subspaces from neuroimaging data.

J Neurosci Methods. 2024 Jun;406:110109. doi: 10.1016/j.jneumeth.2024.110109. Epub 2024 Mar 15.

Multimodal Neuroimaging: Basic Concepts and Classification of Neuropsychiatric Diseases.

Clin EEG Neurosci. 2019 Jan;50(1):20-33. doi: 10.1177/1550059418782093. Epub 2018 Jun 20.

DGCyTOF: Deep learning with graphic cluster visualization to predict cell types of single cell mass cytometry data.

PLoS Comput Biol. 2022 Apr 11;18(4):e1008885. doi: 10.1371/journal.pcbi.1008885. eCollection 2022 Apr.

Analysis of FMRI data using an integrated principal component analysis and supervised affinity propagation clustering approach.

IEEE Trans Biomed Eng. 2011 Nov;58(11):3184-96. doi: 10.1109/TBME.2011.2165542. Epub 2011 Aug 22.

A novel approach for fMRI data analysis based on the combination of sparse approximation and affinity propagation clustering.

Magn Reson Imaging. 2014 Jul;32(6):736-46. doi: 10.1016/j.mri.2014.02.023. Epub 2014 Mar 14.

Cluster tendency assessment in neuronal spike data.

PLoS One. 2019 Nov 12;14(11):e0224547. doi: 10.1371/journal.pone.0224547. eCollection 2019.

Machine learning on brain MRI data for differential diagnosis of Parkinson's disease and Progressive Supranuclear Palsy.

J Neurosci Methods. 2014 Jan 30;222:230-7. doi: 10.1016/j.jneumeth.2013.11.016. Epub 2013 Nov 26.

Cited by

Genome-wide analysis of therapeutic response uncovers molecular pathways governing tamoxifen resistance in ER+ breast cancer.

EBioMedicine. 2020 Nov;61:103047. doi: 10.1016/j.ebiom.2020.103047. Epub 2020 Oct 21.

Hypothalamic estrogen receptor alpha establishes a sexually dimorphic regulatory node of energy expenditure.

Nat Metab. 2020 Apr;2(4):351-363. doi: 10.1038/s42255-020-0189-6. Epub 2020 Apr 13.

Tracking the Brain State Transition Process of Dynamic Function Connectivity Based on Resting State fMRI.

Comput Intell Neurosci. 2019 Oct 7;2019:9027803. doi: 10.1155/2019/9027803. eCollection 2019.

Heterogeneous beta-catenin activation is sufficient to cause hepatocellular carcinoma in zebrafish.

Biol Open. 2019 Oct 17;8(10):bio047829. doi: 10.1242/bio.047829.

Moving Beyond ERP Components: A Selective Review of Approaches to Integrate EEG and Behavior.

Front Hum Neurosci. 2018 Mar 26;12:106. doi: 10.3389/fnhum.2018.00106. eCollection 2018.

Brain Subtyping Enhances The Neuroanatomical Discrimination of Schizophrenia.

Schizophr Bull. 2018 Aug 20;44(5):1060-1069. doi: 10.1093/schbul/sby008.

Deep Learning in Neuroradiology.

AJNR Am J Neuroradiol. 2018 Oct;39(10):1776-1784. doi: 10.3174/ajnr.A5543. Epub 2018 Feb 1.

Beyond Lumping and Splitting: A Review of Computational Approaches for Stratifying Psychiatric Disorders.

Biol Psychiatry Cogn Neurosci Neuroimaging. 2016 Sep;1(5):433-447. doi: 10.1016/j.bpsc.2016.04.002.

A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets.

Front Neuroinform. 2016 Mar 15;10:9. doi: 10.3389/fninf.2016.00009. eCollection 2016.

Identification and individualized prediction of clinical phenotypes in bipolar disorders using neurocognitive data, neuroimaging scans and machine learning.

Neuroimage. 2017 Jan 15;145(Pt B):254-264. doi: 10.1016/j.neuroimage.2016.02.016. Epub 2016 Feb 13.
