• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于视觉图神经网络和对比学习的光谱图像分析用于帕金森病检测

Spectro-Image Analysis with Vision Graph Neural Networks and Contrastive Learning for Parkinson's Disease Detection.

作者信息

Madusanka Nuwan, Malekroodi Hadi Sedigh, Herath H M K K M B, Hewage Chaminda, Yi Myunggi, Lee Byeong-Il

机构信息

Digital Healthcare Research Center, Pukyong National University, Busan 48513, Republic of Korea.

Industry 4.0 Convergence Bionics Engineering, Pukyoung National University, Busan 48513, Republic of Korea.

出版信息

J Imaging. 2025 Jul 2;11(7):220. doi: 10.3390/jimaging11070220.

DOI:10.3390/jimaging11070220
PMID:40710607
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12296162/
Abstract

This study presents a novel framework that integrates Vision Graph Neural Networks (ViGs) with supervised contrastive learning for enhanced spectro-temporal image analysis of speech signals in Parkinson's disease (PD) detection. The approach introduces a frequency band decomposition strategy that transforms raw audio into three complementary spectral representations, capturing distinct PD-specific characteristics across low-frequency (0-2 kHz), mid-frequency (2-6 kHz), and high-frequency (6 kHz+) bands. The framework processes mel multi-band spectro-temporal representations through a ViG architecture that models complex graph-based relationships between spectral and temporal components, trained using a supervised contrastive objective that learns discriminative representations distinguishing PD-affected from healthy speech patterns. Comprehensive experimental validation on multi-institutional datasets from Italy, Colombia, and Spain demonstrates that the proposed ViG-contrastive framework achieves superior classification performance, with the ViG-M-GELU architecture achieving 91.78% test accuracy. The integration of graph neural networks with contrastive learning enables effective learning from limited labeled data while capturing complex spectro-temporal relationships that traditional Convolution Neural Network (CNN) approaches miss, representing a promising direction for developing more accurate and clinically viable speech-based diagnostic tools for PD.

摘要

本研究提出了一种新颖的框架,该框架将视觉图神经网络(ViGs)与监督对比学习相结合,用于帕金森病(PD)检测中语音信号的增强频谱-时间图像分析。该方法引入了一种频带分解策略,将原始音频转换为三种互补的频谱表示,捕捉低频(0-2kHz)、中频(2-6kHz)和高频(6kHz以上)频段中不同的帕金森病特异性特征。该框架通过ViG架构处理梅尔多频带频谱-时间表示,该架构对频谱和时间成分之间基于图的复杂关系进行建模,并使用监督对比目标进行训练,该目标学习区分帕金森病影响的语音模式和健康语音模式的判别表示。对来自意大利、哥伦比亚和西班牙的多机构数据集进行的全面实验验证表明,所提出的ViG对比框架实现了卓越的分类性能,其中ViG-M-GELU架构的测试准确率达到91.78%。图神经网络与对比学习的集成能够从有限的标记数据中进行有效学习,同时捕捉传统卷积神经网络(CNN)方法遗漏的复杂频谱-时间关系,这代表了为帕金森病开发更准确且临床上可行的基于语音的诊断工具的一个有前景的方向。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23fc/12296162/cd40c46a7703/jimaging-11-00220-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23fc/12296162/905652120128/jimaging-11-00220-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23fc/12296162/04bac07f34c1/jimaging-11-00220-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23fc/12296162/c01523d50771/jimaging-11-00220-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23fc/12296162/d32af7e4e2ad/jimaging-11-00220-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23fc/12296162/cd40c46a7703/jimaging-11-00220-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23fc/12296162/905652120128/jimaging-11-00220-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23fc/12296162/04bac07f34c1/jimaging-11-00220-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23fc/12296162/c01523d50771/jimaging-11-00220-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23fc/12296162/d32af7e4e2ad/jimaging-11-00220-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23fc/12296162/cd40c46a7703/jimaging-11-00220-g005.jpg

相似文献

1
Spectro-Image Analysis with Vision Graph Neural Networks and Contrastive Learning for Parkinson's Disease Detection.基于视觉图神经网络和对比学习的光谱图像分析用于帕金森病检测
J Imaging. 2025 Jul 2;11(7):220. doi: 10.3390/jimaging11070220.
2
Speech-Based Parkinson's Detection Using Pre-Trained Self-Supervised Automatic Speech Recognition (ASR) Models and Supervised Contrastive Learning.基于语音的帕金森病检测:使用预训练的自监督自动语音识别(ASR)模型和监督对比学习
Bioengineering (Basel). 2025 Jul 1;12(7):728. doi: 10.3390/bioengineering12070728.
3
SAGEFusionNet: An Auxiliary Supervised Graph Neural Network for Brain Age Prediction as a Neurodegenerative Biomarker.SAGEFusionNet:一种用于脑龄预测的辅助监督图神经网络,作为神经退行性生物标志物
Brain Sci. 2025 Jul 15;15(7):752. doi: 10.3390/brainsci15070752.
4
Short-Term Memory Impairment短期记忆障碍
5
Long-term care plan recommendation for older adults with disabilities: a bipartite graph transformer and self-supervised approach.针对残疾老年人的长期护理计划建议:一种二分图变压器和自监督方法。
J Am Med Inform Assoc. 2025 Apr 1;32(4):689-701. doi: 10.1093/jamia/ocae327.
6
Multi-view based heterogeneous graph contrastive learning for drug-target interaction prediction.基于多视图的异构图对比学习用于药物-靶点相互作用预测
J Biomed Inform. 2025 Aug;168:104852. doi: 10.1016/j.jbi.2025.104852. Epub 2025 Jun 2.
7
Margin-aware optimized contrastive learning for enhanced self-supervised histopathological image classification.用于增强自监督组织病理学图像分类的边缘感知优化对比学习
Health Inf Sci Syst. 2024 Nov 29;13(1):2. doi: 10.1007/s13755-024-00316-4. eCollection 2025 Dec.
8
Taking a look at your speech: identifying diagnostic status and negative symptoms of psychosis using convolutional neural networks.审视你的言语:使用卷积神经网络识别精神病的诊断状态和阴性症状。
NPP Digit Psychiatry Neurosci. 2025;3(1):19. doi: 10.1038/s44277-025-00040-1. Epub 2025 Jul 8.
9
The Global Landscape of Genetic Variation in Parkinson's disease: Multi-Ancestry Insights into Established Disease Genes and their Translational Relevance.帕金森病遗传变异的全球格局:对既定疾病基因及其转化相关性的多血统见解
medRxiv. 2025 Jul 11:2025.07.08.25330815. doi: 10.1101/2025.07.08.25330815.
10
GatorST: A Versatile Contrastive Meta-Learning Framework for Spatial Transcriptomic Data Analysis.GatorST:用于空间转录组数据分析的通用对比元学习框架。
bioRxiv. 2025 Jul 19:2025.07.01.662625. doi: 10.1101/2025.07.01.662625.

本文引用的文献

1
Automatic detection of Parkinsonian speech using wavelet scattering features.基于小波散射特征的帕金森病语音自动检测
JASA Express Lett. 2025 May 1;5(5). doi: 10.1121/10.0036660.
2
Voice biomarkers as prognostic indicators for Parkinson's disease using machine learning techniques.使用机器学习技术将语音生物标志物作为帕金森病的预后指标。
Sci Rep. 2025 Apr 9;15(1):12129. doi: 10.1038/s41598-025-96950-3.
3
Two-stage data augmentation for improved ASR performance for dysarthric speech.用于改善构音障碍语音自动语音识别性能的两阶段数据增强
Comput Biol Med. 2025 May;189:109954. doi: 10.1016/j.compbiomed.2025.109954. Epub 2025 Mar 13.
4
Pre-trained convolutional neural networks identify Parkinson's disease from spectrogram images of voice samples.预训练卷积神经网络从语音样本的频谱图图像中识别帕金森病。
Sci Rep. 2025 Mar 1;15(1):7337. doi: 10.1038/s41598-025-92105-6.
5
NeuroVoz: a Castillian Spanish corpus of parkinsonian speech.NeuroVoz:一个卡斯蒂利亚西班牙语帕金森氏症语音语料库。
Sci Data. 2024 Dec 18;11(1):1367. doi: 10.1038/s41597-024-04186-z.
6
MelTrans: Mel-Spectrogram Relationship-Learning for Speech Emotion Recognition via Transformers.基于 Transformer 的梅尔频谱关系学习在语音情感识别中的应用。
Sensors (Basel). 2024 Aug 25;24(17):5506. doi: 10.3390/s24175506.
7
Machine Learning Models for Parkinson Disease: Systematic Review.帕金森病的机器学习模型:系统综述
JMIR Med Inform. 2024 May 17;12:e50117. doi: 10.2196/50117.
8
Harnessing Voice Analysis and Machine Learning for Early Diagnosis of Parkinson's Disease: A Comparative Study Across Three Datasets.利用语音分析和机器学习进行帕金森病的早期诊断:三个数据集的比较研究
J Voice. 2024 May 12. doi: 10.1016/j.jvoice.2024.04.020.
9
Machine Learning-Assisted Speech Analysis for Early Detection of Parkinson's Disease: A Study on Speaker Diarization and Classification Techniques.基于机器学习的帕金森病早期检测语音分析:说话人分割和分类技术研究。
Sensors (Basel). 2024 Feb 26;24(5):1499. doi: 10.3390/s24051499.
10
A machine learning method to process voice samples for identification of Parkinson's disease.一种用于处理语音样本以识别帕金森病的机器学习方法。
Sci Rep. 2023 Nov 23;13(1):20615. doi: 10.1038/s41598-023-47568-w.