• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结合图像、语音和患者问卷数据对喉部疾病进行分类。

Combining image, voice, and the patient's questionnaire data to categorize laryngeal disorders.

机构信息

Department of Electrical & Control Equipment, Kaunas University of Technology, Lithuania.

出版信息

Artif Intell Med. 2010 May;49(1):43-50. doi: 10.1016/j.artmed.2010.02.002. Epub 2010 Mar 24.

DOI:10.1016/j.artmed.2010.02.002
PMID:20338736
Abstract

OBJECTIVE

This paper is concerned with soft computing techniques for categorizing laryngeal disorders based on information extracted from an image of patient's vocal folds, a voice signal, and questionnaire data.

METHODS

Multiple feature sets are exploited to characterize images and voice signals. To characterize colour, texture, and geometry of biological structures seen in colour images of vocal folds, eight feature sets are used. Twelve feature sets are used to obtain a comprehensive characterization of a voice signal (the sustained phonation of the vowel sound /a/). Answers to 14 questions constitute the questionnaire feature set. A committee of support vector machines is designed for categorizing the image, voice, and query data represented by the multiple feature sets into the healthy, nodular and diffuse classes. Five alternatives to aggregate separate SVMs into a committee are explored. Feature selection and classifier design are combined into the same learning process based on genetic search.

RESULTS

Data of all the three modalities were available from 240 patients. Among those, 151 patients belong to the nodular class, 64 to the diffuse class and 25 to the healthy class. When using a single feature set to characterize each modality, the test set data classification accuracy of 75.0%, 72.1%, and 85.0% was obtained for the image, voice and questionnaire data, respectively. The use of multiple feature sets allowed to increase the accuracy to 89.5% and 87.7% for the image and voice data, respectively. The test set data classification accuracy of over 98.0% was obtained from a committee exploiting multiple feature sets from all the three modalities. The highest classification accuracy was achieved when using the SVM-based aggregation with hyper parameters of the SVM determined by genetic search. Bearing in mind the difficulty of the task, the obtained classification accuracy is rather encouraging.

CONCLUSIONS

Combination of both multiple feature sets characterizing a single modality and the three modalities allowed to substantially improve the classification accuracy if compared to the highest accuracy obtained from a single feature set and a single modality. In spite of the unbalanced data sets used, the error rates obtained for the three classes were rather similar.

摘要

目的

本文关注基于从患者声带图像、语音信号和问卷数据中提取的信息,应用软计算技术对声带疾病进行分类。

方法

利用多个特征集来描述图像和语音信号。为了描述声带彩色图像中生物结构的颜色、纹理和几何形状,使用了 8 个特征集。为了全面描述语音信号(元音/a/的持续发音),使用了 12 个特征集。问卷特征集由 14 个问题的答案组成。设计了一个支持向量机委员会,用于将多个特征集表示的图像、语音和查询数据分类为健康、结节和弥漫性类别。探索了五种将独立的 SVM 聚合到委员会中的方法。特征选择和分类器设计结合到基于遗传搜索的同一个学习过程中。

结果

共有 240 名患者的三种模态数据可用。其中,151 名患者属于结节类,64 名患者属于弥漫性类,25 名患者属于健康类。当使用单个特征集来描述每种模态时,图像、语音和问卷数据的测试集数据分类准确率分别为 75.0%、72.1%和 85.0%。使用多个特征集可以将图像和语音数据的准确率分别提高到 89.5%和 87.7%。使用来自所有三种模态的多个特征集的委员会可以获得超过 98.0%的测试集数据分类准确率。使用基于遗传搜索确定 SVM 超参数的 SVM 聚合获得了最高的分类准确率。考虑到任务的难度,所获得的分类准确率相当令人鼓舞。

结论

与单个特征集和单个模态获得的最高准确率相比,组合单个模态的多个特征集以及三种模态可以大大提高分类准确率。尽管使用了不平衡的数据集,但三个类别的错误率相当相似。

相似文献

1
Combining image, voice, and the patient's questionnaire data to categorize laryngeal disorders.结合图像、语音和患者问卷数据对喉部疾病进行分类。
Artif Intell Med. 2010 May;49(1):43-50. doi: 10.1016/j.artmed.2010.02.002. Epub 2010 Mar 24.
2
Multiple feature sets based categorization of laryngeal images.基于多特征集的喉部图像分类
Comput Methods Programs Biomed. 2007 Mar;85(3):257-66. doi: 10.1016/j.cmpb.2006.11.002. Epub 2006 Dec 11.
3
Automated speech analysis applied to laryngeal disease categorization.应用于喉疾病分类的自动语音分析。
Comput Methods Programs Biomed. 2008 Jul;91(1):36-47. doi: 10.1016/j.cmpb.2008.01.008. Epub 2008 Mar 17.
4
Classification of functional voice disorders based on phonovibrograms.基于声门图的功能性嗓音障碍分类。
Artif Intell Med. 2010 May;49(1):51-9. doi: 10.1016/j.artmed.2010.01.001.
5
Using the patient's questionnaire data to screen laryngeal disorders.
Comput Biol Med. 2009 Feb;39(2):148-55. doi: 10.1016/j.compbiomed.2008.11.008. Epub 2009 Jan 13.
6
Towards a computer-aided diagnosis system for vocal cord diseases.迈向声带疾病的计算机辅助诊断系统。
Artif Intell Med. 2006 Jan;36(1):71-84. doi: 10.1016/j.artmed.2004.11.001.
7
A kernel-based approach to categorizing laryngeal images.
Comput Med Imaging Graph. 2007 Dec;31(8):587-94. doi: 10.1016/j.compmedimag.2007.07.003. Epub 2007 Aug 21.
8
Categorizing normal and pathological voices: automated and perceptual categorization.正常和病理性嗓音的分类:自动分类和感知分类。
J Voice. 2011 Nov;25(6):700-8. doi: 10.1016/j.jvoice.2010.04.009. Epub 2010 Jun 25.
9
Towards noninvasive screening for malignant tumours in human larynx.面向人类喉恶性肿瘤的无创筛查。
Med Eng Phys. 2010 Jan;32(1):83-9. doi: 10.1016/j.medengphy.2009.10.011. Epub 2009 Nov 18.
10
A generalized procedure for analyzing sustained and dynamic vocal fold vibrations from laryngeal high-speed videos using phonovibrograms.一种使用声振图从喉部高速视频分析持续和动态声带振动的通用程序。
Artif Intell Med. 2016 Jan;66:15-28. doi: 10.1016/j.artmed.2015.10.002. Epub 2015 Oct 30.

引用本文的文献

1
New developments in the application of artificial intelligence to laryngology.人工智能在喉科学中的应用新进展。
Curr Opin Otolaryngol Head Neck Surg. 2024 Dec 1;32(6):391-397. doi: 10.1097/MOO.0000000000000999. Epub 2024 Jul 24.
2
Advanced computing solutions for analysis of laryngeal disorders.高级计算解决方案用于分析喉部疾病。
Med Biol Eng Comput. 2019 Nov;57(11):2535-2552. doi: 10.1007/s11517-019-02031-9. Epub 2019 Sep 6.