使用 WEKA 模型器比较用于预测自闭症谱系障碍的分类算法。

Comparison of classification algorithms for predicting autistic spectrum disorder using WEKA modeler.

机构信息

Centre for Global Sustainability Studies, Universiti Sains Malaysia, Minden, Malaysia.

School of Languages, Literacies, and Translation, Universiti Sains Malaysia, Minden, Malaysia.

出版信息

BMC Med Inform Decis Mak. 2022 Nov 24;22(1):306. doi: 10.1186/s12911-022-02050-x.

DOI:10.1186/s12911-022-02050-x

PMID:36434656

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9700876/

Abstract

BACKGROUND

In healthcare area, big data, if integrated with machine learning, enables health practitioners to predict the result of a disorder or disease more accurately. In Autistic Spectrum Disorder (ASD), it is important to screen the patients to enable them to undergo proper treatments as early as possible. However, difficulties may arise in predicting ASD occurrences accurately, mainly caused by human errors. Data mining, if embedded into health screening practice, can help to overcome the difficulties. This study attempts to evaluate the performance of six best classifiers, taken from existing works, at analysing ASD screening training dataset.

RESULT

We tested Naive Bayes, Logistic Regression, KNN, J48, Random Forest, SVM, and Deep Neural Network algorithms to ASD screening dataset and compared the classifiers' based on significant parameters; sensitivity, specificity, accuracy, receiver operating characteristic, area under the curve, and runtime, in predicting ASD occurrences. We also found that most of previous studies focused on classifying health-related dataset while ignoring the missing values which may contribute to significant impacts to the classification result which in turn may impact the life of the patients. Thus, we addressed the missing values by implementing imputation method where they are replaced with the mean of the available records found in the dataset.

CONCLUSION

We found that J48 produced promising results as compared to other classifiers when tested in both circumstances, with and without missing values. Our findings also suggested that SVM does not necessarily perform well for small and simple datasets. The outcome is hoped to assist health practitioners in making accurate diagnosis of ASD occurrences in patients.

摘要

背景

在医疗保健领域，大数据如果与机器学习相结合，可以帮助医疗从业者更准确地预测疾病或疾病的结果。在自闭症谱系障碍（ASD）中，对患者进行筛查以使其能够尽早接受适当的治疗非常重要。然而，准确预测 ASD 的发生可能会遇到困难，主要是由于人为错误。数据挖掘如果嵌入到健康筛查实践中，可以帮助克服这些困难。本研究尝试评估从现有工作中选取的六种最佳分类器在分析 ASD 筛查训练数据集方面的性能。

结果

我们测试了朴素贝叶斯、逻辑回归、KNN、J48、随机森林、SVM 和深度神经网络算法，对 ASD 筛查数据集进行了测试，并根据灵敏度、特异性、准确性、接收者操作特征、曲线下面积和运行时间等重要参数对分类器进行了比较，以预测 ASD 的发生。我们还发现，大多数先前的研究都集中在对健康相关数据集进行分类，而忽略了缺失值，这些缺失值可能会对分类结果产生重大影响，进而影响患者的生命。因此，我们通过实施插补方法来解决缺失值问题，即将缺失值替换为数据集内可用记录的平均值。

结论

我们发现，在有和没有缺失值的情况下，J48 的测试结果都优于其他分类器。我们的研究结果还表明，SVM 不一定适用于小型和简单的数据集。希望这一结果能帮助医疗从业者对患者的 ASD 发生做出准确诊断。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ef0/9700876/b5cfb6182be8/12911_2022_2050_Fig1_HTML.jpg

相似文献

Comparison of classification algorithms for predicting autistic spectrum disorder using WEKA modeler.使用 WEKA 模型器比较用于预测自闭症谱系障碍的分类算法。

BMC Med Inform Decis Mak. 2022 Nov 24;22(1):306. doi: 10.1186/s12911-022-02050-x.

Identification of newborns at risk for autism using electronic medical records and machine learning.利用电子病历和机器学习识别自闭症风险新生儿。

Eur Psychiatry. 2020 Feb 26;63(1):e22. doi: 10.1192/j.eurpsy.2020.17.

A clustering approach for autistic trait classification.自闭症特质分类的聚类方法。

Inform Health Soc Care. 2020 Sep;45(3):309-326. doi: 10.1080/17538157.2019.1687482. Epub 2020 Feb 3.

A new classification system for autism based on machine learning of artificial intelligence.基于人工智能机器学习的自闭症新分类系统。

Technol Health Care. 2022;30(3):605-622. doi: 10.3233/THC-213032.

A comparison of machine learning algorithms for the surveillance of autism spectrum disorder.机器学习算法在自闭症谱系障碍监测中的比较。

PLoS One. 2019 Sep 25;14(9):e0222907. doi: 10.1371/journal.pone.0222907. eCollection 2019.

Comparative analysis of weka-based classification algorithms on medical diagnosis datasets.基于 WEKA 的分类算法在医学诊断数据集上的比较分析。

Technol Health Care. 2023;31(S1):397-408. doi: 10.3233/THC-236034.

Machine Learning Prediction of Autism Spectrum Disorder From a Minimal Set of Medical and Background Information.基于最小的医疗和背景信息集对自闭症谱系障碍的机器学习预测。

JAMA Netw Open. 2024 Aug 1;7(8):e2429229. doi: 10.1001/jamanetworkopen.2024.29229.

Diagnosis of Autism Spectrum Disorders in Young Children Based on Resting-State Functional Magnetic Resonance Imaging Data Using Convolutional Neural Networks.基于卷积神经网络的静息态功能磁共振成像数据对幼儿孤独症谱系障碍的诊断。

J Digit Imaging. 2019 Dec;32(6):899-918. doi: 10.1007/s10278-019-00196-1.

Prediction and Analysis of Autism Spectrum Disorder Using Machine Learning Techniques.使用机器学习技术预测和分析自闭症谱系障碍。

J Healthc Eng. 2023 Jul 10;2023:4853800. doi: 10.1155/2023/4853800. eCollection 2023.

Autism spectrum disorder detection with kNN imputer and machine learning classifiers via questionnaire mode of screening.通过问卷调查筛选模式，利用k近邻插补法和机器学习分类器进行自闭症谱系障碍检测。

Health Inf Sci Syst. 2024 Mar 6;12(1):18. doi: 10.1007/s13755-024-00277-8. eCollection 2024 Dec.

本文引用的文献

Exploring message framing to engage parents in early screening for autism spectrum disorder.探索信息框架以促使家长参与自闭症谱系障碍的早期筛查。

Patient Educ Couns. 2020 Jun 27. doi: 10.1016/j.pec.2020.06.024.

Voice Disorder Identification by using Hilbert-Huang Transform (HHT) and K Nearest Neighbor (KNN).基于希尔伯特-黄变换（HHT）和 K 最近邻（KNN）的嗓音障碍识别。

J Voice. 2021 Nov;35(6):932.e1-932.e11. doi: 10.1016/j.jvoice.2020.03.009. Epub 2020 May 10.

Comparing regression, naive Bayes, and random forest methods in the prediction of individual survival to second lactation in Holstein cattle.比较回归、朴素贝叶斯和随机森林方法在荷斯坦奶牛个体预测第二次泌乳存活中的应用。

J Dairy Sci. 2019 Oct;102(10):9409-9421. doi: 10.3168/jds.2019-16295. Epub 2019 Aug 22.

A review of screening tools for the identification of autism spectrum disorders and developmental delay in infants and young children: recommendations for use in low- and middle-income countries.婴幼儿孤独症谱系障碍和发育迟缓筛查工具的评价：在中低收入国家使用的建议。

Autism Res. 2019 Feb;12(2):176-199. doi: 10.1002/aur.2033. Epub 2019 Feb 1.

Gait abnormalities in minimally disabled people with Multiple Sclerosis: A 3D-motion analysis study.多发性硬化症轻度残疾患者的步态异常：一项 3D 运动分析研究。

Mult Scler Relat Disord. 2019 Apr;29:100-107. doi: 10.1016/j.msard.2019.01.028. Epub 2019 Jan 23.

Amnestic Mild Cognitive Impairment Is Associated With Frequency-Specific Brain Network Alterations in Temporal Poles.遗忘型轻度认知障碍与颞极频率特异性脑网络改变有关。

Front Aging Neurosci. 2018 Dec 6;10:400. doi: 10.3389/fnagi.2018.00400. eCollection 2018.

An accessible and efficient autism screening method for behavioural data and predictive analyses.一种适用于行为数据和预测分析的便捷、高效的自闭症筛查方法。

Health Informatics J. 2019 Dec;25(4):1739-1755. doi: 10.1177/1460458218796636. Epub 2018 Sep 19.

Developmental pathways to autism: a review of prospective studies of infants at risk.自闭症的发展途径：对高危婴儿的前瞻性研究综述。

Neurosci Biobehav Rev. 2014 Feb;39(100):1-33. doi: 10.1016/j.neubiorev.2013.12.001. Epub 2013 Dec 18.

Autism spectrum disorder and autistic traits in the Avon Longitudinal Study of Parents and Children: precursors and early signs.自闭症谱系障碍和自闭症特质在父母和孩子的雅芳纵向研究中：前兆和早期迹象。

J Am Acad Child Adolesc Psychiatry. 2012 Mar;51(3):249-260.e25. doi: 10.1016/j.jaac.2011.12.009. Epub 2012 Feb 3.

The relationship between carers' report of autistic traits and clinical diagnoses of autism spectrum disorders in adults with intellectual disability.照顾者报告的自闭症特征与智障成年人自闭症谱系障碍的临床诊断之间的关系。

Res Dev Disabil. 2010 May-Jun;31(3):705-12. doi: 10.1016/j.ridd.2010.01.012. Epub 2010 Feb 25.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用 WEKA 模型器比较用于预测自闭症谱系障碍的分类算法。

Comparison of classification algorithms for predicting autistic spectrum disorder using WEKA modeler.

机构信息

出版信息

BACKGROUND

RESULT

CONCLUSION

背景

结果

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献