• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估现代学习方法的分类准确率。

Evaluating classification accuracy for modern learning approaches.

作者信息

Li Jialiang, Gao Ming, D'Agostino Ralph

机构信息

Department of Statistics and Applied Probability, National University of Singapore, Singapore.

Duke University-NUS Graduate Medical School, Singapore.

出版信息

Stat Med. 2019 Jun 15;38(13):2477-2503. doi: 10.1002/sim.8103. Epub 2019 Jan 30.

DOI:10.1002/sim.8103
PMID:30701585
Abstract

Deep learning neural network models such as multilayer perceptron (MLP) and convolutional neural network (CNN) are novel and attractive artificial intelligence computing tools. However, evaluation of the performance of these methods is not readily available for practitioners yet. We provide a tutorial for evaluating classification accuracy for various state-of-the-art learning approaches, including familiar shallow and deep learning methods. For qualitative response variables with more than two categories, many traditional accuracy measures such as sensitivity, specificity, and area under the receiver operating characteristic curve are not applicable and we have to consider their extensions properly. In this paper, a few important statistical concepts for multicategory classification accuracy are reviewed and their utilities for various learning algorithms are demonstrated with real medical examples. We offer problem-based R code to illustrate how to perform these statistical computations step by step. We expect that such analysis tools will become more familiar to practitioners and receive broader applications in biostatistics.

摘要

深度学习神经网络模型,如多层感知器(MLP)和卷积神经网络(CNN),是新颖且有吸引力的人工智能计算工具。然而,从业者目前还难以获得这些方法性能的评估。我们提供了一个教程,用于评估各种最先进学习方法的分类准确率,包括常见的浅层和深度学习方法。对于具有两个以上类别的定性响应变量,许多传统的准确率度量,如灵敏度、特异性和接收器操作特征曲线下的面积并不适用,我们必须适当地考虑它们的扩展。本文回顾了多类别分类准确率的一些重要统计概念,并通过实际医学示例展示了它们在各种学习算法中的效用。我们提供基于问题的R代码来说明如何逐步执行这些统计计算。我们期望这样的分析工具能为从业者所更熟悉,并在生物统计学中得到更广泛的应用。

相似文献

1
Evaluating classification accuracy for modern learning approaches.评估现代学习方法的分类准确率。
Stat Med. 2019 Jun 15;38(13):2477-2503. doi: 10.1002/sim.8103. Epub 2019 Jan 30.
2
Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.优化神经网络在医学数据集上的应用:以新生儿呼吸暂停预测为例的研究
Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.
3
Architectures and accuracy of artificial neural network for disease classification from omics data.基于组学数据的疾病分类的人工神经网络结构和准确性。
BMC Genomics. 2019 Mar 4;20(1):167. doi: 10.1186/s12864-019-5546-z.
4
Automated Amharic News Categorization Using Deep Learning Models.基于深度学习模型的阿姆哈拉语新闻自动分类。
Comput Intell Neurosci. 2021 Jul 27;2021:3774607. doi: 10.1155/2021/3774607. eCollection 2021.
5
Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques.使用机器学习和深度学习技术评估白内障手术视频中的相位自动识别。
JAMA Netw Open. 2019 Apr 5;2(4):e191860. doi: 10.1001/jamanetworkopen.2019.1860.
6
Application of Deep Learning Architectures for Accurate and Rapid Detection of Internal Mechanical Damage of Blueberry Using Hyperspectral Transmittance Data.深度学习架构在利用高光谱透过率数据准确快速检测蓝莓内部机械损伤中的应用。
Sensors (Basel). 2018 Apr 7;18(4):1126. doi: 10.3390/s18041126.
7
Machine Learning and Deep Learning Approaches in Breast Cancer Survival Prediction Using Clinical Data.使用临床数据进行乳腺癌生存预测的机器学习和深度学习方法
Folia Biol (Praha). 2019;65(5-6):212-220. doi: 10.14712/fb2019065050212.
8
Deep learning assisted detection of glaucomatous optic neuropathy and potential designs for a generalizable model.深度学习辅助青光眼视神经病变检测及通用模型的潜在设计。
PLoS One. 2020 May 14;15(5):e0233079. doi: 10.1371/journal.pone.0233079. eCollection 2020.
9
A new ensemble residual convolutional neural network for remaining useful life estimation.一种新的集成残差卷积神经网络用于剩余使用寿命估计。
Math Biosci Eng. 2019 Jan 28;16(2):862-880. doi: 10.3934/mbe.2019040.
10
Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification.卷积神经网络 (CNN) 和循环神经网络 (RNN) 架构在放射学文本报告分类中的比较效果。
Artif Intell Med. 2019 Jun;97:79-88. doi: 10.1016/j.artmed.2018.11.004. Epub 2018 Nov 23.

引用本文的文献

1
Alteration of gut microbiota associated with hypertension in children.儿童高血压相关的肠道微生物群改变。
BMC Microbiol. 2025 May 8;25(1):282. doi: 10.1186/s12866-025-03999-1.
2
Comparison of two statistical methodologies for a binary classification problem of two-dimensional images.二维图像二元分类问题的两种统计方法比较
J Appl Stat. 2023 Nov 15;51(12):2279-2297. doi: 10.1080/02664763.2023.2279012. eCollection 2024.
3
A nomogram for predicting lymph node metastasis in early gastric signet ring cell carcinoma.预测早期胃印戒细胞癌淋巴结转移的列线图
Sci Rep. 2023 Sep 12;13(1):15039. doi: 10.1038/s41598-023-40733-1.
4
A network approach to compute hypervolume under receiver operating characteristic manifold for multi-class biomarkers.一种用于在多类生物标志物的接收器操作特征流形下计算超体积的网络方法。
Stat Med. 2023 Jan 3. doi: 10.1002/sim.9646.
5
A novel approach to joint prediction of preeclampsia and delivery timing using semicompeting risks.一种使用半竞争风险联合预测子痫前期和分娩时机的新方法。
Am J Obstet Gynecol. 2023 Mar;228(3):338.e1-338.e12. doi: 10.1016/j.ajog.2022.08.045. Epub 2022 Aug 26.
6
FetalGAN: Automated Segmentation of Fetal Functional Brain MRI Using Deep Generative Adversarial Learning and Multi-Scale 3D U-Net.胎儿生成对抗网络(FetalGAN):利用深度生成对抗学习和多尺度3D U型网络对胎儿功能性脑磁共振成像进行自动分割
Front Neurosci. 2022 Jun 7;16:887634. doi: 10.3389/fnins.2022.887634. eCollection 2022.
7
ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R.有序贝叶斯:使用R语言对高维数据拟合有序贝叶斯回归模型
Stats (Basel). 2022 Jun;5(2):371-384. doi: 10.3390/stats5020021. Epub 2022 Apr 15.
8
Gout and Hospital Admission for Ambulatory Care-Sensitive Conditions: Risks and Trajectories.痛风与需门诊治疗的慢性病住院:风险和轨迹。
J Rheumatol. 2022 Jul;49(7):731-739. doi: 10.3899/jrheum.220038. Epub 2022 Apr 15.
9
Evaluation of competing risks prediction models using polytomous discrimination index.使用多分类判别指数评估竞争风险预测模型。
Can J Stat. 2021 Sep;49(3):731-753. doi: 10.1002/cjs.11583. Epub 2020 Nov 20.
10
EA3: A softmax algorithm for evidence appraisal aggregation.EA3:证据评价聚合的 softmax 算法。
PLoS One. 2021 Jun 17;16(6):e0253057. doi: 10.1371/journal.pone.0253057. eCollection 2021.