Sampling strategies in a statistical approach to clinical classification.

Authors

Yang Y, Chute C G

Affiliation

Section of Medical Information Resources, Mayo Clinic/Foundation, Rochester, Minnesota 55905, USA.

Publication

Proc Annu Symp Comput Appl Med Care. 1995:32-6.

Abstract

This paper studies sampling strategies for the Expert Network (ExpNet), a statistical learning system used for patient record classification at the Mayo Clinic. The goal is to achieve high-accuracy classification at an affordable computational cost in very large applications. The learning curves of ExpNet were observed with respect to the choice of training resources; the size, vocabulary coverage, and category coverage of a training set; and the category distribution over training instances. A method combining the advantages of different sampling strategies is proposed and evaluated using a large training corpus. As a result, ExpNet achieved near-optimal classification accuracy (measured by average precision) using a relatively small training set, with a fast real-time response that satisfies the needs of human-machine interaction.
