• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于从多个生物医学注释中学习分类器的双凸优化

Bi-convex Optimization to Learn Classifiers from Multiple Biomedical Annotations.

作者信息

Wang Xin, Bi Jinbo

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):564-575. doi: 10.1109/TCBB.2016.2576457. Epub 2016 Jun 7.

DOI:10.1109/TCBB.2016.2576457
PMID:27295686
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5159326/
Abstract

The problem of constructing classifiers from multiple annotators who provide inconsistent training labels is important and occurs in many application domains. Many existing methods focus on the understanding and learning of the crowd behaviors. Several probabilistic algorithms consider the construction of classifiers for specific tasks using consensus of multiple labelers annotations. These methods impose a prior on the consensus and develop an expectation-maximization algorithm based on logistic regression loss. We extend the discussion to the hinge loss commonly used by support vector machines. Our formulations form bi-convex programs that construct classifiers and estimate the reliability of each labeler simultaneously. Each labeler is associated with a reliability parameter, which can be a constant, or class-dependent, or varies for different examples. The hinge loss is modified by replacing the true labels by the weighted combination of labelers' labels with reliabilities as weights. Statistical justification is discussed to motivate the use of linear combination of labels. In parallel to the expectation-maximization algorithm for logistic-based methods, efficient alternating algorithms are developed to solve the proposed bi-convex programs. Experimental results on benchmark datasets and three real-world biomedical problems demonstrate that the proposed methods either outperform or are competitive to the state of the art.

摘要

从提供不一致训练标签的多个注释者构建分类器的问题很重要,并且在许多应用领域中都会出现。许多现有方法专注于对群体行为的理解和学习。一些概率算法考虑使用多个标注者注释的共识来构建特定任务的分类器。这些方法对共识施加先验,并基于逻辑回归损失开发期望最大化算法。我们将讨论扩展到支持向量机常用的铰链损失。我们的公式形成了双凸规划,可同时构建分类器并估计每个标注者的可靠性。每个标注者都与一个可靠性参数相关联,该参数可以是常数、与类别相关或因不同示例而异。通过将真实标签替换为以可靠性为权重的标注者标签的加权组合来修改铰链损失。讨论了统计依据以推动标签线性组合的使用。与基于逻辑的方法的期望最大化算法并行,开发了高效的交替算法来解决所提出的双凸规划。在基准数据集和三个实际生物医学问题上的实验结果表明,所提出的方法要么优于现有技术,要么与之具有竞争力。

相似文献

1
Bi-convex Optimization to Learn Classifiers from Multiple Biomedical Annotations.用于从多个生物医学注释中学习分类器的双凸优化
IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):564-575. doi: 10.1109/TCBB.2016.2576457. Epub 2016 Jun 7.
2
Inter-labeler and intra-labeler variability of condition severity classification models using active and passive learning methods.采用主动学习和被动学习方法的条件严重程度分类模型的标签间和标签内变异性。
Artif Intell Med. 2017 Sep;81:12-32. doi: 10.1016/j.artmed.2017.03.003. Epub 2017 Apr 27.
3
Unsupervised Bayesian Inference to Fuse Biosignal Sensory Estimates for Personalizing Care.无监督贝叶斯推断融合生物信号传感器估计值以实现个性化护理。
IEEE J Biomed Health Inform. 2019 Jan;23(1):47-58. doi: 10.1109/JBHI.2018.2820054. Epub 2018 Jun 5.
4
Multiple-instance learning algorithms for computer-aided detection.用于计算机辅助检测的多实例学习算法
IEEE Trans Biomed Eng. 2008 Mar;55(3):1015-21. doi: 10.1109/TBME.2007.909544.
5
Performance-based classifier combination in atlas-based image segmentation using expectation-maximization parameter estimation.基于期望最大化参数估计的基于图谱的图像分割中基于性能的分类器组合
IEEE Trans Med Imaging. 2004 Aug;23(8):983-94. doi: 10.1109/TMI.2004.830803.
6
Probabilistic classification vector machines.概率分类向量机
IEEE Trans Neural Netw. 2009 Jun;20(6):901-14. doi: 10.1109/TNN.2009.2014161. Epub 2009 Apr 24.
7
Learn ++.NC: combining ensemble of classifiers with dynamically weighted consult-and-vote for efficient incremental learning of new classes.学习++。NC:将分类器集成与动态加权咨询投票相结合,以实现新类别的高效增量学习。
IEEE Trans Neural Netw. 2009 Jan;20(1):152-68. doi: 10.1109/TNN.2008.2008326. Epub 2008 Dec 22.
8
Mammographic masses characterization based on localized texture and dataset fractal analysis using linear, neural and support vector machine classifiers.基于局部纹理和数据集分形分析,使用线性、神经网络和支持向量机分类器对乳腺X线摄影肿块进行特征描述。
Artif Intell Med. 2006 Jun;37(2):145-62. doi: 10.1016/j.artmed.2006.03.002. Epub 2006 May 23.
9
A weighted rule based method for predicting malignancy of pulmonary nodules by nodule characteristics.一种基于加权规则的方法,通过结节特征预测肺结节的恶性程度。
J Biomed Inform. 2015 Aug;56:69-79. doi: 10.1016/j.jbi.2015.05.011. Epub 2015 May 22.
10
Fully corrective boosting with arbitrary loss and regularization.具有任意损失和正则化的完全校正提升。
Neural Netw. 2013 Dec;48:44-58. doi: 10.1016/j.neunet.2013.07.006. Epub 2013 Jul 16.

引用本文的文献

1
Chained Deep Learning Using Generalized Cross-Entropy for Multiple Annotators Classification.链式深度学习使用广义交叉熵进行多标注分类。
Sensors (Basel). 2023 Mar 28;23(7):3518. doi: 10.3390/s23073518.

本文引用的文献

1
Crowdsourcing for bioinformatics.生物信息学众包。
Bioinformatics. 2013 Aug 15;29(16):1925-33. doi: 10.1093/bioinformatics/btt333. Epub 2013 Jun 19.
2
Distributed human intelligence for colonic polyp classification in computer-aided detection for CT colonography.用于 CT 结肠成像中计算机辅助检测的结肠息肉分类的分布式人体智能。
Radiology. 2012 Mar;262(3):824-33. doi: 10.1148/radiol.11110938. Epub 2012 Jan 24.
3
Support vector machine versus logistic regression modeling for prediction of hospital mortality in critically ill patients with haematological malignancies.支持向量机与逻辑回归模型在预测血液系统恶性肿瘤重症患者医院死亡率中的比较
BMC Med Inform Decis Mak. 2008 Dec 5;8:56. doi: 10.1186/1472-6947-8-56.
4
Automated heart abnormality detection using sparse linear classifiers.使用稀疏线性分类器的自动心脏异常检测。
IEEE Eng Med Biol Mag. 2007 Mar-Apr;26(2):56-63. doi: 10.1109/memb.2007.335591.
5
Support vector machines versus logistic regression: improving prospective performance in clinical decision-making.支持向量机与逻辑回归:改善临床决策中的前瞻性表现
Ultrasound Obstet Gynecol. 2006 Jun;27(6):607-8. doi: 10.1002/uog.2791.
6
A cautionary note on the robustness of latent class models for estimating diagnostic error without a gold standard.关于在没有金标准的情况下用于估计诊断错误的潜在类别模型稳健性的警示说明。
Biometrics. 2004 Jun;60(2):427-35. doi: 10.1111/j.0006-341X.2004.00187.x.
7
Evaluation of diagnostic tests without gold standards.无金标准情况下诊断试验的评估
Stat Methods Med Res. 1998 Dec;7(4):354-70. doi: 10.1177/096228029800700404.