可扩展的生物医学命名实体识别：数据库支持的支持向量机方法研究

Scalable biomedical Named Entity Recognition: investigation of a database-supported SVM approach.

作者信息

Habib Mona Soliman, Kalita Jugal

机构信息

Cairo Microsoft Innovation Lab, 306 Korniche El-Nile, Maadi Cairo, Egypt.

出版信息

Int J Bioinform Res Appl. 2010;6(2):191-208. doi: 10.1504/IJBRA.2010.032121.

DOI:10.1504/IJBRA.2010.032121

PMID:20223740

Abstract

This paper explores scalability issues associated with the Named Entity Recognition problem in the biomedical publications domain using Support Vector Machines. The performance results using existing binary and multi-class SVMs with increasing training data are compared to results obtained using our new implementations. Our approach eliminates prior language or domain-specific knowledge and achieves good out-of-the-box accuracy measures comparable to those obtained using more complex approaches. The training time of multi-class SVMs is reduced by several orders of magnitude, which would make support vector machines a more viable and practical solution for real-world problems with large datasets.

摘要

本文使用支持向量机探讨了生物医学出版物领域中与命名实体识别问题相关的可扩展性问题。将现有二分类和多分类支持向量机在训练数据增加时的性能结果与使用我们新实现方法获得的结果进行了比较。我们的方法无需先前的语言或特定领域知识，并且实现了与使用更复杂方法相当的良好开箱即用准确率指标。多分类支持向量机的训练时间减少了几个数量级，这将使支持向量机成为处理具有大型数据集的现实世界问题更可行、更实用的解决方案。

相似文献

Scalable biomedical Named Entity Recognition: investigation of a database-supported SVM approach.

Int J Bioinform Res Appl. 2010;6(2):191-208. doi: 10.1504/IJBRA.2010.032121.

Biomedical named entity recognition using two-phase model based on SVMs.

J Biomed Inform. 2004 Dec;37(6):436-47. doi: 10.1016/j.jbi.2004.08.012.

SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.

BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-8-S4-S2.

A new fuzzy support vectors machine for biomedical data classification.

Annu Int Conf IEEE Eng Med Biol Soc. 2008;2008:4676-9. doi: 10.1109/IEMBS.2008.4650256.

Two criteria for model selection in multiclass support vector machines.

IEEE Trans Syst Man Cybern B Cybern. 2008 Dec;38(6):1432-48. doi: 10.1109/TSMCB.2008.927272.

Posterior probability support vector machines for unbalanced data.

IEEE Trans Neural Netw. 2005 Nov;16(6):1561-73. doi: 10.1109/TNN.2005.857955.

Vicinal support vector classifier using supervised kernel-based clustering.

Artif Intell Med. 2014 Mar;60(3):189-96. doi: 10.1016/j.artmed.2014.01.003. Epub 2014 Feb 7.

New support vector-based design method for binary hierarchical classifiers for multi-class classification problems.

Neural Netw. 2008 Mar-Apr;21(2-3):502-10. doi: 10.1016/j.neunet.2007.12.005. Epub 2007 Dec 8.

Training hard-margin support vector machines using greedy stagewise algorithm.

IEEE Trans Neural Netw. 2008 Aug;19(8):1446-55. doi: 10.1109/TNN.2008.2000576.

Combined SVM-CRFs for biological named entity recognition with maximal bidirectional squeezing.

PLoS One. 2012;7(6):e39230. doi: 10.1371/journal.pone.0039230. Epub 2012 Jun 26.

引用本文的文献

Artificial intelligence and bioinformatics: a journey from traditional techniques to smart approaches.

Gastroenterol Hepatol Bed Bench. 2024;17(3):241-252. doi: 10.22037/ghfbb.v17i3.2977.

Combined SVM-CRFs for biological named entity recognition with maximal bidirectional squeezing.

PLoS One. 2012;7(6):e39230. doi: 10.1371/journal.pone.0039230. Epub 2012 Jun 26.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

可扩展的生物医学命名实体识别：数据库支持的支持向量机方法研究

Scalable biomedical Named Entity Recognition: investigation of a database-supported SVM approach.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献