Agarwal Shashank, Choubey Lisha, Yu Hong
University of Wisconsin-Milwaukee, Milwaukee, WI.
AMIA Annu Symp Proc. 2010 Nov 13;2010:11-5.
Citations are widely used in scientific literature. The traditional model of referencing treats all citations alike; semantically, however, citations play different roles. By studying the context in which a citation appears, it is possible to determine the role it plays. Here, we report the development of an eight-category classification scheme, the annotation of sentences with that scheme, and the development and evaluation of supervised machine-learning classifiers trained on the annotated data. We annotated 1,710 sentences with the scheme, and our trained classifier obtained an average F1-score of 76.5%. The classifier is freely available as a Java API from http://citation.askhermes.org.
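To make the reported metric concrete, the following is a minimal sketch of how an average F1-score over several citation categories could be computed from gold and predicted labels. It assumes macro-averaging (an unweighted mean of per-category F1), which the abstract does not specify. The class name MacroF1, its methods, and the category names "Background", "Method", and "Result" are hypothetical placeholders. They are not the paper's eight categories, and this is not the API distributed at citation.askhermes.org.

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

/** Illustrative only: per-category F1 and its macro average (assumed averaging scheme). */
public class MacroF1 {

    /** F1 for a single category, computed from true/false positives and false negatives. */
    public static double f1ForLabel(List<String> gold, List<String> predicted, String label) {
        int tp = 0, fp = 0, fn = 0;
        for (int i = 0; i < gold.size(); i++) {
            boolean isGold = gold.get(i).equals(label);
            boolean isPred = predicted.get(i).equals(label);
            if (isGold && isPred) {
                tp++;
            } else if (isPred) {
                fp++;
            } else if (isGold) {
                fn++;
            }
        }
        // F1 = 2*TP / (2*TP + FP + FN); treated as 0 when the label never occurs.
        int denominator = 2 * tp + fp + fn;
        return denominator == 0 ? 0.0 : (2.0 * tp) / denominator;
    }

    /** Unweighted mean of per-category F1 over every label seen in gold or predicted. */
    public static double macroF1(List<String> gold, List<String> predicted) {
        if (gold.size() != predicted.size()) {
            throw new IllegalArgumentException("gold and predicted must have equal length");
        }
        Set<String> labels = new LinkedHashSet<>(gold);
        labels.addAll(predicted);
        if (labels.isEmpty()) {
            return 0.0;
        }
        double sum = 0.0;
        for (String label : labels) {
            sum += f1ForLabel(gold, predicted, label);
        }
        return sum / labels.size();
    }

    public static void main(String[] args) {
        // Placeholder category names; the abstract does not list the eight categories.
        List<String> gold = List.of("Background", "Method", "Method", "Result");
        List<String> predicted = List.of("Background", "Method", "Result", "Result");
        System.out.println("macro F1 = " + macroF1(gold, predicted));
    }
}
```

A weighted average, in which each category's F1 is weighted by its number of gold sentences, would be an equally plausible reading of "average F1-score". The two differ when the categories are unevenly represented.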