Kouramajian V, Devadhar V, Fowler J, Maram S
Department of Computer Science, Wichita State University, Kansas 67260, USA.
Proc Annu Symp Comput Appl Med Care. 1995:878-82.
Categorization by Reference is a novel text classification technique that examines the existing classifications of the citations found in an as-yet unclassified text to determine what terms should be assigned to that text. The existence of the Medical Subject Headings and MEDLINE makes the biomedical domain a prime candidate for application of this technique. We describe our approach and implementation of a prototype, presenting some results of our initial tests. We further discuss refinements that could improve the precision of the technique, and describe its possible use in categorizing portions of the World-Wide Web.
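The core idea in the abstract, assigning terms to a text from the existing classifications of its citations, can be illustrated with a minimal sketch. The abstract does not say how the terms of the citations are combined, so the voting rule here is an assumption: a term is suggested when at least a given share of the cited works carry it. The function name `categorize_by_reference`, the parameters `min_share` and `max_terms`, and the sample terms are all hypothetical and are not taken from the prototype or its test data.

```python
from collections import Counter


def categorize_by_reference(cited_term_sets, min_share=0.5, max_terms=10):
    """Suggest terms for an unclassified text from the terms of its citations.

    cited_term_sets: one collection of already-assigned terms per cited work.
    min_share: assumed threshold, the fraction of citations that must carry a term.
    max_terms: assumed cap on the number of suggested terms.
    """
    if not cited_term_sets:
        return []

    counts = Counter()
    for terms in cited_term_sets:
        # Count each term at most once per cited work.
        counts.update(set(terms))

    total = len(cited_term_sets)
    # Rank by how many citations carry the term, then alphabetically for stability.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [term for term, n in ranked if n / total >= min_share][:max_terms]


# Illustrative input only: three cited works with made-up term assignments.
citations = [
    {"Humans", "MEDLINE", "Abstracting and Indexing"},
    {"Humans", "MEDLINE"},
    {"Humans", "Subject Headings"},
]
suggested = categorize_by_reference(citations)
print(suggested)
```

With this sample input, only terms shared by at least half of the cited works are suggested. A stricter `min_share` trades recall for precision, which is one simple way the refinements mentioned in the abstract might be approached.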