Cohen K Bretonnel, Goss Foster R, Zweigenbaum Pierre, Hunter Lawrence E
Computational Bioscience Program, University of Colorado School of Medicine, Aurora, CO, USA.
University of Colorado School of Medicine, Department of Emergency Medicine, Aurora, CO, USA.
Stud Health Technol Inform. 2017;245:346-350.
Prior knowledge of the distributional characteristics of linguistic phenomena can be useful for a variety of language processing tasks. This paper describes the distribution of negation in two types of biomedical texts: scientific journal articles and progress notes. Two types of negation are examined: explicit negation at the syntactic level and affixal negation at the sub-word level. The data show that the distribution of negation is significantly different in the two document types, with explicit negation more frequent in the clinical documents than in the scientific publications and affixal negation more frequent in the journal articles at the type level and token levels. All code is available on GitHub
事先了解语言现象的分布特征有助于完成各种语言处理任务。本文描述了否定在两种生物医学文本中的分布情况:科学期刊文章和病程记录。研究了两种否定类型:句法层面的显性否定和子词层面的词缀否定。数据表明,否定在这两种文档类型中的分布存在显著差异,临床文档中的显性否定比科学出版物中更频繁,而在类型层面和词元层面,期刊文章中的词缀否定更频繁。所有代码可在GitHub上获取