• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

扩展适用于多种语言的NegEx词汇表。

Extending the NegEx lexicon for multiple languages.

作者信息

Chapman Wendy W, Hillert Dieter, Velupillai Sumithra, Kvist Maria, Skeppstedt Maria, Chapman Brian E, Conway Mike, Tharp Melissa, Mowery Danielle L, Deleger Louise

机构信息

Division of Biomedical Informatics, University of California, San Diego, La Jolla, CA, USA.

出版信息

Stud Health Technol Inform. 2013;192:677-81.

PMID:23920642
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3923890/
Abstract

We translated an existing English negation lexicon (NegEx) to Swedish, French, and German and compared the lexicon on corpora from each language. We observed Zipf's law for all languages, i.e., a few phrases occur a large number of times, and a large number of phrases occur fewer times. Negation triggers "no" and "not" were common for all languages; however, other triggers varied considerably. The lexicon is available in OWL and RDF format and can be extended to other languages. We discuss the challenges in translating negation triggers to other languages and issues in representing multilingual lexical knowledge.

摘要

我们将现有的英语否定词词典(NegEx)翻译成瑞典语、法语和德语,并在每种语言的语料库上对该词典进行了比较。我们观察到所有语言都遵循齐普夫定律,即少数短语出现的次数很多,而大量短语出现的次数较少。否定触发词“no”和“not”在所有语言中都很常见;然而,其他触发词差异很大。该词典以OWL和RDF格式提供,并且可以扩展到其他语言。我们讨论了将否定触发词翻译成其他语言时面临的挑战以及表示多语言词汇知识时存在的问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c002/3923890/8d572cae2984/nihms550233f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c002/3923890/4a9f6e344af8/nihms550233f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c002/3923890/8d572cae2984/nihms550233f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c002/3923890/4a9f6e344af8/nihms550233f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c002/3923890/8d572cae2984/nihms550233f2.jpg

相似文献

1
Extending the NegEx lexicon for multiple languages.扩展适用于多种语言的NegEx词汇表。
Stud Health Technol Inform. 2013;192:677-81.
2
Medication extraction and guessing in Swedish, French and English.瑞典语、法语和英语的药物提取与猜测
Stud Health Technol Inform. 2013;192:1189.
3
Acquisition of lexical resources from SNOMED for medical language processing.从医学系统命名法(SNOMED)获取词汇资源用于医学语言处理。
Stud Health Technol Inform. 1998;52 Pt 1:586-90.
4
Towards a multilingual medical lexicon.迈向多语言医学词汇表。
AMIA Annu Symp Proc. 2006;2006:534-8.
5
Automatic lexeme acquisition for a multilingual medical subword thesaurus.用于多语言医学子词词典的自动词元获取。
Int J Med Inform. 2007 Feb-Mar;76(2-3):184-9. doi: 10.1016/j.ijmedinf.2006.05.032. Epub 2006 Jul 12.
6
The effect of a general lexicon in corpus-based identification of French-English medical word translations.通用词汇表在基于语料库的法英医学词汇翻译识别中的作用。
Stud Health Technol Inform. 2003;95:397-402.
7
Solvable null model for the distribution of word frequencies.词频分布的可解零模型
Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Oct;70(4 Pt 1):042901. doi: 10.1103/PhysRevE.70.042901. Epub 2004 Oct 25.
8
A semantic lexicon for medical language processing.用于医学语言处理的语义词典。
J Am Med Inform Assoc. 1999 May-Jun;6(3):205-18. doi: 10.1136/jamia.1999.0060205.
9
Towards a unified medical lexicon for French.迈向统一的法语医学词汇表。
Stud Health Technol Inform. 2003;95:415-20.
10
GALEN: a third generation terminology tool to support a multipurpose national coding system for surgical procedures.盖伦:一种支持外科手术多用途国家编码系统的第三代术语工具。
Int J Med Inform. 2000 Sep;58-59:71-85. doi: 10.1016/s1386-5056(00)00077-0.

引用本文的文献

1
Multi-task transfer learning for the prediction of entity modifiers in clinical text: application to opioid use disorder case detection.多任务迁移学习在临床文本中实体修饰符预测中的应用:在阿片类药物使用障碍病例检测中的应用。
J Biomed Semantics. 2024 Jun 7;15(1):11. doi: 10.1186/s13326-024-00311-4.
2
Moving Biosurveillance Beyond Coded Data Using AI for Symptom Detection From Physician Notes: Retrospective Cohort Study.利用人工智能从医生笔记中检测症状,推动生物监测超越编码数据:回顾性队列研究。
J Med Internet Res. 2024 Apr 4;26:e53367. doi: 10.2196/53367.
3
A natural language processing algorithm accurately classifies steatotic liver disease pathology to estimate the risk of cirrhosis.

本文引用的文献

1
Negation detection in Swedish clinical text: An adaption of NegEx to Swedish.瑞典临床文本中的否定检测:NegEx对瑞典语的适应性调整。
J Biomed Semantics. 2011;2 Suppl 3(Suppl 3):S3. doi: 10.1186/2041-1480-2-S3-S3. Epub 2011 Jul 14.
2
2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.2010 i2b2/VA 挑战赛:临床文本中的概念、断言和关系
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.
3
Document-level classification of CT pulmonary angiography reports based on an extension of the ConText algorithm.
一种自然语言处理算法可以准确地对脂肪性肝病的病理学进行分类,以估计肝硬化的风险。
Hepatol Commun. 2024 Mar 29;8(4). doi: 10.1097/HC9.0000000000000403. eCollection 2024 Apr 1.
4
Negation recognition in clinical natural language processing using a combination of the NegEx algorithm and a convolutional neural network.使用 NegEx 算法和卷积神经网络相结合的方法进行临床自然语言处理中的否定识别。
BMC Med Inform Decis Mak. 2023 Oct 13;23(1):216. doi: 10.1186/s12911-023-02301-5.
5
Linguistic and ontological challenges of multiple domains contributing to transformed health ecosystems.多个领域在促成变革性健康生态系统方面面临的语言和本体论挑战。
Front Med (Lausanne). 2023 Mar 15;10:1073313. doi: 10.3389/fmed.2023.1073313. eCollection 2023.
6
Increasing adverse drug events extraction robustness on social media: Case study on negation and speculation.提高社交媒体中药物不良反应抽取的稳健性:基于否定词和推测词的案例研究。
Exp Biol Med (Maywood). 2022 Nov;247(22):2003-2014. doi: 10.1177/15353702221128577. Epub 2022 Oct 31.
7
Radiology Text Analysis System (RadText): Architecture and Evaluation.放射学文本分析系统(RadText):架构与评估
Proc (IEEE Int Conf Healthc Inform). 2022 Jun;2022:288-296. doi: 10.1109/ichi54592.2022.00050. Epub 2022 Sep 8.
8
Real-world Patient Trajectory Prediction from Clinical Notes Using Artificial Neural Networks and UMLS-Based Extraction of Concepts.使用人工神经网络和基于统一医学语言系统的概念提取从临床记录中进行真实世界患者轨迹预测
J Healthc Inform Res. 2021 Jun 5;5(4):474-496. doi: 10.1007/s41666-021-00100-z. eCollection 2021 Dec.
9
Annotation and initial evaluation of a large annotated German oncological corpus.一个大型带注释的德语肿瘤学语料库的注释与初步评估。
JAMIA Open. 2021 Apr 19;4(2):ooab025. doi: 10.1093/jamiaopen/ooab025. eCollection 2021 Apr.
10
Natural language processing of radiology reports to investigate the effects of the COVID-19 pandemic on the incidence and age distribution of fractures.利用自然语言处理技术对放射科报告进行分析,以探讨 COVID-19 大流行对骨折发病率和年龄分布的影响。
Skeletal Radiol. 2022 Feb;51(2):375-380. doi: 10.1007/s00256-021-03760-5. Epub 2021 Apr 13.
基于 ConText 算法扩展的 CT 肺动脉造影报告的文档级分类。
J Biomed Inform. 2011 Oct;44(5):728-37. doi: 10.1016/j.jbi.2011.03.011. Epub 2011 Apr 1.
4
Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.梅奥临床文本分析和知识提取系统(cTAKES):架构、组件评估和应用。
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):507-13. doi: 10.1136/jamia.2009.001560.
5
An overview of MetaMap: historical perspective and recent advances.MetaMap 概述:历史视角与最新进展。
J Am Med Inform Assoc. 2010 May-Jun;17(3):229-36. doi: 10.1136/jamia.2009.002733.
6
Recognizing obesity and comorbidities in sparse data.在稀疏数据中识别肥胖及合并症。
J Am Med Inform Assoc. 2009 Jul-Aug;16(4):561-70. doi: 10.1197/jamia.M3115. Epub 2009 Apr 23.
7
Towards a multilingual medical lexicon.迈向多语言医学词汇表。
AMIA Annu Symp Proc. 2006;2006:534-8.
8
Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system.提取用于哮喘研究的主要诊断、合并症和吸烟状况:自然语言处理系统的评估
BMC Med Inform Decis Mak. 2006 Jul 26;6:30. doi: 10.1186/1472-6947-6-30.
9
A controlled trial of automated classification of negation from clinical notes.一项关于临床记录中否定词自动分类的对照试验。
BMC Med Inform Decis Mak. 2005 May 5;5:13. doi: 10.1186/1472-6947-5-13.
10
A simple algorithm for identifying negated findings and diseases in discharge summaries.一种用于识别出院小结中否定性检查结果和疾病的简单算法。
J Biomed Inform. 2001 Oct;34(5):301-10. doi: 10.1006/jbin.2001.1029.