• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

印地语CCG语料库:一个来自印地语依存关系语料库的CCG语料库。

Hindi CCGbank: A CCG treebank from the Hindi dependency treebank.

作者信息

Ambati Bharat Ram, Deoskar Tejaswini, Steedman Mark

机构信息

1ILCC, School of Informatics, University of Edinburgh, Edinburgh, UK.

2Institute for Logic, Language and Computation, University of Amsterdam, Amsterdam, The Netherlands.

出版信息

Lang Resour Eval. 2018;52(1):67-100. doi: 10.1007/s10579-017-9379-6. Epub 2017 Jan 25.

DOI:10.1007/s10579-017-9379-6
PMID:31983909
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6954025/
Abstract

In this paper, we present an approach for automatically creating a combinatory categorial grammar (CCG) treebank from a dependency treebank for the subject-object-verb language Hindi. Rather than a direct conversion from dependency trees to CCG trees, we propose a two stage approach: a language independent generic algorithm first extracts a CCG lexicon from the dependency treebank. An exhaustive CCG parser then creates a treebank of CCG derivations. We also discuss special cases of this generic algorithm to handle linguistic phenomena specific to Hindi. In doing so we extract different constructions with long-range dependencies like coordinate constructions and non-projective dependencies resulting from constructions like relative clauses, noun elaboration and verbal modifiers.

摘要

在本文中,我们提出了一种从印地语(主宾动语言)的依存句法库自动创建组合范畴语法(CCG)句法库的方法。我们不是直接将依存树转换为CCG树,而是提出了一种两阶段方法:一种与语言无关的通用算法首先从依存句法库中提取CCG词库。然后,一个详尽的CCG解析器创建一个CCG推导的句法库。我们还讨论了这种通用算法的特殊情况,以处理印地语特有的语言现象。在此过程中,我们提取了具有长距离依存关系的不同结构,如并列结构以及由关系从句、名词细化和动词修饰语等结构产生的非投射依存关系。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/88958f61dc34/10579_2017_9379_Fig25_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/effab269c7c7/10579_2017_9379_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/12bc58ee102e/10579_2017_9379_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/02fbc6a35249/10579_2017_9379_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/3c6b2b852313/10579_2017_9379_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/250877aa5d78/10579_2017_9379_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/fc74fc41326a/10579_2017_9379_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/b65b8b406a06/10579_2017_9379_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/ec890ac940c7/10579_2017_9379_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/dba7abfb2995/10579_2017_9379_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/a9e8d5f815e8/10579_2017_9379_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/2ff47c9045f6/10579_2017_9379_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/55854a33c0b3/10579_2017_9379_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/a6a4245ec67a/10579_2017_9379_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/b8d19b8f0b18/10579_2017_9379_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/bcb4078eabce/10579_2017_9379_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/7b9e6e9b85f6/10579_2017_9379_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/ed8f1706ea02/10579_2017_9379_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/ac4db7ce1f4d/10579_2017_9379_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/7ccef5cd1966/10579_2017_9379_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/124fa9a0b326/10579_2017_9379_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/da352fd5abdd/10579_2017_9379_Fig21_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/13efcb6de701/10579_2017_9379_Fig22_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/5c0265d48cd7/10579_2017_9379_Fig23_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/4c86a7dcdabe/10579_2017_9379_Fig24_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/88958f61dc34/10579_2017_9379_Fig25_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/effab269c7c7/10579_2017_9379_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/12bc58ee102e/10579_2017_9379_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/02fbc6a35249/10579_2017_9379_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/3c6b2b852313/10579_2017_9379_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/250877aa5d78/10579_2017_9379_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/fc74fc41326a/10579_2017_9379_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/b65b8b406a06/10579_2017_9379_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/ec890ac940c7/10579_2017_9379_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/dba7abfb2995/10579_2017_9379_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/a9e8d5f815e8/10579_2017_9379_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/2ff47c9045f6/10579_2017_9379_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/55854a33c0b3/10579_2017_9379_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/a6a4245ec67a/10579_2017_9379_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/b8d19b8f0b18/10579_2017_9379_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/bcb4078eabce/10579_2017_9379_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/7b9e6e9b85f6/10579_2017_9379_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/ed8f1706ea02/10579_2017_9379_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/ac4db7ce1f4d/10579_2017_9379_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/7ccef5cd1966/10579_2017_9379_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/124fa9a0b326/10579_2017_9379_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/da352fd5abdd/10579_2017_9379_Fig21_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/13efcb6de701/10579_2017_9379_Fig22_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/5c0265d48cd7/10579_2017_9379_Fig23_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/4c86a7dcdabe/10579_2017_9379_Fig24_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3775/6954025/88958f61dc34/10579_2017_9379_Fig25_HTML.jpg

相似文献

1
Hindi CCGbank: A CCG treebank from the Hindi dependency treebank.印地语CCG语料库:一个来自印地语依存关系语料库的CCG语料库。
Lang Resour Eval. 2018;52(1):67-100. doi: 10.1007/s10579-017-9379-6. Epub 2017 Jan 25.
2
Locality and expectation effects in Hindi preverbal constituent ordering.印地语动词前成分语序中的局部性和预期性效应。
Cognition. 2022 Jun;223:104959. doi: 10.1016/j.cognition.2021.104959. Epub 2022 Jan 25.
3
Porting a lexicalized-grammar parser to the biomedical domain.将词汇语法分析器移植到生物医学领域。
J Biomed Inform. 2009 Oct;42(5):852-65. doi: 10.1016/j.jbi.2008.12.004. Epub 2008 Dec 25.
4
Category Locality Theory: A unified account of locality effects in sentence comprehension.范畴局部性理论:句子理解中局部性效应的统一解释。
Cognition. 2024 Jun;247:105766. doi: 10.1016/j.cognition.2024.105766. Epub 2024 Apr 6.
5
Minimization of dependency length in written English.书面英语中依存长度的最小化。
Cognition. 2007 Nov;105(2):300-33. doi: 10.1016/j.cognition.2006.09.011.
6
Cross-Lingual Universal Dependency Parsing Only From One Monolingual Treebank.
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13393-13407. doi: 10.1109/TPAMI.2023.3291388. Epub 2023 Oct 3.
7
Sentence interpretation in normal and aphasic Hindi speakers.正常和失语的印地语使用者的句子理解。
Brain Lang. 1991 Aug;41(2):250-74. doi: 10.1016/0093-934x(91)90155-t.
8
Parsing clinical text: how good are the state-of-the-art parsers?解析临床文本:最先进的解析器有多出色?
BMC Med Inform Decis Mak. 2015;15 Suppl 1(Suppl 1):S2. doi: 10.1186/1472-6947-15-S1-S2. Epub 2015 May 20.
9
From exemplar to grammar: a probabilistic analogy-based model of language learning.从范例到语法:基于概率类比的语言学习模型。
Cogn Sci. 2009 Jul;33(5):752-93. doi: 10.1111/j.1551-6709.2009.01031.x. Epub 2009 Apr 8.
10
Bidirectional incremental parsing for automatic pathway identification with combinatory categorial grammar.用于基于组合范畴语法的自动通路识别的双向增量解析
Pac Symp Biocomput. 2001:396-407.

本文引用的文献

1
DEPENDENCY THEORY: A FORMALISM AND SOME OBSERVATIONS. MEM RM-4087-PR.
Memo RM. 1964 Jul;42:1-39.