Suppr超能文献

历时语法分析语料库(GATA):包含 52 种语言的数据集。

Grammars Across Time Analyzed (GATA): a dataset of 52 languages.

机构信息

Department for Linguistic and Cultural Evolution, Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany.

Institut für Linguistik, Universität Leipzig, Leipzig, Germany.

出版信息

Sci Data. 2023 Nov 28;10(1):835. doi: 10.1038/s41597-023-02659-1.

Abstract

Grammars Across Time Analyzed (GATA) is a resource capturing two snapshots of the grammatical structure of a diverse range of languages separated in time, aimed at furthering research on historical linguistics, language evolution, and cultural change. GATA comprises grammatical information on 52 diverse languages across all continents, featuring morphological, syntactic, and phonological information based on published grammars of the same language at two different time points. Here we introduce the coding scheme and design features of GATA, and we describe some salient patterns related to language change and the coverage of grammatical descriptions over time.

摘要

语法跨时分析(GATA)是一个资源,捕捉了时间上相隔的多种语言的语法结构的两个快照,旨在促进历史语言学、语言进化和文化变迁的研究。GATA 包含了来自各大洲的 52 种不同语言的语法信息,具有基于同一语言在两个不同时间点的已发表语法的形态、句法和语音信息。在这里,我们介绍了 GATA 的编码方案和设计特点,并描述了一些与语言变化和语法描述随时间的覆盖范围有关的显著模式。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e9/10684564/6cc1056ea450/41597_2023_2659_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验