• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

空间语言数据协调、共享和地图创建的最佳实践——以乌拉尔语为例。

Best practices for spatial language data harmonization, sharing and map creation-A case study of Uralic.

机构信息

Department of Geography and Geology, University of Turku, Turku, Finland.

Giellagas Institute for Saami Studies, University of Oulu, Oulu, Finland.

出版信息

PLoS One. 2022 Jun 8;17(6):e0269648. doi: 10.1371/journal.pone.0269648. eCollection 2022.

DOI:10.1371/journal.pone.0269648
PMID:35675367
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9176854/
Abstract

Despite remarkable progress in digital linguistics, extensive databases of geographical language distributions are missing. This hampers both studies on language spatiality and public outreach of language diversity. We present best practices for creating and sharing digital spatial language data by collecting and harmonizing Uralic language distributions as case study. Language distribution studies have utilized various methodologies, and the results are often available as printed maps or written descriptions. In order to analyze language spatiality, the information must be digitized into geospatial data, which contains location, time and other parameters. When compiled and harmonized, this data can be used to study changes in languages' distribution, and combined with, for example, population and environmental data. We also utilized the knowledge of language experts to adjust previous and new information of language distributions into state-of-the-art maps. The extensive database, including the distribution datasets and detailed map visualizations of the Uralic languages are introduced alongside this article, and they are freely available.

摘要

尽管数字语言学取得了显著进展,但仍缺乏广泛的地理语言分布数据库。这既阻碍了语言空间性的研究,也阻碍了语言多样性的公众宣传。我们通过收集和协调乌拉尔语系的分布情况作为案例研究,提出了创建和共享数字空间语言数据的最佳实践。语言分布研究采用了各种方法,研究结果通常以印刷地图或书面描述的形式呈现。为了分析语言空间性,必须将信息数字化为包含位置、时间和其他参数的地理空间数据。当编译和协调这些数据时,可以用来研究语言分布的变化,并与人口和环境数据等结合使用。我们还利用语言专家的知识,将语言分布的新旧信息调整为最新的地图。本文还介绍了包括乌拉尔语言分布数据集和详细地图可视化的广泛数据库,并免费提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f26/9176854/c89654543626/pone.0269648.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f26/9176854/d47e5901e3d2/pone.0269648.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f26/9176854/854acb891246/pone.0269648.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f26/9176854/6bdcc402d87a/pone.0269648.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f26/9176854/c89654543626/pone.0269648.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f26/9176854/d47e5901e3d2/pone.0269648.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f26/9176854/854acb891246/pone.0269648.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f26/9176854/6bdcc402d87a/pone.0269648.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f26/9176854/c89654543626/pone.0269648.g004.jpg

相似文献

1
Best practices for spatial language data harmonization, sharing and map creation-A case study of Uralic.空间语言数据协调、共享和地图创建的最佳实践——以乌拉尔语为例。
PLoS One. 2022 Jun 8;17(6):e0269648. doi: 10.1371/journal.pone.0269648. eCollection 2022.
2
Practices and Barriers in Developing and Disseminating Plain-Language Resources Reporting Medical Research Information: A Scoping Review.开发和传播医学研究信息的简明语言资源报告的实践和障碍:范围综述。
Patient. 2024 Sep;17(5):493-518. doi: 10.1007/s40271-024-00700-y. Epub 2024 Jun 15.
3
High resolution population distribution maps for Southeast Asia in 2010 and 2015.2010 年和 2015 年东南亚高分辨率人口分布地图。
PLoS One. 2013;8(2):e55882. doi: 10.1371/journal.pone.0055882. Epub 2013 Feb 13.
4
Diachronic Atlas of Comparative Linguistics (DiACL)-A database for ancient language typology.历时比较语言学图集 (DiACL)-古代语言类型学数据库。
PLoS One. 2018 Oct 11;13(10):e0205313. doi: 10.1371/journal.pone.0205313. eCollection 2018.
5
LexiRumah: An online lexical database of the Lesser Sunda Islands.雷希拉马:巽他群岛在线词汇数据库。
PLoS One. 2018 Oct 17;13(10):e0205250. doi: 10.1371/journal.pone.0205250. eCollection 2018.
6
Cultural and climatic changes shape the evolutionary history of the Uralic languages.文化和气候的变化塑造了乌拉尔语系的演化历史。
J Evol Biol. 2013 Jun;26(6):1244-53. doi: 10.1111/jeb.12107. Epub 2013 May 16.
7
At the boundaries of syntactic prehistory.句法史前史的边界。
Philos Trans R Soc Lond B Biol Sci. 2021 May 10;376(1824):20200197. doi: 10.1098/rstb.2020.0197. Epub 2021 Mar 22.
8
Linking norms, ratings, and relations of words and concepts across multiple language varieties.跨多种语言变体连接词和概念的规范、评级和关系。
Behav Res Methods. 2022 Apr;54(2):864-884. doi: 10.3758/s13428-021-01650-1. Epub 2021 Aug 6.
9
Enabling data sharing and utilization for African population health data using OHDSI tools with an OMOP-common data model.利用 OHDSI 工具和 OMOP 通用数据模型,实现非洲人口健康数据的共享和利用。
Front Public Health. 2023 Jun 9;11:1116682. doi: 10.3389/fpubh.2023.1116682. eCollection 2023.
10
Harmonization process for the identification of medical events in eight European healthcare databases: the experience from the EU-ADR project.在八个欧洲医疗保健数据库中识别医疗事件的协调过程:来自 EU-ADR 项目的经验。
J Am Med Inform Assoc. 2013 Jan 1;20(1):184-92. doi: 10.1136/amiajnl-2012-000933. Epub 2012 Sep 6.

引用本文的文献

1
A global and interoperable dataset of linguistic distributions derived from the Atlas of the World's Languages.一个源自《世界语言地图集》的全球通用且可互操作的语言分布数据集。
Sci Data. 2025 Aug 22;12(1):1466. doi: 10.1038/s41597-025-05828-6.
2
A revised digital edition of Wurm & Hattori's Language Atlas of the Pacific Area.沃姆和服部的《太平洋地区语言图集》修订数字版。
Sci Data. 2024 Aug 29;11(1):949. doi: 10.1038/s41597-024-03816-w.

本文引用的文献

1
Coding culture: challenges and recommendations for comparative cultural databases.编码文化:比较文化数据库面临的挑战与建议
Evol Hum Sci. 2020 Jun 1;2:e29. doi: 10.1017/ehs.2020.30. eCollection 2020.
2
Triangulation supports agricultural spread of the Transeurasian languages.三角测量法支持了泛欧亚语系在农业上的传播。
Nature. 2021 Nov;599(7886):616-621. doi: 10.1038/s41586-021-04108-8. Epub 2021 Nov 10.
3
Cross-Linguistic Data Formats, advancing data sharing and re-use in comparative linguistics.跨语言数据格式,促进比较语言学中的数据共享和再利用。
Sci Data. 2018 Oct 16;5:180205. doi: 10.1038/sdata.2018.205.
4
D-PLACE: A Global Database of Cultural, Linguistic and Environmental Diversity.D-PLACE:一个关于文化、语言和环境多样性的全球数据库。
PLoS One. 2016 Jul 8;11(7):e0158391. doi: 10.1371/journal.pone.0158391. eCollection 2016.
5
A comparison of worldwide phonemic and genetic variation in human populations.全球人类群体中语音和基因变异的比较。
Proc Natl Acad Sci U S A. 2015 Feb 3;112(5):1265-72. doi: 10.1073/pnas.1424033112. Epub 2015 Jan 20.