Suppr超能文献

Netlang:一款借助复杂网络对语料库进行语言分析的软件。

Netlang: A software for the linguistic analysis of corpora by means of complex networks.

作者信息

Barceló-Coblijn Lluís, Serna Salazar Diego, Isaza Gustavo, Castillo Ossa Luis F, Bedia Manuel G

机构信息

Department of Catalan Philology and General Linguistics, University of the Balearic Islands, Palma, Balearic Islands, Spain.

Departamento de Sistemas e Informatica, Universidad de Caldas, Manizales, Caldas, Colombia.

出版信息

PLoS One. 2017 Aug 23;12(8):e0181341. doi: 10.1371/journal.pone.0181341. eCollection 2017.

Abstract

To date there is no software that directly connects the linguistic analysis of a conversation to a network program. Networks programs are able to extract statistical information from data basis with information about systems of interacting elements. Language has also been conceived and studied as a complex system. However, most proposals do not analyze language according to linguistic theory, but use instead computational systems that should save time at the price of leaving aside many crucial aspects for linguistic theory. Some approaches to network studies on language do apply precise linguistic analyses, made by a linguist. The problem until now has been the lack of interface between the analysis of a sentence and its integration into the network that could be managed by a linguist and that could save the analysis of any language. Previous works have used old software that was not created for these purposes and that often produced problems with some idiosyncrasies of the target language. The desired interface should be able to deal with the syntactic peculiarities of a particular language, the options of linguistic theory preferred by the user and the preservation of morpho-syntactic information (lexical categories and syntactic relations between items). Netlang is the first program able to do that. Recently, a new kind of linguistic analysis has been developed, which is able to extract a complexity pattern from the speaker's linguistic production which is depicted as a network where words are inside nodes, and these nodes connect each other by means of edges or links (the information inside the edge can be syntactic, semantic, etc.). The Netlang software has become the bridge between rough linguistic data and the network program. Netlang has integrated and improved the functions of programs used in the past, namely the DGA annotator and two scripts (ToXML.pl and Xml2Pairs.py) used for transforming and pruning data. Netlang allows the researcher to make accurate linguistic analysis by means of syntactic dependency relations between words, while tracking record of the nature of such syntactic relationships (subject, object, etc). The Netlang software is presented as a new tool that solve many problems detected in the past. The most important improvement is that Netlang integrates three past applications into one program, and is able to produce a series of file formats that can be read by a network program. Through the Netlang software, the linguistic network analysis based on syntactic analyses, characterized for its low cost and the completely non-invasive procedure aims to evolve into a sufficiently fine grained tool for clinical diagnosis in potential cases of language disorders.

摘要

到目前为止,还没有软件能直接将对话的语言分析与网络程序连接起来。网络程序能够从包含交互元素系统信息的数据基础中提取统计信息。语言也被视为一个复杂系统并进行了研究。然而,大多数提议并非依据语言理论来分析语言,而是使用计算系统,这些系统虽能节省时间,但却忽略了许多对语言理论至关重要的方面。一些语言网络研究方法确实应用了语言学家所做的精确语言分析。到目前为止,问题在于句子分析与其融入网络之间缺乏接口,而这个接口应由语言学家管理,并且能够保存对任何语言的分析。以往的工作使用的是并非为此目的而创建的旧软件,这些软件常常因目标语言的一些特性而产生问题。理想的接口应能够处理特定语言的句法特性、用户偏好的语言理论选项以及形态句法信息(词汇类别和项目之间的句法关系)的保存。Netlang是第一个能够做到这一点的程序。最近,一种新的语言分析方法得到了发展,它能够从说话者的语言产出中提取一种复杂性模式,这种模式被描绘为一个网络,其中单词位于节点内,这些节点通过边或链接相互连接(边内的信息可以是句法、语义等)。Netlang软件已成为粗略语言数据与网络程序之间的桥梁。Netlang整合并改进了过去使用的程序的功能,即DGA注释器以及用于转换和修剪数据的两个脚本(ToXML.pl和Xml2Pairs.py)。Netlang允许研究人员通过单词之间的句法依存关系进行精确的语言分析,同时跟踪此类句法关系的性质(主语、宾语等)记录。Netlang软件作为一种新工具被推出,它解决了过去发现的许多问题。最重要的改进是Netlang将三个过去的应用程序整合到一个程序中,并且能够生成一系列可被网络程序读取的文件格式。通过Netlang软件,基于句法分析的语言网络分析以其低成本和完全非侵入性的程序为特点,旨在发展成为一种足够精细的工具,用于潜在语言障碍病例的临床诊断。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd59/5568436/ba9f0e48aba1/pone.0181341.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验