McKellar Cindy A, Puttkammer Martin J
Centre for Text Technology, North-West University, South Africa.
Data Brief. 2020 Jan 14;29:105146. doi: 10.1016/j.dib.2020.105146. eCollection 2020 Apr.
This data article describes the Autshumato machine translation evaluation set. The evaluation set contains data that can be used to evaluate machine translation systems between any pair of the 11 official South African languages. The dataset is parallel, with four reference translations available for each of the following languages: Afrikaans, English, isiNdebele, isiXhosa, isiZulu, Sepedi, Sesotho, Setswana, Siswati, Tshivenḓa and Xitsonga.