Suppr超能文献

A syllable-character collaborative model for enhanced Pinyin and Chinese recognition.

作者信息

Chen Zeyuan, Zhong Cheng, Chen Danyang

机构信息

School of Computer, Electronics and Information, Guangxi University, Nanning, Guangxi, China.

Key Laboratory of Parallel, Distributed and Intelligent Computing in Guangxi Universities and Colleges, Guangxi University, Nanning, Guangxi, China.

出版信息

PLoS One. 2025 Jul 7;20(7):e0325045. doi: 10.1371/journal.pone.0325045. eCollection 2025.

Abstract

In Chinese speech recognition, end-to-end speech recognition models usually use Chinese characters as direct output and perform poorly compared with other language models. The main reason for this phenomenon is that the relationship between Chinese text and pronunciation is more complex. Inspired by the learning process of Chinese beginners, who first master initials, finals, and pinyin before learning characters, we propose the Syllable-Character Collaborative Model (SCCM), which incorporates these phonetic elements into the training process. Additionally, we design a Pinyin-Ensemble module that employs an ensemble learning approach to reduce pinyin recognition errors, which in turn leads to a reduction in text recognition errors. Experiments on AISHELL-1 show that our approach not only reduces pinyin and character error rates compared to a prior end-to-end method using pinyin as auxiliary information, but also achieves a 45.7% relative reduction in Character Error Rate (CER) over the AISHELL-1 baseline.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fc6/12233228/dd59b53572d4/pone.0325045.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验