• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

空中交通系统中的实时控制动态感应。

Real-time Controlling Dynamics Sensing in Air Traffic System.

机构信息

National Key Laboratory of Fundamental Science on Synthetic Vision, Sichuan University, Chengdu 610065, China.

National Key Laboratory of Air Traffic Control Automation System Technology, Sichuan University, Chengdu 610065, China.

出版信息

Sensors (Basel). 2019 Feb 7;19(3):679. doi: 10.3390/s19030679.

DOI:10.3390/s19030679
PMID:30736452
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6387294/
Abstract

In order to obtain real-time controlling dynamics in air traffic system, a framework is proposed to introduce and process air traffic control (ATC) speech via radiotelephony communication. An automatic speech recognition (ASR) and controlling instruction understanding (CIU)-based pipeline is designed to convert the ATC speech into ATC related elements, i.e., controlling intent and parameters. A correction procedure is also proposed to improve the reliability of the information obtained by the proposed framework. In the ASR model, acoustic model (AM), pronunciation model (PM), and phoneme- and word-based language model (LM) are proposed to unify multilingual ASR into one model. In this work, based on their tasks, the AM and PM are defined as speech recognition and machine translation problems respectively. Two-dimensional convolution and average-pooling layers are designed to solve special challenges of ASR in ATC. An encoder⁻decoder architecture-based neural network is proposed to translate phoneme labels into word labels, which achieves the purpose of ASR. In the CIU model, a recurrent neural network-based joint model is proposed to detect the controlling intent and label the controlling parameters, in which the two tasks are solved in one network to enhance the performance with each other based on ATC communication rules. The ATC speech is now converted into ATC related elements by the proposed ASR and CIU model. To further improve the accuracy of the sensing framework, a correction procedure is proposed to revise minor mistakes in ASR decoding results based on the flight information, such as flight plan, ADS-B. The proposed models are trained using real operating data and applied to a civil aviation airport in China to evaluate their performance. Experimental results show that the proposed framework can obtain real-time controlling dynamics with high performance, only 4% word-error rate. Meanwhile, the decoding efficiency can also meet the requirement of real-time applications, i.e., an average 0.147 real time factor. With the proposed framework and obtained traffic dynamics, current ATC applications can be accomplished with higher accuracy. In addition, the proposed ASR pipeline has high reusability, which allows us to apply it to other controlling scenes and languages with minor changes.

摘要

为了在航空交通系统中获得实时控制动态,本文提出了一种通过无线电通信引入和处理空中交通管制(ATC)语音的框架。设计了一个基于自动语音识别(ASR)和控制指令理解(CIU)的管道,将 ATC 语音转换为与 ATC 相关的元素,即控制意图和参数。还提出了一种校正程序来提高所提出框架获得的信息的可靠性。在 ASR 模型中,提出了声学模型(AM)、发音模型(PM)和基于音素和单词的语言模型(LM),将多语言 ASR 统一到一个模型中。在这项工作中,根据其任务,将 AM 和 PM 分别定义为语音识别和机器翻译问题。二维卷积和平均池化层被设计用于解决 ATC 中 ASR 的特殊挑战。提出了一种基于编码器-解码器架构的神经网络,将音素标签转换为单词标签,从而实现 ASR 的目的。在 CIU 模型中,提出了一种基于循环神经网络的联合模型来检测控制意图并标记控制参数,其中两个任务在一个网络中解决,以根据 ATC 通信规则相互增强性能。通过所提出的 ASR 和 CIU 模型,将 ATC 语音转换为与 ATC 相关的元素。为了进一步提高感知框架的准确性,提出了一种校正程序,根据飞行计划、ADS-B 等飞行信息,修正 ASR 解码结果中的小错误。使用真实运行数据对所提出的模型进行训练,并将其应用于中国的一个民用机场来评估其性能。实验结果表明,所提出的框架可以以高性能获得实时控制动态,仅 4%的单词错误率。同时,解码效率也可以满足实时应用的要求,即平均 0.147 实时因子。使用所提出的框架和获得的交通动态,可以以更高的精度完成当前的 ATC 应用。此外,所提出的 ASR 管道具有很高的可重用性,允许我们对其他控制场景和语言进行少量更改后应用它。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9be/6387294/da3445a46913/sensors-19-00679-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9be/6387294/166854318a85/sensors-19-00679-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9be/6387294/553c37423909/sensors-19-00679-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9be/6387294/fd0d20a8270b/sensors-19-00679-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9be/6387294/c069591ddee2/sensors-19-00679-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9be/6387294/da3445a46913/sensors-19-00679-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9be/6387294/166854318a85/sensors-19-00679-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9be/6387294/553c37423909/sensors-19-00679-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9be/6387294/fd0d20a8270b/sensors-19-00679-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9be/6387294/c069591ddee2/sensors-19-00679-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9be/6387294/da3445a46913/sensors-19-00679-g005.jpg

相似文献

1
Real-time Controlling Dynamics Sensing in Air Traffic System.空中交通系统中的实时控制动态感应。
Sensors (Basel). 2019 Feb 7;19(3):679. doi: 10.3390/s19030679.
2
A Unified Framework for Multilingual Speech Recognition in Air Traffic Control Systems.用于空中交通管制系统的多语言语音识别的统一框架。
IEEE Trans Neural Netw Learn Syst. 2021 Aug;32(8):3608-3620. doi: 10.1109/TNNLS.2020.3015830. Epub 2021 Aug 3.
3
Enhancing Air Traffic Control Communication Systems with Integrated Automatic Speech Recognition: Models, Applications and Performance Evaluation.利用集成自动语音识别增强空中交通管制通信系统:模型、应用与性能评估
Sensors (Basel). 2024 Jul 20;24(14):4715. doi: 10.3390/s24144715.
4
A lightweight speech recognition method with target-swap knowledge distillation for Mandarin air traffic control communications.一种用于普通话空中交通管制通信的具有目标交换知识蒸馏的轻量级语音识别方法。
PeerJ Comput Sci. 2023 Nov 1;9:e1650. doi: 10.7717/peerj-cs.1650. eCollection 2023.
5
Machine learning based sample extraction for automatic speech recognition using dialectal Assamese speech.基于机器学习的方言阿萨姆语语音自动识别样本提取。
Neural Netw. 2016 Jun;78:97-111. doi: 10.1016/j.neunet.2015.12.010. Epub 2015 Dec 30.
6
Mandarin Electrolaryngeal Speech Recognition Based on WaveNet-CTC.基于 WaveNet-CTC 的普通话电声语音识别。
J Speech Lang Hear Res. 2019 Jul 15;62(7):2203-2212. doi: 10.1044/2019_JSLHR-S-18-0313. Epub 2019 Jun 14.
7
Assessment and analysis of accents in air traffic control speech: a fusion of deep learning and information theory.空中交通管制语音中口音的评估与分析:深度学习与信息论的融合
Front Neurorobot. 2024 Mar 5;18:1360094. doi: 10.3389/fnbot.2024.1360094. eCollection 2024.
8
End-to-End Automatic Pronunciation Error Detection Based on Improved Hybrid CTC/Attention Architecture.基于改进的混合 CTC/注意力架构的端到端自动发音错误检测。
Sensors (Basel). 2020 Mar 25;20(7):1809. doi: 10.3390/s20071809.
9
Identification of articulation error patterns using a novel dependence network.使用新型依存网络识别发音错误模式。
IEEE Trans Biomed Eng. 2011 Nov;58(11):3061-8. doi: 10.1109/TBME.2011.2135352.
10
Two-Step Joint Optimization with Auxiliary Loss Function for Noise-Robust Speech Recognition.两步联合优化与辅助损失函数的噪声鲁棒语音识别。
Sensors (Basel). 2022 Jul 19;22(14):5381. doi: 10.3390/s22145381.

引用本文的文献

1
Study on the standardization method of radiotelephony communication in low-altitude airspace based on BART.基于BART的低空空域无线电话通信标准化方法研究
Front Neurorobot. 2025 Apr 2;19:1482327. doi: 10.3389/fnbot.2025.1482327. eCollection 2025.
2
Enhancing Air Traffic Control Communication Systems with Integrated Automatic Speech Recognition: Models, Applications and Performance Evaluation.利用集成自动语音识别增强空中交通管制通信系统:模型、应用与性能评估
Sensors (Basel). 2024 Jul 20;24(14):4715. doi: 10.3390/s24144715.
3
SLKIR: A framework for extracting key information from air traffic control instructions Using small sample learning.
SLKIR:一种使用小样本学习从空中交通管制指令中提取关键信息的框架。
Sci Rep. 2024 Apr 29;14(1):9791. doi: 10.1038/s41598-024-60675-6.