This content is not included in your SAE MOBILUS subscription, or you are not logged in.

Multi-part Analysis and Techniques for Air Traffic Speech Recognition

Journal Article
01-15-02-0014
ISSN: 1946-3855, e-ISSN: 1946-3901
Published May 25, 2022 by SAE International in United States
Multi-part Analysis and Techniques for Air Traffic Speech
                    Recognition
Sector:
Citation: Srinivasan, N. and Balasundaram, S., "Multi-part Analysis and Techniques for Air Traffic Speech Recognition," SAE Int. J. Aerosp. 15(2):145-157, 2022, https://doi.org/10.4271/01-15-02-0014.
Language: English

Abstract:

The general English speech recognition is based on the techniques of n-grams where the words before and after are predicted and the utterance prediction is produced. At the same time, having a significantly lengthier n-gram has its own impact in training and the accuracy. Shorter n-grams require the utterances to be split and predicted than using the complete utterance. This article discusses specific techniques to address the specific problems in Air Traffic Speech, which is a medium length utterance domain. Moving from the adapted language models (LMs) to rescored LM, a combined technique of syntax analysis along with a deep learning model is proposed, which improves the overall accuracy. It is explained that this technique can help to adapt the proposed method for different contexts within the same domain and can be successful.