01-15-02-0014: Multi-part Analysis and Techniques for Air Traffic Speech Recognition - Journal Article

Features

Authors

Narayanan Srinivasan

National Institute of Technology, Computer Applications, India

S. R. Balasundaram

National Institute of Technology, India

Abstract

Content: The general English speech recognition is based on the techniques of n-grams where the words before and after are predicted and the utterance prediction is produced. At the same time, having a significantly lengthier n-gram has its own impact in training and the accuracy. Shorter n-grams require the utterances to be split and predicted than using the complete utterance. This article discusses specific techniques to address the specific problems in Air Traffic Speech, which is a medium length utterance domain. Moving from the adapted language models (LMs) to rescored LM, a combined technique of syntax analysis along with a deep learning model is proposed, which improves the overall accuracy. It is explained that this technique can help to adapt the proposed method for different contexts within the same domain and can be successful.

Meta Tags

Topics: Voice / speech
Air traffic control
Machine learning
Education and training

Affiliated or Co-Author: National Institute of Technology, Computer Applications, India
National Institute of Technology, India

Details

DOI: https://doi.org/10.4271/01-15-02-0014

Pages: 17

Citation: Srinivasan, N., and Balasundaram, S., "Multi-part Analysis and Techniques for Air Traffic Speech Recognition," SAE Int. J. Aerosp. 15(2):145-157, 2022, https://doi.org/10.4271/01-15-02-0014.

Additional Details

Publisher: SAE International

Published: May 25, 2022

Product Code: 01-15-02-0014

Content Type: Journal Article

Language: English

SAE International Journal of Aerospace

SAE International Journal of Aerospace Image

Volume 15, Issue 2