This content is not included in
your SAE MOBILUS subscription, or you are not logged in.
Multi-part Analysis and Techniques for Air Traffic Speech Recognition
Journal Article
01-15-02-0014
ISSN: 1946-3855, e-ISSN: 1946-3901
Sector:
Citation:
Srinivasan, N. and Balasundaram, S., "Multi-part Analysis and Techniques for Air Traffic Speech Recognition," SAE Int. J. Aerosp. 15(2):145-157, 2022, https://doi.org/10.4271/01-15-02-0014.
Language:
English
Abstract:
The general English speech recognition is based on the techniques of n-grams
where the words before and after are predicted and the utterance prediction is
produced. At the same time, having a significantly lengthier n-gram has its own
impact in training and the accuracy. Shorter n-grams require the utterances to
be split and predicted than using the complete utterance. This article discusses
specific techniques to address the specific problems in Air Traffic Speech,
which is a medium length utterance domain. Moving from the adapted language
models (LMs) to rescored LM, a combined technique of syntax analysis along with
a deep learning model is proposed, which improves the overall accuracy. It is
explained that this technique can help to adapt the proposed method for
different contexts within the same domain and can be successful.