This content is not included in
your SAE MOBILUS subscription, or you are not logged in.
Improving In-Vehicle Voice Recognition Systems: Basis for Enhancing the Telematics Experience
Technical Paper
2000-01-3286
ISSN: 0148-7191, e-ISSN: 2688-3627
Annotation ability available
Sector:
Language:
English
Abstract
Although voice recognition is not essential for telematics application, it is certainly one of the most attractive features for the customer who buys it. The appealing is such that may determine the whole system success in the market and, therefore, deserves our attention. This paper discusses the voice recognition task and ways to improve it in the vehicle hostile environment. In order to create a robust system, artificial neural networks are being used in combination to the appropriate signal processing, which includes temporal alignment and spectral transformation for MFCCs features extraction. The problems related to in-vehicle application are described and feasible solutions are suggested. A proposed architecture is then designed considering all foreseen complications, taking advantage of sensors already available in other vehicle systems, ensuring the performance optimization at minimum cost increase.
Authors
Citation
Vollet, R., "Improving In-Vehicle Voice Recognition Systems: Basis for Enhancing the Telematics Experience," SAE Technical Paper 2000-01-3286, 2000, https://doi.org/10.4271/2000-01-3286.Also In
References
- Furui, S. Digital Speech Processing, Synthesis, and Recognition New York Marcel Dekker Inc. 1989 0-8247-7965-7
- Runstein, F. Violaro, F. Nunes, H.F. In: Simpósio Brasileiro de Telecomunicações, 13° 1995 Anais 1995 155 160
- Linford, P. W. Tattersall, G. D. “Non-linear time normalization of utterances for speech recognition using MLP's” Proceedings of the Institute of Acoustics, Speech and Signal Processing 33 3 587 594 June 1985
- Oppenheim, A. V. Schafer, R. W. Discrete-Time Signal Processing Prentice Hall Englewood Cliffs, New Jersey 1989 0-13-216292-X
- Timoszczuk, A. P. Reconhecimento Automático do Locutor com Redes Neurais Artificiais do Tipo Radia Basis Function (RBF) e Minimal Temporal Information (MTI) São Paulo 1998
- Picone, J. W. “Signal modeling techniques in speech recognition” Proceedings of the IEEE 81 9 1215 1247 September 1993
- Atal, B.S. “Automatic recognition of speakers from their voices” Proceedings of IEEE 64 4 460 475 Apr. 1976
- Corsi, P. “Speaker recognition: a survey” In: PROCEEDINGS OF THE NATO ADVANCED STUDY INSTITUTE Bonas 1982 Automatic Speech Analysis and Recognition Dordrecht D. Reidel Publishing Company 1982 277 308
- Glenn, J. W. Kleiner, N. “Speaker identification based on nasal phonation” The Journal of the Acoustic Society of America 43 4 368 372 1968
- Hertz, J. Krogh, A. Palmer, R. G. Introduction to the Theory of Neural Computation Addison-Wesley Publishing Company California 1991 0-201-50395-6
- Lippman, R. P. “An Introduction to Computing with Neural Nets” IEEE ASSP Magazine April 1987 4 21
- Carpenter, G. A. Grossberg, S. “ART2: Self-organization of Stable Category Recognition Codes for Analog Input Patterns” Applied Optics 26 4919 4930 Anderson, Pellionisz, & Rosenfeld 1990 151 162
- Fausett, L. Fundamentals of Neural Networks - Architectures, Algorithms and Applications Prentice-Hall New Jersey 1994 0-13-042250-9
- McCulloch, N. Ainsworth, W. A. Linggard, R. “Multi-layer perceptrons applied to speech technology” Br Telecom Technology Journal 6 2 April 1988