Task-Oriented Approaches to Automatic Speech Recognition
Language:
English
Abstract
We describe several computer-implemented systems for automatic speech recognition. The systems are designed for specific communication tasks in which the human talker and the machine interact in a disciplined dialog. One speaker-dependent system recognizes individual spoken words and provides information on airline flight schedules. The same system, combined with a programmed syntax analyzer, recognizes whole sentences chosen from a subset of natural English. Another system provides automated telephone directory assistance by recognizing voice-spelled names and speaking back the requested telephone number. Still another system recognizes digits spoken by any speaker and provides the capability for automatic voice dialing. All of the speech recognition systems use dynamic programming to match spoken input with stored reference templates. The reference templates for each vocabulary word are constructed from linear predictor coefficients (LPC), which are measured over word utterances. We give performance data for the systems when they are operated over conventional dialed-up telephone connections.
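The template-matching step mentioned in the abstract can be illustrated with a minimal dynamic time warping sketch. This is not the authors' implementation: the function names `dtw_distance` and `recognize` are hypothetical, the frame-to-frame distance is plain Euclidean distance rather than an LPC-based measure, and no path slope constraints or endpoint relaxation are applied. It assumes each utterance is a sequence of equal-length feature vectors (for example, LPC coefficients per frame).

```python
import math

def dtw_distance(test, ref):
    """Accumulated distance of the best monotonic alignment between two
    sequences of feature vectors (illustrative; Euclidean frame distance)."""
    n, m = len(test), len(ref)
    # cost[i][j] = best accumulated distance aligning test[:i] with ref[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(test[i - 1], ref[j - 1])  # local frame distance
            # Extend the cheapest of the three allowed predecessor paths.
            cost[i][j] = d + min(cost[i - 1][j],      # test frame repeated on same ref frame
                                 cost[i][j - 1],      # ref frame skipped ahead
                                 cost[i - 1][j - 1])  # both advance
    return cost[n][m]

def recognize(test, templates):
    """Return the vocabulary word whose stored template is closest to the input."""
    return min(templates, key=lambda word: dtw_distance(test, templates[word]))

# Toy one-dimensional "features"; real systems would use LPC-derived vectors.
templates = {
    "one": [(0.0,), (1.0,), (2.0,)],
    "two": [(2.0,), (1.0,), (0.0,)],
}
utterance = [(0.0,), (0.0,), (1.0,), (2.0,), (2.0,)]  # a slower rendition of "one"
best = recognize(utterance, templates)
```

In this toy case `best` is `"one"`, because the time-stretched input aligns with that template at zero accumulated distance. A speaker-independent system would typically store several templates per word and pick the closest among all of them.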
Citation
Rosenberg, A., Flanagan, J., Levinson, S., and Rabiner, L., "Task-Oriented Approaches to Automatic Speech Recognition," SAE Technical Paper 800196, 1980, https://doi.org/10.4271/800196.