
Application of Hidden Markov Models in Speech Recognition
now publishers Inc
1st Edition
Published on 20. February 2008
Book
Paperback/Softback
124 pages
978-1-60198-120-2 (ISBN)
Description
Hidden Markov Models (HMMs) provide a simple and effective framework for modelling time-varying spectral vector sequences. As a consequence, almost all present day large vocabulary continuous speech recognition (LVCSR) systems are based on HMMs. Whereas the basic principles underlying HMM-based LVCSR are rather straightforward, the approximations and simplifying assumptions involved in a direct implementation of these principles would result in a system which has poor accuracy and unacceptable sensitivity to changes in operating environment. Thus, the practical application of HMMs in modern systems involves considerable sophistication. The Application of Hidden Markov Models in Speech Recognition presents the core architecture of a HMM-based LVCSR system and proceeds to describe the various refinements which are needed to achieve state-of-the-art performance. These refinements include feature projection, improved covariance modelling, discriminative parameter estimation, adaptation and normalisation, noise compensation and multi-pass system combination. It concludes with a case study of LVCSR for Broadcast News and Conversation transcription in order to illustrate the techniques described. The Application of Hidden Markov Models in Speech Recognition is an invaluable resource for anybody with an interest in speech recognition technology.
More details
Series
Language
English
Place of publication
Hanover
United States
Dimensions
Height: 234 mm
Width: 156 mm
Thickness: 7 mm
Weight
186 gr
ISBN-13
978-1-60198-120-2 (9781601981202)
DOI
10.1561/2000000004
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification
Content
1: Introduction 2: Architecture of a HMM-Based Recogniser 3: HMM Structure Refinements 4: Parameter Estimation 5: Adaptation and Normalisation 6: Noise Robustness 7: Multi-Pass Recognition Architectures. Conclusions. Acknowledgements. Notations and Acronyms. References