Incorporating Knowledge Sources into Statistical Speech Recognition [electronic resource] / by Wolfgang Minker, Satoshi Nakamura, Konstantin Markov, Sakriani Sakti.

Material type: Text
Series: Lecture Notes in Electrical Engineering ; 42
Publisher: Boston, MA : Springer US, 2009
Description: online resource
Content type:
  • text
Media type:
  • computer
Carrier type:
  • online resource
ISBN:
  • 9780387858302
Additional physical formats: Printed edition: No title
Contents:
Introduction and Book Overview -- Statistical Speech Recognition -- Graphical Framework to Incorporate Knowledge Sources -- Speech Recognition Using GFIKS -- Conclusions and Future Directions.
In: Springer eBooks
Summary: Incorporating Knowledge Sources into Statistical Speech Recognition offers solutions for enhancing the robustness of a statistical automatic speech recognition (ASR) system by incorporating various additional knowledge sources while keeping the training and recognition effort feasible. The authors provide an efficient general framework for incorporating knowledge sources into state-of-the-art statistical ASR systems. This framework, which is called GFIKS (graphical framework to incorporate additional knowledge sources), was designed by utilizing the concept of the Bayesian network (BN) framework. This framework allows probabilistic relationships among different information sources to be learned, various kinds of knowledge sources to be incorporated, and a probabilistic function of the model to be formulated. Incorporating Knowledge Sources into Statistical Speech Recognition demonstrates how the statistical speech recognition system may incorporate additional information sources by utilizing GFIKS at different levels of ASR. The incorporation of various knowledge sources, including background noises, accent, gender and wide phonetic knowledge information, in modeling is discussed theoretically and analyzed experimentally.
Holdings
Item type: E-Book
Current library: Central Library
Status: Available
Barcode: E-38130

Maintained by VTU Library