Logo Goletty

Analysis of Morph-Based Language Modeling and Speech Recognition in Slovak
Journal Title Advances in Electrical and Electronic Engineering
Journal Abbreviation AEEE
Publisher Group Technical University of Ostrava (VSB)
Website http://advances.utc.sk/index.php/AEEE
PDF (509 kb)
   
Title Analysis of Morph-Based Language Modeling and Speech Recognition in Slovak
Authors Stas, Jan; Hladek, Daniel; Juhar, Jozef; Zlacky, Daniel
Abstract The inflection of the Slovak language causes a large number of unique word forms, which produces not only a large vocabulary, but also a number of out-of-vocabulary words. Morph-based language models solve this problem by decomposition of inflected word forms into small sub-word units and resolve the general problem of sparsity the training data. In this paper, we present several rule-based and data-driven approaches to the automatic segmentation of words into morphs. These data are later used in the modeling of the Slovak language for large vocabulary continuous speech recognition. Preliminary results show a significant decrease in the number of out-of-vocabulary words and reduction of resultant language model perplexity.
Publisher Faculty of Electrical Engineering and Computer Science
Date 2012-11-24
Source Advances in Electrical and Electronic Engineering Vol 10, No 4 (2012): Special Issue
Rights Authors who publish with this journal agree to the following terms:Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work´s authorship and initial publication in this journal.Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal´s published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).

 

See other article in the same Issue


Goletty © 2024