Logo Goletty

Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?
Journal Title CIT. Journal of Computing and Information Technology
Journal Abbreviation CIT
Publisher Group University of Zagreb
Website http://cit.srce.unizg.hr/index.php/CIT
PDF (169 kb)
   
Title Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?
Authors Ljubešić, Nikola; Bago, Petra; Boras, Damir
Abstract This research is the first step towards developing a system for translating Croatian weather forecasts into multiple languages. This step deals with the Croatian-English language pair. The parallel corpus consists of a one-year sample of the weather forecasts for the Adriatic, consisting of 7,893 sentence pairs. Evaluation is performed by the automatic evaluation measures BLUE, NIST and METEOR, as well as by manually evaluating a sample of 200 translations. We have shown that with a small-sized training set and the state-of-the-art Moses system, decoding can be done with 96% accuracy concerning adequacy and fluency. Additional improvement is expected by increasing the training set size. Finally, the correlation of the recorded evaluation measures is explored.
Publisher University of Zagreb, University Computing Centre - SRCE
Date 2011-02-04
Source Journal of Computing and Information Technology Vol 18, No 4 (2010): Special Issue from the 2010 ITI Conference
Rights CIT. Journal of Computing and Information Technology is an open access journal.Authors who publish with this journal agree to the following terms:Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work´s authorship and initial publication in this journal.Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal´s published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).

 

See other article in the same Issue


Goletty © 2024