Multiple Linear Regression for Extracting Phrase Translation Pairs
Journal Title Journal of Computers
Journal Abbreviation jcp
Publisher Group Academy Publisher
Website http://ojs.academypublisher.com
Authors Liang, Ying-Hong; Lu, Zhi-Mao; Ren, Ming-Yuan; Zhang, Chun-Xiang; Sun, Da-Song; Liu, Yong
Abstract Phrase translation pairs are very useful for bilingual lexicography, machine translation systems, cross-lingual information retrieval, and many other applications in natural language processing. They are typically extracted from bilingual sentence pairs. In this paper, we extract phrase translation pairs from the word alignment results of Chinese-English bilingual sentence pairs and the parse trees of the Chinese sentences, in order to reduce the influence of grammatical disagreement between Chinese and English. We propose three discriminative features for evaluating the extracted pairs: translation literality, phrase alignment probability, and phrase length difference. A multiple linear regression model combined with an N-best strategy is then used to filter the phrase translation pairs and improve evaluation and filtering performance. Experimental results indicate that, of the three discriminative features, phrase alignment probability gives the best filtering performance for Chinese-English phrase translation pairs. With the multiple linear regression model combined with the N-best strategy, F1 reaches 86.24%.
Date 2011-05-03
Source Journal of Computers Vol 6, No 5 (2011): Special Issue: Selected Best Papers of the International Workshop on CSEEE 2011
Rights Copyright © ACADEMY PUBLISHER - All Rights Reserved. To request permission, see http://www.academypublisher.com/copyrightpermission.html.
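
The filtering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that each candidate pair is described by the three features named in the abstract (translation literality, phrase alignment probability, phrase length difference), that the regression target is a correctness score (for example 1 for a correct pair and 0 for an incorrect one), and that the N-best strategy keeps the N highest-scoring target phrases for each source phrase. All function names are hypothetical.

```python
from typing import List, Sequence, Tuple


def solve_linear_system(a: Sequence[Sequence[float]], b: Sequence[float]) -> List[float]:
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    # Build the augmented matrix [a | b].
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system; features may be collinear")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= factor * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_linear_regression(features: Sequence[Sequence[float]],
                          targets: Sequence[float]) -> List[float]:
    """Ordinary least squares via the normal equations.

    Returns [intercept, w1, ..., wk] for k features.
    """
    rows = [[1.0] + list(f) for f in features]  # prepend the intercept column
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, targets)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def score(weights: Sequence[float], feature_vector: Sequence[float]) -> float:
    """Regression score of one candidate phrase pair."""
    return weights[0] + sum(w * x for w, x in zip(weights[1:], feature_vector))


def n_best(candidates: Sequence[Tuple[str, Sequence[float]]],
           weights: Sequence[float], n: int) -> List[str]:
    """Keep the n highest-scoring target phrases for one source phrase.

    Each candidate is (target_phrase, feature_vector). The per-source-phrase
    grouping is an assumption about what "N-best" means here.
    """
    ranked = sorted(candidates, key=lambda c: score(weights, c[1]), reverse=True)
    return [phrase for phrase, _ in ranked[:n]]
```

In use, the weights would be fitted once on labelled phrase pairs with fit_linear_regression, and n_best would then be called for each source phrase with its candidate translations. The normal equations are solved directly here only to keep the sketch dependency-free; a numerical library would normally be preferred.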

 
