The Open Automation and Control Systems Journal

2014, 6 : 561-565
Published online 2014 December 31. DOI: 10.2174/1874444301406010561
Publisher ID: TOAUTOCJ-6-561

Co-occurrence Degree Based Word Alignment in Statistical Machine Translation

Chenggang Mi , Yating Yang , Lei Wang and Xiao Li
Xinjiang Technical Institute of Physics and Chemistry of Chinese Academy of Sciences, Urumqi 830011, China.

ABSTRACT

To alleviate the data sparseness problem during word alignment, we propose a word alignment method based on word co-occurrence degree. In this paper, we propose a new method to get the statistical information from word cooccurrence. We combine the co-occurrence counts and the fuzzy co-occurrence weights as word co-occurrence degree. Fuzzy co-occurrence weights can be obtained by searching for fuzzy co-occurrence word pairs and computing differences of length between current word and other words in fuzzy co-occurrence word pairs. Experiments show that the quality of word alignment and the translation performance both improved.

Keywords:

Co-occurrence Degree, Statistical Machine Translation, Word Alignment.