The Open Automation and Control Systems Journal
2014, 6 : 561-565Published online 2014 December 31. DOI: 10.2174/1874444301406010561
Publisher ID: TOAUTOCJ-6-561
Co-occurrence Degree Based Word Alignment in Statistical Machine Translation
Xinjiang Technical Institute
of Physics and Chemistry of Chinese Academy of Sciences, Urumqi
830011, China.
ABSTRACT
To alleviate the data sparseness problem during word alignment, we propose a word alignment method based on word co-occurrence degree. In this paper, we propose a new method to get the statistical information from word cooccurrence. We combine the co-occurrence counts and the fuzzy co-occurrence weights as word co-occurrence degree. Fuzzy co-occurrence weights can be obtained by searching for fuzzy co-occurrence word pairs and computing differences of length between current word and other words in fuzzy co-occurrence word pairs. Experiments show that the quality of word alignment and the translation performance both improved.