The Open Cybernetics & Systemics Journal

2014, 8 : 739-744
Published online 2014 December 31. DOI: 10.2174/1874110X01408010739
Publisher ID: TOCSJ-8-739

Uyghur-Chinese Translation Disambiguation Method Research Based on Knowledge Automatic-Acquisition

Ren Ge , Yang Yong and Xu Chun
XinYi Road, Urumqi, China, Postcard: 830054.

ABSTRACT

This thesis studies the disambiguation method in Uyghur-Chinese translation, and proposes the design philosophy of automatic-acquisition in translation label library aiming at the deficiency of disambiguation corpus in Uyghur. It refers to the existing Uyghur-Chinese bilingual dictionary, Chinese corpus and the Internet, and acquires the corresponding Chinese translation label examples to Uyghur automatically. On this basis, this thesis designs a Uyghur-Chinese disambiguation model, and refines the meaning probability definition based on HMM model. It increases the model’s backwards-dependence, and the model’s ability to obtain contextual information. To use the model to learn disambiguation knowledge from translation label corpus can eliminate the ambiguity in new Uyghur ambiguous words. From the experiment result, it can prove that the Uyghur-Chinese translation disambiguation framework based on automatically acquired corpus is effective, and increases with the increasing of the scale of translation label corpus, degree of accuracy of the disambiguation result also increases accordingly.

Keywords:

Knowledge automatic-acquisition, uyghur-chinese translation disambiguation, word sense disambiguation.