The Open Cybernetics & Systemics Journal

2014, 8 : 435-441
Published online 2014 December 31. DOI: 10.2174/1874110X01408010435
Publisher ID: TOCSJ-8-435

Bayesian Spam Filtering Mechanism Based on Decision Tree of Attribute Set Dependence in the MapReduce Framework

Yanyan Guo , Lei Zhou , Kemeng He , Yuwan Gu and Yuqiang Sun
School of Information Science & Engineering, Changzhou University, Jiangsu, Changzhou, 213164, China.

ABSTRACT

Bayesian spam filtering is a classification method based on the theory of probability and statistics, and the Bayesian spam filtering based on Mapreduce can solve the defect of the traditional Bayesian spam filtering that consumes large amounts of system resources and network resources when the mail set is pre-training. It needs to classify mails manually in the pre-training phase of mail set, which consumes a lot of human and financial resources and affects the efficiency of the system. Bayesian spam filtering mechanism based on decision tree of the attribute sets dependence in the MapReduce framework which is presented in this paper. And the decision tree of attribute sets dependence is used in the training stage of the mail set, which improves execution efficiency of the system by lowering the time complexity.

Keywords:

Bayesian spam filtering, decision tree of attribute sets dependence, MapReduce.