Academic Journals Database

Parallel Implementation of Classification Algorithms Based on Cloud Computing Environment

Author(s): Lijuan Zhou | Hui Wang | Wenbo Wang

Journal: TELKOMNIKA : Indonesian Journal of Electrical Engineering
ISSN 2302-4046

Volume: 10
Issue: 5
Start page: 1087
Date: 2012

As an important task in data mining, classification has received considerable attention in many applications, such as information retrieval and web search. The growing volume of information produced by technological progress, together with increasingly individual data mining needs, makes classifying very large-scale data a challenging task. To address this problem, many researchers have tried to design efficient parallel classification algorithms. This paper briefly introduces classification algorithms and cloud computing, analyzes the shortcomings of existing parallel classification algorithms on that basis, and then proposes a new model for parallel classification algorithms. It focuses on a parallel Naïve Bayes classification algorithm based on MapReduce, a simple yet powerful parallel programming technique. The experimental results demonstrate that the proposed algorithm improves on the performance of the original algorithm and can process large datasets efficiently on commodity hardware.
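
The abstract does not describe how the Naïve Bayes training is split into map and reduce steps, so the following is only a minimal sketch of one common way to do it, not the authors' implementation. It assumes categorical features and Laplace (add-one) smoothing, and it simulates the map and reduce phases inside a single Python process instead of running on a real MapReduce cluster. All function names (map_record, reduce_counts, train, classify) are hypothetical.

```python
import math
from collections import Counter, defaultdict


def map_record(record):
    """Map step (assumed design): emit (key, 1) pairs for one labelled record."""
    label, features = record
    pairs = [(("class", label), 1)]
    for index, value in enumerate(features):
        pairs.append((("feature", label, index, value), 1))
    return pairs


def reduce_counts(pairs):
    """Reduce step: sum the emitted values for each key."""
    counts = Counter()
    for key, n in pairs:
        counts[key] += n
    return counts


def train(records, num_splits=2):
    """Simulate MapReduce training: map each split, reduce it, then merge."""
    splits = [records[i::num_splits] for i in range(num_splits)]
    total = Counter()
    for split in splits:
        pairs = [pair for record in split for pair in map_record(record)]
        total.update(reduce_counts(pairs))  # merging partial counts adds them
    return total


def classify(counts, features):
    """Pick the label with the highest smoothed log-posterior."""
    class_totals = {k[1]: n for k, n in counts.items() if k[0] == "class"}
    total = sum(class_totals.values())
    # Distinct observed values per feature position, used for smoothing.
    values = defaultdict(set)
    for key in counts:
        if key[0] == "feature":
            values[key[2]].add(key[3])
    best_label, best_score = None, -math.inf
    for label, n_label in class_totals.items():
        score = math.log(n_label / total)
        for index, value in enumerate(features):
            hits = counts[("feature", label, index, value)]
            score += math.log((hits + 1) / (n_label + len(values[index])))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


if __name__ == "__main__":
    # Toy, made-up records: (label, (feature values...)).
    records = [
        ("spam", ("free", "yes")),
        ("spam", ("free", "no")),
        ("ham", ("meeting", "no")),
        ("ham", ("meeting", "yes")),
    ]
    model = train(records)
    print(classify(model, ("free", "yes")))
```

Because training reduces to summing counts, the merged model does not depend on how the records are partitioned, which is what makes this classifier a natural fit for the MapReduce model the abstract refers to.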
