Academic Journals Database
Disseminating quality controlled scientific knowledge

Combination of Ant Colony Optimization and Bayesian Classification for Feature Selection in a Bioinformatics Dataset

ADD TO MY LIST
 
Author(s): Mehdi Hosseinzadeh Aghdam | Jafar Tanha | Ahmad Reza Naghsh-Nilchi | Mohammad Ehsan Basiri

Journal: Journal of Computer Science & Systems Biology
ISSN 0974-7230

Volume: 02;
Issue: 03;
Start page: 186;
Date: 2009;
VIEW PDF   PDF DOWNLOAD PDF   Download PDF Original page

Keywords: Feature Selection | Ant Colony Optimization | Particle Swarm Optimization | Bayesian Classification | Bioinformatics

ABSTRACT
Feature selection is widely used as the first stage of classification task to reduce the dimension of problem, decrease noise, improve speed and relieve memory constraints by the limination of irrelevant or redundant features. One approach in the feature selection area is employing population-based optimization algorithms such as particle swarm optimization (PSO)-based method and ant colony optimization (ACO)-based method. Ant colonyoptimization algorithm is inspired by observation on real ants in their search for the shortest paths to food sources. Protein function prediction is an important problem in functional genomics. Typically, protein sequences are represented by feature vectors. A major problem of protein datasets that increase the complexity of classification models is their large number of features. This paper empowers the ant colony optimization algorithm by enabling the ACO to select features for a Bayesian classification method. The naive Bayesian classifier is a straightforward and frequently used method for supervised learning. It provides a flexible way for dealing withany number of features or classes, and is based on probability theory. This paper then compares the performance of the proposed ACO algorithm against the performance of a standard binary particle swarm optimization algorithm on the task of selecting features on Postsynaptic dataset. The criteria used for this comparison are maximizing predictive accuracy and finding the smallest subset of features. Simulation results on Postsynaptic dataset show that proposed method simplifies features effectively and obtains a higher classification accuracy compared to other feature selection methods.
Affiliate Program      Why do you need a reservation system?