Academic Journals Database
Disseminating quality controlled scientific knowledge

HIGHLY ROBUST METHODS IN DATA MINING

ADD TO MY LIST
 
Author(s): Jan Kalina

Journal: Serbian Journal of Management
ISSN 1452-4864

Volume: 8;
Issue: 1;
Start page: 9;
Date: 2013;
VIEW PDF   PDF DOWNLOAD PDF   Download PDF Original page

Keywords: Data mining | robust statistics | High-dimensional data | Cluster analysis | Logistic regression | Neuralnetworks

ABSTRACT
This paper is devoted to highly robust methods for information extraction from data, with a special attention paid to methods suitable for management applications. The sensitivity of availabledata mining methods to the presence of outlying measurements in the observed data is discussed as a major drawback of available data mining methods. The paper proposes several newhighly robustmethods for data mining, which are based on the idea of implicit weighting of individual data values.Particularly it propose a novel robust method of hierarchical cluster analysis, which is a popular data mining method of unsupervised learning. Further, a robust method for estimating parameters in thelogistic regression was proposed. This idea is extended to a robust multinomial logistic classification analysis. Finally, the sensitivity of neural networks to the presence of noise and outlying measurements in the data was discussed. The method for robust training of neural networks for the task of function approximation, which has the form of a robust estimator in nonlinear regression, was proposed.
RPA Switzerland

Robotic Process Automation Switzerland

    

Tango Jona
Tangokurs Rapperswil-Jona