Academic Journals Database
Disseminating quality controlled scientific knowledge

Uma revisão dos algoritmos de radicalização em língua portuguesa

Author(s): Angel Freddy Godoy Viera | Johnny Virgil

Journal: Information Research: an international electronic journal
ISSN 1368-1613

Volume: 12
Issue: 3
Start page: 315
Date: 2007

Title (English): A review of stemming algorithms for the Portuguese language

Introduction. Information retrieval has grown in significance as digital files are created and used with increasing frequency. One of the many strategies for processing texts so that their contents can be indexed in information retrieval systems is stemming, which consists of reducing similar words to the same invariant representation. Stemming relies on the morphological structure of the language it is intended to work with. Aim. This paper aims to define the concept of stemming, present the algorithms available for Portuguese, and comment on several topics related to the use of this type of algorithm. Method. A review of the literature on stemming for Portuguese known to the authors. Results. To date, only three published algorithms deal with stemming in Portuguese: Porter's, Orengo's and Gonzalez's. Conclusion. At present, stemming for the Portuguese language cannot be improved successfully, since little research is being done in this area.
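
The idea of reducing similar words to one invariant representation can be illustrated with a minimal sketch. This is a toy suffix stripper, not Porter's, Orengo's or Gonzalez's algorithm; the suffix list, the minimum stem length and the function name `toy_stem` are assumptions made only for this example.

```python
# Toy suffix-stripping stemmer for illustration only.
# It is NOT any of the published Portuguese stemmers; the suffix list
# and the minimum stem length below are assumptions for this sketch.
SUFFIXES = ("as", "os", "a", "o", "s")  # a few gender/number endings
MIN_STEM_LEN = 3                        # avoid stripping very short words


def toy_stem(word: str) -> str:
    """Strip the longest matching suffix, if enough of the word remains."""
    word = word.lower()
    # Try longer suffixes first so "as" wins over "s".
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= MIN_STEM_LEN:
            return word[: -len(suffix)]
    return word


if __name__ == "__main__":
    words = ["menino", "menina", "meninos", "meninas"]
    # All four inflected forms map to the same representation.
    print({w: toy_stem(w) for w in words})
```

With this sketch, the four forms of "menino" collapse to the single index term "menin", while a short word such as "mas" is left unchanged by the minimum-length guard. A real stemmer for Portuguese would need many more rules and exception lists, which is where the morphological structure of the language comes in.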