Academic Journals Database
Disseminating quality controlled scientific knowledge

Named Entity Recognizer employing Multiclass Support Vector Machines for the Development of Question Answering Systems

Author(s): Bindu. M.S | Sumam Mary Idicula

Journal: International Journal of Computer Applications
ISSN 0975-8887

Volume: 25;
Issue: 10;
Start page: 40;
Date: 2011;
VIEW PDF   PDF DOWNLOAD PDF   Download PDF Original page

Keywords: Named Entity Recognition | Parts-of-Speech Tagger | Phrase chunker | Compound word splitter

Named Entity Recognition 'NER' seeks to locate and classify atomic elements in text into predefined categories such as names of person, organization, location, Quantities, Percentage etc. Named entities tell us the roles of each meaning bearing word in a sentence and hence identification of these entities certainly helps us to extract the essence of the text which is very important in Question Answering'QA',", Information Extraction 'IE' and Summarization. The system presented here is a Named Entity 'NE' Classifier created using Multiclass Support Vector Machines based on linguistic grammar principles. Malayalam NER is a difficult task as each word of named entity has no specific feature such as Capitalization feature in English. NERs in other languages are not suitable for Malayalam language since its morphology, syntax and lexical semantics is different from them. Also there is no tagged corpus available for training. For testing this system, documents from well known Malayalam news papers and magazines containing passages from five different fields such as sports, health, politics, science and agriculture are selected. Experimental results show that the average precision recall and Fmeasure values are 89.12%, 89.15% and 89.13% respectively. "
Affiliate Program      Why do you need a reservation system?