Academic Journals Database
Disseminating quality controlled scientific knowledge

A New Segmentation Method for Persian/Arabic OCR Based on Baseline Processing

Author(s): Mahboubeh Shamsi | Reza Rasouli | Soudeh Shadravan

Journal: Majlesi Journal of Electrical Engineering
ISSN 2008-1413

Volume: 3
Issue: 3
Start page: 53
Date: 2009

Keywords: Persian OCR | Segmentation | Recognition | Smoothing | Arabic OCR | Baseline Method

One of the most important stages in character recognition systems is segmentation, because any mistake at this stage affects all subsequent tasks, especially character recognition. Segmentation is more complex in Persian/Arabic script than in Latin-script languages such as English, and it remains a subject of ongoing research. Earlier algorithms, which serve as the basis for the proposed algorithm, achieve 85% accuracy. This paper presents an improved method based on analyzing the visual features of Persian/Arabic script. The proposed algorithm segments existing fonts with up to 98.5% accuracy, and 100% in some cases. The remaining errors could be corrected by applying a good character recognition technique and a precise vocabulary.