Hardware Implementation of Web Based Arabic Optical Character Recognition Units
Osama Al-Khaleela, Inad Aljarraha, Abdelrahman Idriesb, and Khaldoon Mhaidata
Department of Computer Engineering, Jordan University of Science and Technology, Irbid, Jordan
Abstract—Optical character recognition (OCR) is an important application in the field of pattern recognition. It extracts text from an image document and saves it in an editable form. Examples where OCR is used include library digitization and text searching in scanned documents. Web based applications are main tools for data processing over the net. However, implementing such applications in dedicated hardware systems would increase performance and reliability by many folds over software implementation. In this paper, we present a detailed hardware implementation of the features extraction and character matching units of an Arabic optical character recognition (AOCR) system. The hardware implementation of each of these two units is described in VerilogHDL and functionally tested using ISim from Xilinx. Furthermore, each implementation is synthesized using Xilinx ISE 13.1 targeting Xilinx Spartan6 FPGA family. Experimental results show significant speed up in the hardware implementations over software ones. We further, explore the possibility of accessing these systems over the Web. Thus, they are beneficial to wider range of people.
Index Terms—AOCR, FPGAs, character matching, segmentation, hardware Cite: Osama Al-Khaleela, Inad Aljarraha, Abdelrahman Idriesb, and Khaldoon Mhaidata, "Hardware Implementation of Web Based Arabic Optical Character Recognition Units," Journal of Emerging Technologies in Web Intelligence, Vol. 6, No. 2, pp. 210-219, May 2014. doi:10.4304/jetwi.6.2.210-219
Array