Publication: Local Binary Patterns for Arabic Optical Font Recognition


Authors: Anguelos Nicolaou, Fouad Slimane, Volker Maergner, Marcus Liwicki


Published in: DAS 2014 Tours, France.


Download pdf


Abstract:
Optical Font Recognition (OFR) has been proven to increase Optical Character Recognition (OCR) accuracy, but it can also help in harvesting semantic information from documents. It therefore becomes a part of many Document Image Analysis (DIA) pipelines. Our work is based on the hypothesis that Local Binary Patterns (LBP), as a generic texture classification method, can address several distinct DIA problems at the same time such as OFR, script detection, writer identification, etc. In this paper we strip down the Redundant Oriented LBP (RO-LBP) method, previously used in writer identification, and apply it for OFR with the goal of introducing a generic method that classifies text as oriented texture. We focus on Arabic OFR and try to perform a thorough comparison of our method and the leading Gaussian Mixture Model method that is developed specifically for the task. Depending on the nature of proposed OFR method, each method's performance is usually evaluated on different data and with different evaluation protocols. The proposed experimental procedure addresses this problem and allows us to compare OFR methods that are fundamentally different by adapting them to a common measurement protocol. In performed experiments LBP method achieves perfect results on large text blocks generated from the APTI database, while preserving its very broad generic attributes as proven by secondary experiments.


Aditional Material:


bibtex entry :
 @inproceedings{nicolaou2014local,
title={Local Binary Patterns for Arabic Optical Font Recognition},
author={Nicolaou, Anguelos and Slimane, Fouad and Maergner, Volker and Liwicki, Marcus},
booktitle={Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on},
pages={76--80},
year={2014},
organization={IEEE}
}


Creative Commons License
Any work in this page other than source code or program binaries is licensed under a Creative Commons Attribution 4.0 International License. When applicable atribution should be in the form of a citation.