Please use this identifier to cite or link to this item: http://gukir.inflibnet.ac.in:8080/jspui/handle/123456789/4273
Title: Script identification of camera based bilingual document images using SFTA features
Authors: Dhandra B.V
Mallappa S
Mukarambi G.
Keywords: GLCM
KNN
LBP
OCR
SFTA
SVM
SVT
TTBD
Issue Date: 2019
Publisher: IGI Global
Citation: International Journal of Technology and Human Interaction , Vol. 15 , 4 , p. 1 - 12
Abstract: In this article, the exhaustive experiment is carried out to test the performance of the Segmentation based Fractal Texture Analysis (SFTA) features with nt = 4 pairs, and nt = 8 pairs, geometric features and their combinations. A unified algorithm is designed to identify the scripts of the camera captured bi-lingual document image containing International language English with each one of Hindi, Kannada, Telugu, Malayalam, Bengali, Oriya, Punjabi, and Urdu scripts. The SFTA algorithm decomposes the input image into a set of binary images from which the fractal dimension of the resulting regions are computed in order to describe the segmented texture patterns. This motivates use of the SFTA features as the texture features to identify the scripts of the camera-based document image, which has an effect of non-homogeneous illumination (Resolution). An experiment is carried on eleven scripts each with 1000 sample images of block sizes 128 × 128, 256 × 256, 512 × 512 and 1024 × 1024. It is observed that the block size 512 × 512 gives the maximum accuracy of 86.45% for Gujarathi and English script combination and is the optimal size. The novelty of this article is that unified algorithm is developed for the script identification of bilingual document images. Copyright © 2019, IGI Global.
URI: 10.4018/IJTHI.2019100101
http://gukir.inflibnet.ac.in:8080/jspui/handle/123456789/4273
Appears in Collections:1. Journal Articles

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.