Please use this identifier to cite or link to this item: http://gukir.inflibnet.ac.in:8080/jspui/handle/123456789/4963
Full metadata record
DC FieldValueLanguage
dc.contributor.authorDhandra B.V
dc.contributor.authorHangarge M.
dc.date.accessioned2020-06-12T15:05:48Z-
dc.date.available2020-06-12T15:05:48Z-
dc.date.issued2007
dc.identifier.citationJournal of Multimedia , Vol. 2 , 6 , p. 26 - 33en_US
dc.identifier.uri10.4304/jmm.2.6.26-33
dc.identifier.urihttp://gukir.inflibnet.ac.in:8080/jspui/handle/123456789/4963-
dc.description.abstractFor Optical Character Recognition (OCR) of bilingual or multilingual document containing text words in regional language and numerals in English, it is necessary to identify different script forms before running an individual OCR of the scripts. In this paper, an attempt is made for separation of English numerals at word level from bilingual and trilingual documents representing Kannada, Devnagari, Tamil, Odiya and Malayalam scripts by using discriminating features such as aspect ratio, strokes densities, eccentricity, etc. as a tool. The k-nearest neighbour algorithm is used to classify the new word images and the algorithm is tested on 6000 sample words with a five fold cross validation test. The algorithm is robust with respect to font styles, sizes and noise. The results obtained are quite encouraging. © 2007 ACADEMY PUBLISHER.en_US
dc.publisherAcademy Publisher
dc.subjectAnd cross validation
dc.subjectEccentricity
dc.subjectMorphological reconstruction
dc.subjectOCR
dc.subjectScript identification
dc.titleOn separation of english numerals from multilingual document imagesen_US
dc.typeArticle
Appears in Collections:1. Journal Articles

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.