Optical character recognition for Telugu handwritten text using SqueezeNet convolutional neural networks model
Abstract
Optical character recognition (OCR) is a process that recognizes and converts data from scanned images, including both handwritten and printed documents, into an accessible format. The challenges in Telugu OCR arise from compound characters, an extensive character set, limited datasets, character similarities, and difficulties in segmenting overlapping characters. To tackle these segmentation complexities, an algorithm has been developed, prioritizing the preservation of essential features during character segmentation. For distinguishing between structurally similar characters, we used convolutional neural networks (CNN) due to their feature-extracting properties. We have employed the CNN model, the SqueezeNet for feature extraction, resulting in an impressive character recognition rate of 94% and a word recognition rate of 80%.
Full Text:
PDFDOI: http://doi.org/10.11591/ijaas.v13.i3.pp487-496
Refbacks
- There are currently no refbacks.
International Journal of Advances in Applied Sciences (IJAAS)
p-ISSN 2252-8814, e-ISSN 2722-2594
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.