Improved Handwritten Digit Recognition Using Convolutional Neural Networks (CNN)

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Traditional systems of handwriting recognition have relied on handcrafted features and a large amount of prior knowledge. Training an Optical character recognition (OCR) system based on these prerequisites is a challenging task. Research in the handwriting recognition field is focused around deep learning techniques and has achieved breakthrough performance in the last few years. Still, the rapid growth in the amount of handwritten data and the availability of massive processing power demands improvement in recognition accuracy and deserves further investigation. Convolutional neural networks (CNNs) are very effective in perceiving the structure of handwritten characters/words in ways that help in automatic extraction of distinct features and make CNN the most suitable approach for solving handwriting recognition problems. Our aim in the proposed work is to explore the various design options like number of layers, stride size, receptive field, kernel size, padding and dilution for CNN-based handwritten digit recognition. In addition, we aim to evaluate various SGD optimization algorithms in improving the performance of handwritten digit recognition. A network’s recognition accuracy increases by incorporating ensemble architecture. Here, our objective is to achieve comparable accuracy by using a pure CNN architecture without ensemble architecture, as ensemble architectures introduce increased computational cost and high testing complexity. Thus, a CNN architecture is proposed in order to achieve accuracy even better than that of ensemble architectures, along with reduced operational complexity and cost. Moreover, we also present an appropriate combination of learning parameters in designing a CNN that leads us to reach a new absolute record in classifying MNIST handwritten digits. We carried out extensive experiments and achieved a recognition accuracy of 99.87% for a MNIST dataset.

Related collections

Most cited references 70

Record: found
Abstract: found
Article: not found

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

Kevin Murphy, Iasonas Kokkinos, George Papandreou … (2018)

In this work we address the task of semantic image segmentation with Deep Learning and make three main contributions that are experimentally shown to have substantial practical merit. First, we highlight convolution with upsampled filters, or 'atrous convolution', as a powerful tool in dense prediction tasks. Atrous convolution allows us to explicitly control the resolution at which feature responses are computed within Deep Convolutional Neural Networks. It also allows us to effectively enlarge the field of view of filters to incorporate larger context without increasing the number of parameters or the amount of computation. Second, we propose atrous spatial pyramid pooling (ASPP) to robustly segment objects at multiple scales. ASPP probes an incoming convolutional feature layer with filters at multiple sampling rates and effective fields-of-views, thus capturing objects as well as image context at multiple scales. Third, we improve the localization of object boundaries by combining methods from DCNNs and probabilistic graphical models. The commonly deployed combination of max-pooling and downsampling in DCNNs achieves invariance but has a toll on localization accuracy. We overcome this by combining the responses at the final DCNN layer with a fully connected Conditional Random Field (CRF), which is shown both qualitatively and quantitatively to improve localization performance. Our proposed "DeepLab" system sets the new state-of-art at the PASCAL VOC-2012 semantic image segmentation task, reaching 79.7 percent mIOU in the test set, and advances the results on three other datasets: PASCAL-Context, PASCAL-Person-Part, and Cityscapes. All of our code is made publicly available online.

0 comments Cited 2437 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Conference Proceedings: not found

Very deep convolutional networks for large-scale image recognition

K Simonyan, A. Zisserman, A ZISSERMAN … (2014)

0 comments Cited 279 times – based on 0 reviews

Bookmark

Record: found
Abstract: not found
Conference Proceedings: not found

Adam: a method for stochastic 7 optimization

D. P. Kingma, J. Ba, J. L. Ba … (2025)

0 comments Cited 239 times – based on 0 reviews

Bookmark

All references

Author and article information

Journal

Journal ID (nlm-ta): Sensors (Basel)

Journal ID (iso-abbrev): Sensors (Basel)

Journal ID (publisher-id): sensors

Title: Sensors (Basel, Switzerland)

Publisher: MDPI

ISSN (Electronic): 1424-8220

Publication date (Electronic): 12 June 2020

Publication date Collection: June 2020

Volume: 20

Issue: 12

Electronic Location Identifier: 3344

Affiliations

[1 ]Department of Computer Science and Engineering, Maharaja Surajmal Institute of Technology, New Delhi 110058, India; savita.ahlawat@ 123456gmail.com

[2 ]Department of Computer Science, Maharaja Surajmal Institute, New Delhi 110058, India; amit.choudhary69@ 123456gmail.com

[3 ]Graduate School, Duy Tan University, Da Nang 550000, Vietnam; anandnayyar@ 123456duytan.edu.vn

[4 ]Department of Industrial & Systems Engineering, Dongguk University, Seoul 04620, Korea; saurabh89@ 123456dongguk.edu

Author notes

[* ]Correspondence: postman3@ 123456dongguk.edu

Author information

Saurabh Singh https://orcid.org/0000-0003-1118-9569

Byungun Yoon https://orcid.org/0000-0002-1110-4011

Article

Publisher ID: sensors-20-03344

DOI: 10.3390/s20123344

PMC ID: 7349603

PubMed ID: 32545702

SO-VID: 097a8d7a-cf8b-4a19-a15b-0bb60e752414

License:

Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).

History

Date received : 25 May 2020

Date accepted : 09 June 2020

Comments

Comment on this article

scite_

Smart Citations

Citing PublicationsSupportingMentioningContrasting

View Citations

See how this article has been cited at scite.ai

scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

Cited by 20

See all cited by

Most referenced authors 1,027

See all reference authors

Improved Handwritten Digit Recognition Using Convolutional Neural Networks (CNN)

Read this article at

Abstract

Related collections

Journal of Disability Research

Most cited references 70

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

Very deep convolutional networks for large-scale image recognition

Adam: a method for stochastic 7 optimization

Author and article information

Journal

Affiliations

Author notes

Author information

Article

History

Categories

Comments

Comment on this article

Similar content 381

Cited by 20

Most referenced authors 1,027