Design of a Low-area Digit Recognition Accelerator Using MNIST Database

Joonyub Kwon - Department of Semiconductor System Engineering, Sangmyung University, 31, Sangmyeongdae-gil, Dongnam-gu, Cheonan-si, Chungcheongnam-do, 31066, Republic of Korea
Sunhee Kim - Department of Semiconductor System Engineering, Sangmyung University, 31, Sangmyeongdae-gil, Dongnam-gu, Cheonan-si, Chungcheongnam-do, 31066, Republic of Korea


DOI: http://dx.doi.org/10.30630/joiv.6.1.855

Abstract


Deep neural networks, a field of artificial intelligence, are used in a wide range of applications. Deep learning is typically run on high-performance GPUs or TPUs, whose performance comes at a correspondingly high cost. Recently, as demand for edge computing has grown, many studies have sought to run complex deep learning workloads on low-compute processors; a representative approach is to make the network lightweight. In this paper, we propose a handwritten digit recognition hardware accelerator suitable for edge computing, using the MNIST database. Setting a target recognition accuracy of 94% on MNIST, we apply network lightweighting steps and propose a hardware structure that reduces hardware area and minimizes memory accesses. The network is a two-layer fully connected network. It is modeled in Python and made lighter while its performance is monitored: the network parameters (weights and biases) are quantized, and both the pixel count and the bit width of the MNIST input data are reduced. The number of MAC units and the processing order of the hardware accelerator are chosen so that no MAC unit sits idle while the MAC operations execute in parallel. The accelerator is designed in Verilog HDL, its functions are verified in ModelSim, and it is then implemented on a Xilinx Zynq ZC702 board to verify its operation. By reducing area and memory accesses, the proposed digit recognition accelerator is expected to be widely usable in edge devices.
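To make the lightweighting steps concrete, the sketch below models in NumPy the three transformations the abstract names: quantizing the weights and biases, reducing the pixel count and bit width of the MNIST input, and evaluating a two-layer fully connected network. The abstract does not give the hidden-layer size, quantization bit widths, reduced input resolution, or activation function; the values used here (14x14 input, 4-bit pixels, 8-bit parameters, 64 hidden units, ReLU) are illustrative assumptions only, not the authors' published configuration.

```python
import numpy as np

def quantize(x, bits=8):
    """Uniform symmetric quantization of an array to `bits`-bit levels
    (bit width is an assumption; the paper's choice is not stated here)."""
    scale = (2 ** (bits - 1) - 1) / max(float(np.max(np.abs(x))), 1e-8)
    return np.round(x * scale) / scale

def reduce_input(img28, pixel_bits=4):
    """Shrink a 28x28 MNIST image to 14x14 by 2x2 averaging, then
    requantize each pixel to `pixel_bits` bits -- i.e., reduce both the
    pixel count and the bit width of the input, as the abstract describes."""
    img = img28.astype(np.float64).reshape(14, 2, 14, 2).mean(axis=(1, 3))
    levels = 2 ** pixel_bits - 1
    return np.round(img / 255.0 * levels) / levels

def forward(x, w1, b1, w2, b2):
    """Two-layer fully connected network (ReLU hidden layer assumed)."""
    h = np.maximum(0.0, x @ w1 + b1)
    return h @ w2 + b2

# Hypothetical shapes: 196 reduced pixels -> 64 hidden units -> 10 digits.
rng = np.random.default_rng(0)
w1 = quantize(rng.standard_normal((196, 64)) * 0.1)
b1 = quantize(np.zeros(64))
w2 = quantize(rng.standard_normal((64, 10)) * 0.1)
b2 = quantize(np.zeros(10))

img = rng.integers(0, 256, size=(28, 28))           # stand-in for one MNIST image
x = reduce_input(img).reshape(-1)                   # 196-element input vector
digit = int(np.argmax(forward(x, w1, b1, w2, b2)))  # predicted class 0..9
```

In a flow like the one the abstract describes, a model of this shape would be trained in Python, then the quantization and input-reduction parameters tightened step by step while checking that accuracy stays above the 94% target, before fixing the resulting bit widths in the Verilog design.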

Keywords


MNIST; accelerator; digit recognition; edge computing; fully-connected network.

