Modified LeNet-5 Architecture to Classify High Variety of Tourism Object: A Case Study of Tourism Object for Education in Tinalah Village

Antonius Wijaya - Universitas Kristen Immanuel
Desideria Murti - Universitas Kristen Immanuel
Victoria Handoko - Universitas Kristen Immanuel


Citation Format:



DOI: http://dx.doi.org/10.30630/joiv.7.3.2095

Abstract


This research aims to modify a CNN (Convolutional Neural Network) based on LeNet-5 to reduce overfitting in a Tinalah Tourism Village dataset object detection. Tinalah Tourism Village has many objects that can be identified for tourism education and enhanced tourist experience. While these objects, spread across the different sites of Tinalah do vary, some share similarities in their histogram patterns. Visually, if the size of a picture is reduced in the LeNet-5 ‘preferred size’ feature, it will inevitably lose some of its information, making pictures too similar reducing accuracy. In order to learn and classify objects, this research performs a modification on LeNet-5 architecture to provide a better performance geared toward larger input imaging. The previous state-of-the-art architecture showed an overfitting performance where the training accuracy performed too much better than the testing accuracy in our dataset. We brought in a dropout layer to reduce overfitting, increase the dense layer's size, and add a convolution layer. We then compared the modified LeNet-5 with other state-of-the art architecture, such as LeNet-5 and AlexNet. Results showed that a modified LeNet-5 outperformed other architectures, especially in performing accuracy for testing the Tinalah dataset, reaching 0.913 or (91,3 %). This research discusses the dataset, the modified LeNet-5 architecture, and performance comparison between state-of-the-art CNN architecture. Our CNN architecture can be developed by involving a transfer learning mechanism to provide greater accuracy for further research.


Full Text:

PDF

References


Elsaadouny, M., Barowski, J. and Rolfes, I., 2020, March. Extracting the Features of the Shallowly Buried Objects using LeNet Convolutional Network. In 2020 14th European Conference on Antennas and Propagation (EuCAP) (pp. 1-4). IEEE.

Kayed, M., Anter, A. and Mohamed, H., 2020, February. Classification of garments from fashion MNIST dataset using CNN LeNet-5 architecture. In 2020 International Conference on Innovative Trends in Communication and Computer Engineering (ITCE) (pp. 238-243). IEEE.

Zhao, X., Tao, R., Li, W., Li, H.C., Du, Q., Liao, W. and Philips, W., 2020. Joint classification of hyperspectral and LiDAR data using hierarchical random walk and deep CNN architecture. IEEE Transactions on Geoscience and Remote Sensing, 58(10), pp.7355-7370.

Lee, S.G., Sung, Y., Kim, Y.G. and Cha, E.Y., 2018. Variations of AlexNet and GoogLeNet to improve Korean character recognition performance. Journal of information processing systems, 14(1), pp.205-217.

Öztürk, Ş. and Özkaya, U., 2020. Gastrointestinal tract classification using improved LSTM based CNN. Multimedia Tools and Applications, 79(39), pp.28825-28840.

Palanisamy, K., Singhania, D. and Yao, A., 2020. Rethinking CNN models for audio classification. arXiv preprint arXiv:2007.11154.

Demir, F., Turkoglu, M., Aslan, M. and Sengur, A., 2020. A new pyramidal concatenated CNN approach for environmental sound classification. Applied Acoustics, 170, p.107520.

Bi, Z., Yu, L., Gao, H., Zhou, P. and Yao, H., 2021. Improved VGG model-based efficient traffic sign recognition for safe driving in 5G scenarios. International Journal of Machine Learning and Cybernetics, 12(11), pp.3069-3080.

Shao, X., Nie, X., Zhao, X., Zheng, R. and Guo, D., 2020, June. Gait Recognition based on Improved LeNet Network. In 2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC) (Vol. 1, pp. 1534-1538). IEEE.

Ul Islam, T., Awasthi, L.K. and Garg, U., 2021. Gait based Age Estimation using LeNet-50 inspired Gait-Net. International Journal of Advanced Engineering Research and Science, 8, p.8.

Banik, P.P., Saha, R. and Kim, K.D., 2020. An automatic nucleus segmentation and CNN model-based classification method of white blood cell. Expert Systems with Applications, 149, p.113211.

Lu, J., Tan, L. and Jiang, H., 2021. Review on Convolutional Neural Network (CNN) Applied to Plant Leaf Disease Classification. Agriculture, 11(8), p.707.

Qin, J., Pan, W., Xiang, X., Tan, Y. and Hou, G., 2020. A biological image classification method based on improved CNN. Ecological Informatics, 58, p.101093.

Vaddi, R. and Manoharan, P., 2020. Hyperspectral image classification using CNN with spectral and spatial features integration. Infrared Physics & Technology, 107, p.103296.

Barbhuiya, A.A., Karsh, R.K. and Jain, R., 2021. CNN based feature extraction and classification for sign language. Multimedia Tools and Applications, 80(2), pp.3051-3069.

Hsieh, T.H. and Kiang, J.F., 2020. Comparison of CNN algorithms on hyperspectral image classification in agricultural lands. Sensors, 20(6), p.1734.

Mateen, M., Wen, J., Song, S. and Huang, Z., 2019. Fundus image classification using VGG-19 architecture with PCA and SVD. Symmetry, 11(1), p.1.

Gayathri, S., Gopi, V.P. and Palanisamy, P., 2020. A lightweight CNN for Diabetic Retinopathy classification from fundus images. Biomedical Signal Processing and Control, 62, p.102115.

Zhang, Y.D., Govindaraj, V.V., Tang, C., Zhu, W. and Sun, J., 2019. High performance multiple sclerosis classification by data augmentation and AlexNet transfer learning model. Journal of Medical Imaging and Health Informatics, 9(9), pp.2012-2021.

Aliyu, H.A., Razak, M.A.A., Sudirman, R. and Ramli, N., 2020. A deep learning AlexNet model for classification of red blood cells in sickle cell anaemia. Int J Artif Intell, 9(2), pp.221-228.

Sitaula, C. and Hossain, M.B., 2021. Attention-based VGG-16 model for COVID-19 chest X-ray image classification. Applied Intelligence, 51(5), pp.2850-2863.

Islam, M.R. and Matin, A., 2020, December. Detection of COVID-19 from CT Image by The Novel LeNet-5 CNN Architecture. In 2020 23rd International Conference on Computer and Information Technology (ICCIT) (pp. 1-5). IEEE.

Agnes, S.A., Anitha, J., Pandian, S.I.A. and Peter, J.D., 2020. Classification of mammogram images using multiscale all convolutional neural network (MA-CNN). Journal of medical systems, 44(1), pp.1-9.

Vasan, D., Alazab, M., Wassan, S., Safaei, B. and Zheng, Q., 2020. Image-Based malware classification using ensemble of CNN architectures (IMCEC). Computers & Security, 92, p.101748.

Fawaz, H.I., Lucas, B., Forestier, G., Pelletier, C., Schmidt, D.F., Weber, J., Webb, G.I., Idoumghar, L., Muller, P.A. and Petitjean, F., 2020. Inceptiontime: Finding AlexNet for time series classification. Data Mining and Knowledge Discovery, 34(6), pp.1936-1962.

Zhu, Y., Li, G., Wang, R., Tang, S., Su, H. and Cao, K., 2021. Intelligent fault diagnosis of hydraulic piston pump combining improved LeNet-5 and PSO hyperparameter optimization. Applied Acoustics, 183, p.108336.

Sun, Y., Xue, B., Zhang, M., Yen, G.G. and Lv, J., 2020. Automatically designing CNN architectures using the genetic algorithm for image classification. IEEE Transactions on Cybernetics, 50(9), pp.3840-3854.

Hosny, K.M., Kassem, M.A. and Fouad, M.M., 2020. Classification of skin lesions into seven classes using transfer learning with AlexNet. Journal of digital imaging, 33(5), pp.1325-1334.

Zhang, C.W., Yang, M.Y., Zeng, H.J. and Wen, J.P., 2019. Pedestrian detection based on improved LeNet-5 convolutional neural network. Journal of Algorithms & Computational Technology, 13, p.1748302619873601.

Li, T., Jin, D., Du, C., Cao, X., Chen, H., Yan, J., Chen, N., Chen, Z., Feng, Z. and Liu, S., 2020. The image-based analysis and classification of urine sediments using a LeNet-5 neural network. Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization, 8(1), pp.109-114.

Dai, Q., Luo, X. and Meng, Z., 2020, October. Research on Face Recognition Based on Gabor-LeNet Convolutional Neural Network Model. In Journal of Physics: Conference Series (Vol. 1650, No. 3, p. 032035). IOP Publishing.

Cui, W., Lu, Q., Qureshi, A.M., Li, W. and Wu, K., 2021. An adaptive LeNet-5 model for anomaly detection. Information Security Journal: A Global Perspective, 30(1), pp.19-29.

Zaibi, A., Ladgham, A. and Sakly, A., 2021. A Lightweight Model for Traffic Sign Classification Based on Enhanced LeNet-5 Network. Journal of Sensors, 2021.

Priyadharshini, R.A., Arivazhagan, S., Arun, M. and Mirnalini, A., 2019. Maize leaf disease classification using deep convolutional neural networks. Neural Computing and Applications, 31(12), pp.8887-8895.

Shanthi, T. and Sabeenian, R.S., 2019. Modified AlexNet architecture for classification of diabetic retinopathy images. Computers & Electrical Engineering, 76, pp.56-64.

Momeny, M., Jahanbakhshi, A., Jafarnezhad, K. and Zhang, Y.D., 2020. Accurate classification of cherry fruit using deep CNN based on hybrid pooling approach. Postharvest Biology and Technology, 166, p.111204.

Rehman, A., Khan, M.A., Saba, T., Mehmood, Z., Tariq, U. and Ayesha, N., 2021. Microscopic brain tumor detection and classification using 3D CNN and feature selection architecture. Microscopy Research and Technique, 84(1), pp.133-149.

Yu, C., Han, R., Song, M., Liu, C. and Chang, C.I., 2020. A simplified 2D-3D CNN architecture for hyperspectral image classification based on spatial–spectral fusion. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 13, pp.2485-2501.

Kuo, C.C.J., 2016. Understanding convolutional neural networks with a mathematical model. Journal of Visual Communication and Image Representation, 41, pp.406-413.