Implementation of CRNN Method for Lung Cancer Detection based on Microarray Data

Azka Khoirunnisa - School of Computing, Telkom University, Bandung, Indonesia
- Adiwijaya - School of Computing, Telkom University, Bandung, Indonesia
Didit Adytia - School of Computing, Telkom University, Bandung, Indonesia


Citation Format:



DOI: http://dx.doi.org/10.30630/joiv.7.2.1339

Abstract


Lung Cancer is one of the cancer types with the most significant mortality rate, mainly because of the disease's slow detection. Therefore, the early identification of this disease is crucial. However, the primary issue of microarray is the curse of dimensionality. This problem is related to the characteristic of microarray data, which has a small sample size yet many attributes. Moreover, this problem could lower the accuracy of cancer detection systems. Various machines and deep learning techniques have been researched to solve this problem. This paper implemented a deep learning method named Convolutional Recurrent Neural Network (CRNN) to build the Lung Cancer detection system. Convolutional neural networks (CNN) are used to extract features, and recurrent neural networks (RNN) are used to summarize the derived features. CNN and RNN methods are combined in CRNN to derive the advantages of each of the methods. Several previous research uses CRNN to build a Lung Cancer detection system using medical image biomarkers (MRI or CT scan). Thus, the researchers concluded that CRNN achieved higher accuracy than CNN and RNN independently. Moreover, CRNN was implemented in this research by using a microarray-based Lung Cancer dataset. Furthermore, different drop-out values are compared to determine the best drop-out value for the system. Thus, the result shows that CRNN gave a higher accuracy than CNN and RNN. The CRNN method achieved the highest accuracy of 91%, while the CNN and RNN methods achieved 83% and 71% accuracy, respectively.

Keywords


Microarray data; lung cancer; classification; deep learning

Full Text:

PDF

References


H. Sung et al., “Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries,†CA Cancer J Clin, vol. 71, no. 3, pp. 209–249, 2021.

B. C. Bade and C. S. dela Cruz, “Lung cancer 2020: epidemiology, etiology, and prevention,†Clin Chest Med, vol. 41, no. 1, pp. 1–24, 2020.

O. M. Al-Quteimat and A. M. Amer, “The impact of the COVID-19 pandemic on cancer patients,†Am J Clin Oncol, 2020.

S. Ahmad et al., “Epigenetic underpinnings of inflammation: Connecting the dots between pulmonary diseases, lung cancer and COVID-19,†in Seminars in cancer biology, 2022, vol. 83, pp. 384–398.

H. Aydadenta and A. Adiwijaya, “A clustering approach for feature selection in microarray data classification using random forest,†Journal of Information Processing Systems, vol. 14, no. 5, pp. 1167–1175, 2018.

A. Khoirunnisa, A. A. Rohmawati, and others, “Implementing principal component analysis and multinomial logit for cancer detection based on microarray data classification,†in 2019 7th international conference on information and communication technology (ICoICT), 2019, pp. 1–6.

V. Bolón-Canedo, N. Sánchez-Marono, A. Alonso-Betanzos, J. M. Ben’itez, and F. Herrera, “A review of microarray datasets and applied feature selection methods,†Inf Sci (N Y), vol. 282, pp. 111–135, 2014.

D. R. Rhodes et al., “ONCOMINE: a cancer microarray database and integrated data-mining platform,†Neoplasia, vol. 6, no. 1, pp. 1–6, 2004.

D. Q. Zeebaree, H. Haron, and A. M. Abdulazeez, “Gene selection and classification of microarray data using convolutional neural network,†in 2018 International Conference on Advanced Science and Engineering (ICOASE), 2018, pp. 145–150.

D. Moitra and R. K. Mandal, “Automated AJCC staging of non-small cell lung cancer (NSCLC) using deep convolutional neural network (CNN) and recurrent neural network (RNN),†Health Inf Sci Syst, vol. 7, no. 1, pp. 1–12, 2019.

C. Wang et al., “3D localization of lung tumors on cone beam CT projections via a convolutional recurrent neural network,†Med Phys, vol. 47, no. 3, pp. 1161–1166, 2020.

C. Wang et al., “Predicting spatial esophageal changes in a multimodal longitudinal imaging study via a convolutional recurrent neural network,†Phys Med Biol, vol. 65, no. 23, p. 235027, 2020.

J. Lu, R. Jin, E. Song, G. Ma, and M. Wang, “Lung-CRNet: A convolutional recurrent neural network for lung 4DCT image registration,†Med Phys, vol. 48, no. 12, pp. 7900–7912, 2021.

M. M. N. Abid, T. Zia, M. Ghafoor, and D. Windridge, “Multi-view convolutional recurrent neural networks for lung cancer nodule identification,†Neurocomputing, vol. 453, pp. 299–311, 2021.

S. Chowdhury, X. Dong, and X. Li, “Recurrent neural network based feature selection for high dimensional and low sample size micro-array data,†in 2019 IEEE International Conference on Big Data (Big Data), 2019, pp. 4823–4828.

M. Daoud and M. Mayo, “A survey of neural network-based cancer prediction models from microarray data,†Artif Intell Med, vol. 97, pp. 204–214, 2019.

A. Khoirunnisa, “Microarray Data Classification using Minimum Redundancy Maximum Relevance and Modified Logistic Regression (for high accuracy cancer detection),†2020.

S. H. Shah, M. J. Iqbal, I. Ahmad, S. Khan, and J. J. P. C. Rodrigues, “Optimized gene selection and classification of cancer from microarray gene expression data using deep learning,†Neural Comput Appl, pp. 1–12, 2020.

S. Iqbal et al., “Prostate cancer detection using deep learning and traditional techniques,†IEEE Access, vol. 9, pp. 27085–27100, 2021.

W. J. Sori, J. Feng, A. W. Godana, S. Liu, and D. J. Gelmecha, “DFD-Net: lung cancer detection from denoised CT scan image using deep learning,†Front Comput Sci, vol. 15, no. 2, pp. 1–13, 2021.

M. Y. A. Bakar and A. Adiwijaya, “Klasifikasi Teks Hadis Bukhari Terjemahan Indonesia Menggunakan Recurrent Convolutional Neural Network (CRNN),†Jurnal Teknologi Informasi dan Ilmu Komputer, vol. 8, no. 5, pp. 907–918, 2021.

J. Li and H. Liu, “Kent ridge bio-medical data set repository,†Institute for Infocomm Research. http://sdmc. lit. org. sg/GEDatasets/Datasets. html, 2002.

W. Hu, Y. Huang, L. Wei, F. Zhang, and H. Li, “Deep convolutional neural networks for hyperspectral image classification,†J Sens, vol. 2015, 2015.

J. D. Prusa and T. M. Khoshgoftaar, “Improving deep neural network design with new text data representations,†J Big Data, vol. 4, no. 1, pp. 1–16, 2017.

R. Yamashita, M. Nishio, R. K. G. Do, and K. Togashi, “Convolutional neural networks: an overview and application in radiology,†Insights Imaging, vol. 9, no. 4, pp. 611–629, 2018.

S. Sharma, S. Sharma, and A. Athaiya, “Activation functions in neural networks,†towards data science, vol. 6, no. 12, pp. 310–316, 2017.

J. Feng and S. Lu, “Performance analysis of various activation functions in artificial neural networks,†in Journal of physics: conference series, 2019, vol. 1237, no. 2, p. 22030.

A. F. Agarap, “Deep learning using rectified linear units (relu),†arXiv preprint arXiv:1803.08375, 2018.

C. Nwankpa, W. Ijomah, A. Gachagan, and S. Marshall, “Activation functions: Comparison of trends in practice and research for deep learning,†arXiv preprint arXiv:1811.03378, 2018.

K. Choi, G. Fazekas, M. Sandler, and K. Cho, “Convolutional recurrent neural networks for music classification,†in 2017 IEEE International conference on acoustics, speech and signal processing (ICASSP), 2017, pp. 2392–2396.

C. Garbin, X. Zhu, and O. Marques, “Drop-out vs. batch normalization: an empirical study of their impact to deep learning,†Multimed Tools Appl, vol. 79, no. 19, pp. 12777–12815, 2020.

N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, “Drop-out: a simple way to prevent neural networks from overfitting,†The journal of machine learning research, vol. 15, no. 1, pp. 1929–1958, 2014.