Efficient processing of GRU based on word embedding for text classification

Muhammad Zulqarnain; Rozaida Ghazali; Muhammad Ghulam Ghouse; Muhammad Faheem Mushtaq

doi:10.30630/joiv.3.4.289

Efficient processing of GRU based on word embedding for text classification

Muhammad Zulqarnain - Universiti Tun Hussein Onn Malaysia, Johor, Malaysia
Rozaida Ghazali - Universiti Tun Hussein Onn Malaysia, Johor, Malaysia
Muhammad Ghouse - Universiti Tun Hussein Onn Malaysia, Johor, Malaysia
Muhammad Mushtaq - Universiti Tun Hussein Onn Malaysia, Johor, Malaysia

Citation Format:

DOI: http://dx.doi.org/10.30630/joiv.3.4.289

Abstract

Text classification has become very serious problem for big organization to manage the large amount of online data and has been extensively applied in the tasks of Natural Language Processing (NLP). Text classification can support users to excellently manage and exploit meaningful information require to be classified into various categories for further use. In order to best classify texts, our research efforts to develop a deep learning approach which obtains superior performance in text classification than other RNNs approaches. However, the main problem in text classification is how to enhance the classification accuracy and the sparsity of the data semantics sensitivity to context often hinders the classification performance of texts. In order to overcome the weakness, in this paper we proposed unified structure to investigate the effects of word embedding and Gated Recurrent Unit (GRU) for text classification on two benchmark datasets included (Google snippets and TREC). GRU is a well-known type of recurrent neural network (RNN), which is ability of computing sequential data over its recurrent architecture. Experimentally, the semantically connected words are commonly near to each other in embedding spaces. First, words in posts are changed into vectors via word embedding technique. Then, the words sequential in sentences are fed to GRU to extract the contextual semantics between words. The experimental results showed that proposed GRU model can effectively learn the word usage in context of texts provided training data. The quantity and quality of training data significantly affected the performance. We evaluated the performance of proposed approach with traditional recurrent approaches, RNN, MV-RNN and LSTM,Â the proposed approach is obtained better results on two benchmark datasets in the term of accuracy and error rate.

Keywords

RNN; GRU; LSTM; Word embedding; Text classification; Natural language processing;

Full Text:

PDF

References

J. Protasiewicz, â€œA recent overview of the state-of-the-art elements of text classification,â€ Expert Syst. Appl., vol. 106, pp. 36â€“54, 2018.

W. Sharif, N. A. Samsudin, M. M. Deris, and M. Aamir, â€œImproved relative discriminative criterion feature ranking technique for text classification,â€ Int. J. Artif. Intell., vol. 15, no. 2, pp. 61â€“78, 2017.

A. Garcia-Garcia, S. Orts-Escolano, S. Oprea, V. Villena-Martinez, and J. Garcia-Rodriguez, â€œA Review on Deep Learning Techniques Applied to Semantic Segmentation,â€ pp. 1â€“23, 2017.

M. Oquab, L. Bottou, I. Laptev, and J. Sivic, â€œLearning and Transferring Mid-Level Image Representations using Convolutional Neural Networks,â€ IEEE Conf. Comput. Vis. Pattern Recognit., pp. 1717â€“1724, 2014.

D. Tang, F. Wei, B. Qin, N. Yang, T. Liu, and M. Zhou, â€œSentiment Embeddings with Applications to Sentiment Analysis,â€ IEEE Trans. Knowl. Data Eng., vol. 28, no. October, pp. 496â€“509, 2016.

R. Zhao, W. Ouyang, H. Li, and X. Wang, â€œSaliency detection by multi-context deep learning,â€ Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., vol. 07â€“12â€“June, pp. 1265â€“1274, 2015.

O. I. and C. Cardie, â€œDeep Recursive Neural Networks for Compositionality in Language,â€ Adv. neural Inf. Process. Syst., pp. 2096â€“2104, 2014.

A. Dahou, M. A. Elaziz, J. Zhou, and S. Xiong, â€œArabic Sentiment Classification Using Convolutional Neural Network and Differential Evolution Algorithm,â€ Comput. Intell. Neurosci., vol. 2019, pp. 1â€“16, 2019.

S. Hochreiter, â€œLong Short Term Memory,â€ Neural Comput., vol. 9, no. 8, pp. 1â€“32, 1997.

K. Cho, â€œOn the Properties of Neural Machine Translation: Encoderâ€“Decoder Approaches,â€ arXiv, vol. 5, pp. 1â€“9, 2014.

R. Collobert and J. Weston, â€œA unified architecture for natural language processing,â€ Proc. 25th Int. Conf. Mach. Learn. - ICML â€™08, pp. 160â€“167, 2008.

R. Socher, A. Perelygin, and J. Wu, â€œRecursive deep models for semantic compositionality over a sentiment treebank,â€ Proc. â€¦, no. October, pp. 1631â€“1642, 2013.

M. Iyyer, J. Boyd-Graber, L. Claudino, R. Socher, and H. DaumÃ© III, â€œA Neural Network for Factoid Question Answering over Paragraphs,â€ Proc. 2014 Conf. Empir. Methods Nat. Lang. Process., pp. 633â€“644, 2014.

S. R. Bowman, G. Angeli, C. Potts, and C. D. Manning, â€œA large annotated corpus for learning natural language inference,â€ 2015.

A. Kumar et al., â€œAsk Me Anything: Dynamic Memory Networks for Natural Language Processing,â€ vol. 48, 2015.

M. Ravanelli, P. Brakel, M. Omologo, and Y. Bengio, â€œLight Gated Recurrent Units for Speech Recognition,â€ IEEE Trans. Emerg. Top. Comput. Intell., vol. 2, no. 2, pp. 92â€“102, 2018.

T. Mikolov, J. Kopecky, L. Burget, O. Glembek, and J. Cernocky, â€œNeural network based language models for highly inflective languages,â€ Icassp-2009, pp. 4725â€“4728, 2009.

T. Mikolov, G. Corrado, K. Chen, and J. Dean, â€œEfficient Estimation ofWord Representations in Vector Space,â€ arXiv Prepr. arXiv1301.3781, pp. 1â€“12, 2013.

M. J. Berger, â€œLarge Scale Multi-label Text Classification with Semantic Word Vectors,â€ Tech. Rep., pp. 1â€“8, 2014.

P. Liu, X. Qiu, and X. Huang, â€œRecurrent Neural Network for Text Classification with Multi-Task Learning,â€ Proc. 25th Int. Jt. Conf. Artif. Intell. IJCAI-16, p. to appear, 2016.

Y. Xiao and K. Cho, â€œEfficient Character-level Document Classification by Combining Convolution and Recurrent Layers,â€ arXiv, vol. 1602, no. 00367, 2016.

A. Karpathy, â€œDeep Visual-Semantic Alignments for Generating Image Descriptions.â€

M. Sundermeyer, H. Ney, and R. SchlÃ¼ter, â€œFrom Feedforward to Recurrent LSTM Neural Networks for Language Modeling,â€ IEEE/ACM Trans. Audio, Speech, Lang. Process, vol. 23, no. 3, pp. 517â€“529, 2015.

Z. H. I. Li, F. A. N. Yang, and Y. Luo, â€œContext Embedding Based on Bi-LSTM in Semi-Supervised Biomedical Word Sense Disambiguation,â€ IEEE Access, vol. 7, pp. 72928â€“72935, 2019.

V. Srividhya and R. Anitha, â€œEvaluating Preprocessing Techniques in Text Categor ization,â€ Int. J. Comput. Sci. Appl., vol. 47, no. April, pp. 49â€“51, 2010.

R. Johnson and T. Zhang, â€œEffective Use of Word Order for Text Categorization with Convolutional Neural Networks,â€ no. 2011, 2014.

J. Pennington, R. Socher, and C. D. Manning, â€œGloVe : Global Vectors for Word Representation,â€ Proc. Conf. Empir. Methods Nat. Lang. Process., no. October, pp. 1532â€“1543, 2014.

A. Cochez et al., â€œThis is an electronic reprint of the original article . This reprint may differ from the original in pagination and typographic detail . Global RDF Vector Space Embeddings,â€ Int. Semant. Web Conf., pp. 190â€“207, 2017.

R. Socher, J. Pennington, E. H. Huang, A. Y. Ng, and C. D. Manning, â€œSemi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions,â€ Proc. Conf. Empir. methods Nat. Lang. Process., no. ii, pp. 151â€“161, 2011.

X. Phan, â€œLearning to Classify Short and Sparse Text & Web with Hidden Topics from Large-scale Data Collections,â€ Proc. 17th Int. Conf. World Wide Web, pp. 91â€“100, 2008.

D. Roth, â€œLearning Question Classifiers Â£,â€ Proc. 19th Int. Conf. Comput. Linguist., vol. 1, no. August, pp. 1â€“7, 2002.

H. Lee, â€œfor Modeling Sentences and Documents,â€ Proc. 15th Annu. Conf. North Am. Chapter Assoc. Comput., no. June, pp. 1512â€“1521, 2015.

D. P. Kingma and J. L. Ba, â€œA method for stochastic optimization,â€ arXiv, no. March, pp. 1â€“15, 2015.

G. Hinton, â€œDropout : A Simple Way to Prevent Neural Networks from Overfitting,â€ J. Mach. Learn. Res. 2014, 15, vol. 15, pp. 1929â€“1958, 2014.

Username
Password
Remember me