Text Summarization on Verdicts of Industrial Relations Disputes Using the Cross-Latent Semantic Analysis and Long Short-Term Memory

Galih Wasis Wicaksono; Muhammad Nafi Maula Hakim; Nur Hayatin; Nur Putri Hidayah; Tiara Intana Sari

doi:10.30630/joiv.7.3.2052

Galih Wicaksono - University of Muhammadiyah Malang, Malang, Indonesia
Muhammad Hakim - University of Muhammadiyah Malang, Malang, Indonesia
Nur Hayatin - University of Muhammadiyah Malang, Malang, Indonesia
Nur Hidayah - University of Muhammadiyah Malang, Malang, Indonesia
Tiara Sari - University of Muhammadiyah Malang, Malang, Indonesia

Abstract

Documents on industrial relations disputes cover four types of legal dispute. However, the volume of information in these documents makes it difficult for readers to find the essential points. This research aims to automatically summarize court decisions on industrial relations disputes that have permanent legal force. It used 35 court decision documents obtained from the official website of Indonesia's Supreme Court and took an extractive summarization approach using two methods, Cross Latent Semantic Analysis (CLSA) and Long Short-Term Memory (LSTM), which were compared to find the better result. CLSA was used to analyze the connections between phrases, which requires ordering related words before they are assembled into a complete summary. LSTM was combined with an attention module to encode and decode the input into a form the system can process, and several training/testing splits of the documents were tried to find the highest performance the system could reach. The CLSA method achieved a precision of 79.1%, a recall of 39.7%, and a ROUGE-1 score of 50.9%. LSTM improved on this, with a precision of 93.6%, a recall of 94.5%, and a ROUGE-1 score of 93.9% on the split of 95% training and 5% testing.
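
The precision, recall, and ROUGE-1 figures above are unigram-overlap measures between a system summary and a reference summary. The following is a minimal sketch of how such scores can be computed, not the authors' evaluation code. It assumes whitespace tokenization, lowercasing, and that the reported ROUGE-1 score is the F-measure; the function name `rouge_1` and the example sentences are hypothetical.

```python
from collections import Counter


def rouge_1(candidate: str, reference: str) -> dict:
    """Unigram-overlap precision, recall, and F1 (assumed ROUGE-1 variant)."""
    # Assumption: lowercase and split on whitespace; the paper's tokenizer may differ.
    cand_counts = Counter(candidate.lower().split())
    ref_counts = Counter(reference.lower().split())

    # Clipped overlap: each unigram counts at most as often as it occurs in both texts.
    overlap = sum((cand_counts & ref_counts).values())

    cand_total = sum(cand_counts.values())
    ref_total = sum(ref_counts.values())

    precision = overlap / cand_total if cand_total else 0.0
    recall = overlap / ref_total if ref_total else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical example: a short extract scored against a longer reference.
scores = rouge_1(
    "the court rejected the claim",
    "the court rejected the worker claim in full",
)
print(scores)
```

In this example every word of the extract also appears in the reference, so precision is high, while recall is lower because the extract misses three of the eight reference words. The CLSA results show the same pattern of high precision with low recall.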

Keywords

extractive summarization; cross latent semantic analysis; long short-term memory; legal document
