Feature Selection to Enhance DDoS Detection Using Hybrid N-Gram Heuristic Techniques

Andi Maslan; Kamaruddin Malik Bin Mohamad; Abdul Hamid; Hotma Pangaribuan; Sunarsan Sitohang

doi:10.30630/joiv.7.3.1533

Feature Selection to Enhance DDoS Detection Using Hybrid N-Gram Heuristic Techniques

Andi Maslan - Universitas Putera Batam, Indonesia
Kamaruddin Malik Mohamad - Universiti Tun Hussein Onn Malaysia, Batu Pahat Johor, Malaysia
Abdul Hamid - Universiti Tun Hussein Onn Malaysia, Batu Pahat Johor, Malaysia
Hotma Pangaribuan - Universitas Putera Batam, Indonesia
Sunarsan Sitohang - Universitas Putera Batam, Indonesia

Citation Format:

DOI: http://dx.doi.org/10.30630/joiv.7.3.1533

Abstract

Various forms of distributed denial of service (DDoS) assault systems and servers, including traffic overload, request overload, and website breakdowns. Heuristic-based DDoS attack detection is a combination of anomaly-based and pattern-based methods, and it is one of three DDoS attack detection techniques available. The pattern-based method compares a sequence of data packets sent across a computer network using a set of criteria. However, it cannot identify modern assault types, and anomaly-based methods take advantage of the habits that occur in a system. However, this method is difficult to apply because the accuracy is still low, and the false positives are relatively high. Therefore, this study proposes feature selection based on Hybrid N-Gram Heuristic Techniques. The research starts with the conversion process, package extract, and hex payload analysis, focusing on the HTTP protocol. The results show the Hybrid N-Gram Heuristic-based feature selection for the CIC-2017 dataset with the SVM algorithm on the CSDPayload+N-Gram feature with a 4-Gram accuracy rate of 99.86%, MIB- Dataset 2016 with the 2016 algorithm. SVM and CSPayload feature +N-Gram with 100% accuracy for 4-Gram, H2N-Payload Dataset with SVM Algorithm, and CSDPayload+N-Gram feature with 100% accuracy for 4-Gram. As a comparison, the KNN algorithm for 4-Gram has an accuracy rate of 99.44%, and the Neural Network Algorithm has an accuracy rate of 100% for 4-Gram. Thus, the best algorithm for DDoS detection is SVM with Hybrid N-Gram (4-Gram).

Keywords

Chi-square distance; DDoS; Heuristic; N-Gram; Payload

Full Text:

PDF

References

J. J. Kim, Y. S. Lee, J. Y. Moon, and J. M. Park, â€œNetwork payload and correlation analysis in bigdata environments,â€ Int. J. Grid Distrib. Comput., vol. 11, no. 3, pp. 109â€“124, 2018, doi: 10.14257/ijgdc.2018.11.3.10.

M. Alkasassbeh, A. B. A. Hassanat, and G. Al-naymat, â€œDetecting Distributed Denial of Service Attacks Using Data Mining Techniques,â€ vol. 7, no. 1, pp. 436â€“445, 2016.

A. W. Muhammad and I. Riadi, â€œDDoS Attack Detection Using Neural Network with Fixed Moving Average Window Function,â€ vol. 1, no. 3, pp. 115â€“122, 2017.

A. Rahmatulloh, G. M. Ramadhan, I. Darmawan, N. Widiyasono, and D. Pramesti, â€œIdentification of Mirai Botnet in IoT Environment through Denial-of-Service Attacks for Early Warning System,â€ vol. 6, no. September, pp. 623â€“628, 2022.

K. M. I. A. Fouda, â€œPayload Based Signature Generation for DDoS Attacks,â€ University of Twente, 2017.

K. M. Prasad, A. R. M. Reddy, and K. V. Rao, â€œDoS and DDoS Attacks: Defense, Detection and TracebackMechanisms -A Survey,â€ vol. 14, no. 7, 2014.

A. Oza, â€œHTTP Attack Detection using N-gram Analysis,â€ 2013.

M. Najafimehr, S. Zarifzadeh, and S. Mostafavi, A hybrid machine learning approach for detecting unprecedented DDoS attacks, no. 0123456789. Springer US, 2022.

D. Ariu, R. Tronci, and G. Giacinto, â€œHMMPayl: An intrusion detection system based on Hidden Markov Models,â€ Comput. Secur., vol. 30, no. 4, pp. 221â€“241, 2011, doi: 10.1016/j.cose.2010.12.004.

K. Kato and V. Klyuev, â€œAn Intelligent DDoS Attack Detection System Using Packet Analysis and Support Vector Machine,â€ Int. J. Intell. Comput. Res., vol. 5, no. 3, pp. 464â€“471, 2014.

J. David and C. Thomas, â€œDDoS attack detection using fast entropy approach on flow-based network traffic,â€ Procedia Comput. Sci., vol. 50, no. August, pp. 30â€“36, 2015, doi: 10.1016/j.procs.2015.04.007.

N. Bindra and M. Sood, â€œEvaluating the impact of feature selection methods on the performance of the machine learning models in detecting DDoS attacks,â€ Rom. J. Inf. Sci. Technol., vol. 23, no. 3, pp. 250â€“261, 2020.

K. Bouzoubaa, Y. Taher, and B. Nsiri, â€œPredicting DOS-DDOS Attacks: Review and Evaluation Study of Feature Selection Methods based on Wrapper Process,â€ Int. J. Adv. Comput. Sci. Appl., vol. 12, no. 5, pp. 132â€“145, 2021, doi: 10.14569/IJACSA.2021.0120517.

Q. Niyaz, W. Sun, and A. Y. Javaid, â€œA Deep Learning Based DDoS Detection System in Software-Defined Networking (SDN),â€ ICST Trans. Secur. Saf., vol. 4, no. 12, p. 153515, 2017, doi: 10.4108/eai.28-12-2017.153515.

W. Khreich, B. Khosravifar, A. Hamou-Lhadj, and C. Talhi, â€œAn anomaly detection system based on variable N-gram features and one-class SVM,â€ Inf. Softw. Technol., vol. 91, pp. 186â€“197, 2017, doi: https://doi.org/10.1016/j.infsof.2017.07.009.

S. Sridharan, â€œDefeating n-gram Scores for HTTP Attack Detection,â€ SJSU Sch. Work., vol. 6, no. San Jose State University, pp. 1â€“37, 2016, doi: 10.31979/etd.japx-z6eu.

S. Khunkitti, A. Siritaratiwat, and S. Premrudeepreechacharn, â€œMulti-objective optimal power flow problems based on slime mould algorithm,â€ Sustain., vol. 13, no. 13, 2021, doi: 10.3390/su13137448.

L. Csikar, â€œDecision making in the sciences : understanding heuristic use by students in problem solving,â€ p. 124, 2018, doi: 10.25777/dzej-k872.

R. R. Rumare and H. T. Ciptaningtyas, â€œHTTP Attack Detection Application Using N-Gram,â€ J. Tek. ITS, vol. 6, no. 2, pp. 2â€“5, 2017, doi: 10.12962/j23373539.v6i2.24230.

S. Bista and R. Chitrakar, â€œDDoS Attack Detection Using Heuristics Clustering Algorithm and Nave Bayes Classification,â€ J. Inf. Secur., vol. 09, no. 01, pp. 33â€“44, 2018, doi: 10.4236/jis.2018.91004.

A. Maslan, K. M. Mohamad, and C. F. M. Foozy, â€œEnhancement detection distributed denial of service attacks using hybrid n-gram techniques,â€ Telkomnika (Telecommunication Comput. Electron. Control., vol. 20, no. 1, pp. 61â€“69, 2022, doi: 10.12928/TELKOMNIKA.v20i1.18103.

H. Zhao, Z. Chang, G. Bao, and X. Zeng, â€œMalicious Domain Names Detection Algorithm Based on N -Gram,â€ vol. 2019, 2019.

W. B. Cavnar and J. M. Trenkle, â€œN-Gram-Based Text Categorization N-Gram-Based Text Categorization,â€ Proc. Third Annu. Symp. Doc. Anal. Inf. Retr., no. May, pp. 1â€“14, 2001.

J. Daniel and J. H. Martin, â€œstanford n-gram_Speech and Language Processing,â€ 2021.

F. Angiulli, L. Argento, and A. Furfaro, â€œExploiting n-gram location for intrusion detection,â€ CS.CR, vol. 3, no. Cornell University, pp. 1â€“6, 2016, doi: 10.1109/ICTAI.2015.155.

Z. Bazrafshan, H. Hashemi, S. M. H. Fard, and A. Hamzeh, â€œA survey on heuristic malware detection techniques,â€ IKT 2013 - 2013 5th Conf. Inf. Knowl. Technol., no. May, pp. 113â€“120, 2013, doi: 10.1109/IKT.2013.6620049.

W. Halim, â€œDeteksi Malware dengan Menggunakan API Calls,â€ Paper, p. 15, 2020.

A. Shabtai, R. Moskovitch, C. Feher, S. Dolev, and Y. Elovici, â€œDetecting unknown malicious code by applying classification techniques on OpCode patterns,â€ Secur. Inform., vol. 1, no. 1, p. 1, 2012, doi: 10.1186/2190-8532-1-1.

T. Abou-Assaleh, N. Cercone, V. Keselj, and R. Sweidan, â€œN-gram-based detection of new malicious code,â€ in Proc of the 28th Annual International Computer Software and Applications Conference, IEEE Computer Society, 2004, vol. 2, pp. 41â€“42 vol.2, doi: 10.1109/CMPSAC.2004.1342667.

I. Journal, O. F. Engineering, C. Of, M. Virus, and U. N. Gram, â€œInternational journal of engineering sciences & research technology classification of metamorphic virus using n gram analysis,â€ vol. 6, no. 2, pp. 364â€“370, 2017.

L. Tan, â€œThe worst-case execution time tool challenge 2006,â€ STTT, vol. 11, pp. 133â€“152, 2009, doi: 10.1109/ISoLA.2006.72.

T. McCabe, â€œA Complexity Measure,â€ IEEE Trans. Softw. Eng., vol. SE-2, pp. 308â€“320, 1976.

P. Jalote, An Integrated Approach to Software Engineering. 1997.

M. A. H. Azmi, C. F. M. Foozy, K. A. M. Sukri, N. A. Abdullah, I. R. A. Hamid, and H. Amnur, â€œFeature Selection Approach to Detect DDoS Attack Using Machine Learning Algorithms,â€ Int. J. Informatics Vis., vol. 5, no. 4, pp. 395â€“401, 2021, doi: 10.30630/JOIV.5.4.734.

A. MartÃn, R. Lara-Cabrera, and D. Camacho, â€œAndroid malware detection through hybrid features fusion and ensemble classifiers: The AndroPyTool framework and the OmniDroid dataset,â€ Inf. Fusion, vol. 52, no. December, pp. 128â€“142, 2019, doi: 10.1016/j.inffus.2018.12.006.

S. Almutairi, S. Mahfoudh, S. Almutairi, and J. S. Alowibdi, â€œHybrid Botnet Detection Based on Host and Network Analysis,â€ J. Comput. Networks Commun., vol. 2020, no. Hindawi, pp. 1â€“16, 2020, doi: 10.1155/2020/9024726.

C. Ma, X. Du, and L. Cao, â€œAnalysis of multi-Types of flow features based on hybrid neural network for improving network anomaly detection,â€ IEEE Access, vol. 7, pp. 148363â€“148380, 2019, doi: 10.1109/ACCESS.2019.2946708.

Z. Chiba, N. Abghour, K. Moussaid, A. El, and M. Rida, â€œIntelligent and Improved Self-Adaptive Anomaly based Intrusion Detection System for Networks,â€ vol. 11, no. 2, pp. 312â€“330, 2019.

T. Mahjabin, Y. Xiao, G. Sun, and W. Jiang, â€œA survey of distributed denial-of-service attack, prevention, and mitigation techniques,â€ Int. J. Distrib. Sens. Networks, vol. 13, no. 12, 2017, doi: 10.1177/1550147717741463.

Username
Password
Remember me