No-Show Passenger Prediction for Flights

Wei-Song Chin - Multimedia University, 63000 Cyberjaya, Selangor, Malaysia
Choo-Yee Ting - Multimedia University, 63000 Cyberjaya, Selangor, Malaysia
Chin-Leei Cham - Multimedia University, 63000 Cyberjaya, Selangor, Malaysia


Citation Format:



DOI: http://dx.doi.org/10.30630/joiv.7.3-2.2328

Abstract


In aviation, “no-show†refers to a customer who booked a reservation but failed to show up. No-shows can result in various resource wastes, such as vacant seats, leading to income loss and flight delays. As a result, no-show passengers can cause considerable problems for airlines, ultimately affecting their bottom line. Recent research has shown the use of machine learning algorithms to reduce the rate of no-shows. For example, a researcher in healthcare is using a predictive model to identify no-shows’ patients to increase efficiency. Therefore, this study aimed to develop prediction models to predict passenger no-shows. In this work, we used a dataset supplied by a local airline company consisting of 1,046,486 rows and 8 columns. Additional datasets like weather data, public holiday data of different countries, aircraft details, and foot traffic data are used to carry out the dataset's feature enrichment task to complement the original dataset. As a result, feature selection has become an important stage in this research to identify and pick the most relevant and useful features from the enormous number of columns. The findings showed that the model built using Random Forest has the highest accuracy of 90.4%, while Decision Tree performed at 90.2%, Gradient Boosting at 86.5%, and Neural Networks at 67.6%. To enhance the accuracy of the models, further research efforts are essential to integrate supplementary passenger information.

Keywords


No-Show; Aviation; Prediction; Machine Learning; Classification; Feature Enrichment; Feature Selection; Random Forest; Decision Tree; Gradient Boosting; Neural Networks

Full Text:

PDF

References


S. AlMuhaideb, O. Alswailem, N. Alsubaie, I. Ferwana, and A. Alnajem, “Prediction of hospital no-show appointments through artificial intelligence algorithms,†Ann Saudi Med, vol. 39, no. 6, pp. 373–381, Dec. 2019, doi: 10.5144/0256-4947.2019.373.

A. Alshammari, R. Almalki, and R. Alshammari, “Developing a Predictive Model of Predicting Appointment No-Show by Using Machine Learning Algorithms,†Journal of Advances in Information Technology, vol. 12, no. 3, 2021, doi: 10.12720/jait.12.3.234-239.

C. Amberger and D. Schreyer, “What do we know about noâ€show behavior? A systematic, interdisciplinary literature review,†J Econ Surv, Sep. 2022, doi: 10.1111/joes.12534.

D. Carreras-García, D. Delgado-Gómez, F. Llorente-Fernández, and A. Arribas-Gil, “Patient No-Show Prediction: A Systematic Literature Review,†Entropy, vol. 22, no. 6, p. 675, Jun. 2020, doi: 10.3390/e22060675.

T. Daghistani, H. AlGhamdi, R. Alshammari, and R. H. AlHazme, “Predictors of outpatients’ no-show: big data analytics using apache spark,†J Big Data, vol. 7, no. 1, p. 108, Dec. 2020, doi: 10.1186/s40537-020-00384-9.

G. Fan, Z. Deng, Q. Ye, and B. Wang, “Machine learning-based prediction models for patients no-show in online outpatient appointments,†Data Science and Management, vol. 2, pp. 45–52, Jun. 2021, doi: 10.1016/j.dsm.2021.06.002.

S. L. Harris and M. Samorani, “On selecting a probabilistic classifier for appointment no-show prediction,†Decis Support Syst, vol. 142, p. 113472, Mar. 2021, doi: 10.1016/j.dss.2020.113472.

D. Marbouh et al., “Evaluating the Impact of Patient No-Shows on Service Quality,†Risk Manag Healthc Policy, vol. Volume 13, pp. 509–517, Jun. 2020, doi: 10.2147/RMHP.S232114.

I. Mohammadi, H. Wu, A. Turkcan, T. Toscos, and B. N. Doebbeling, “Data Analytics and Modeling for Appointment No-show in Community Health Centers,†J Prim Care Community Health, vol. 9, p. 215013271881169, Jan. 2018, doi: 10.1177/2150132718811692.

A. Perez, “Models for Fitting Correlated Non-identical Bernoulli Random Variables with Applications to an Airline Data Problem,†Doctoral Dissertation, Temple University, 2021.

K. Topuz, H. Uner, A. Oztekin, and M. B. Yildirim, “Predicting pediatric clinic no-shows: a decision analytic framework using elastic net and Bayesian belief network,†Ann Oper Res, vol. 263, no. 1–2, pp. 479–499, Apr. 2018, doi: 10.1007/s10479-017-2489-0.

C. Wang, R. Wu, L. Deng, Y. Chen, Y. Li, and Y. Wan, “A Bibliometric Analysis on No-Show Research: Status, Hotspots, Trends and Outlook,†Sustainability, vol. 12, no. 10, p. 3997, May 2020, doi: 10.3390/su12103997.

Syed Arbab Mohd Shihab, Caleb Logemann, Deepak-George Thomas, and Peng Wei, “Autonomous Airline Revenue Management: A Deep Reinforcement Learning Approach to Seat Inventory Control and Overbooking,†Cornell University, 2019.

B. P. Berg et al., “Estimating the Cost of No-Shows and Evaluating the Effects of Mitigation Strategies,†Medical Decision Making, vol. 33, no. 8, pp. 976–985, Nov. 2013, doi: 10.1177/0272989X13478194.

M. Z. I. Chowdhury and T. C. Turin, “Variable selection strategies and its importance in clinical prediction modelling,†Fam Med Community Health, vol. 8, no. 1, p. e000262, Feb. 2020, doi: 10.1136/fmch-2019-000262.

Y. Ding, “Predicting flight delay based on multiple linear regression,†IOP Conf Ser Earth Environ Sci, vol. 81, p. 012198, Aug. 2017, doi: 10.1088/1755-1315/81/1/012198.

N. IDRUS and N. MOHAMED, “FORECASTING THE NUMBER OF AIRPLANE PASSENGERS USING BOX-JENKINS AND ARTIFICIAL NEURAL NETWORK IN MALAYSIA,†Universiti Malaysia Terengganu Journal of Undergraduate Research, vol. 2, no. 4, pp. 89–100, Oct. 2020, doi: 10.46754/umtjur.v2i4.183.

S. T. Lim, J. Y. Yuan, K. W. Khaw, and X. Chew, “Predicting Travel Insurance Purchases in an Insurance Firm through Machine Learning Methods after COVID-19,†Journal of Informatics and Web Engineering, vol. 2, no. 2, pp. 43–58, Sep. 2023, doi: 10.33093/jiwe.2023.2.2.4.

X. Xu, M. Hu, and X. Li, “Coping with no-show behaviour in appointment services: a multistage perspective,†Journal of Service Theory and Practice, vol. 32, no. 3, pp. 452–474, Apr. 2022, doi: 10.1108/JSTP-08-2020-0196.

R. J. Mieloszyk, J. I. Rosenbaum, C. S. Hall, D. S. Hippe, M. L. Gunn, and P. Bhargava, “Environmental Factors Predictive of No-Show Visits in Radiology: Observations of Three Million Outpatient Imaging Visits Over 16 Years,†Journal of the American College of Radiology, vol. 16, no. 4, pp. 554–559, Apr. 2019, doi: 10.1016/j.jacr.2018.12.046.

D. Samano, S. Saha, T. C. Kot, J. E. Potter, and L. M. Duthely, “Impact of Extreme Weather on Healthcare Utilization by People with HIV in Metropolitan Miami,†Int J Environ Res Public Health, vol. 18, no. 5, p. 2442, Mar. 2021, doi: 10.3390/ijerph18052442.

S. Alodhaibi, R. L. Burdett, and P. KDV. Yarlagadda, “Framework for Airport Outbound Passenger Flow Modelling,†Procedia Eng, vol. 174, pp. 1100–1109, 2017, doi: 10.1016/j.proeng.2017.01.263.

Y. Li, X. Gao, Z. Xu, and X. Zhou, “Network-based queuing model for simulating passenger throughput at an airport security checkpoint,†J Air Transp Manag, vol. 66, pp. 13–24, Jan. 2018, doi: 10.1016/j.jairtraman.2017.09.013.

H. Yamada et al., “Modeling and Managing Airport Passenger Flow Under Uncertainty: A Case of Fukuoka Airport in Japan,†2017, pp. 419–430. doi: 10.1007/978-3-319-67256-4_33.

O. Perdikaki, S. Kesavan, and J. M. Swaminathan, “Effect of Traffic on Sales and Conversion Rates of Retail Stores,†Manufacturing & Service Operations Management, vol. 14, no. 1, pp. 145–162, Jan. 2012, doi: 10.1287/msom.1110.0356.

Y. Zhou, D. Dong, and W. Jiang, “Influence Factors of Patient No Show in a Outpatient Department,†IOP Conf Ser Mater Sci Eng, vol. 439, p. 032047, Nov. 2018, doi: 10.1088/1757-899X/439/3/032047.

A. R. Teo, C. W. Forsberg, H. E. Marsh, S. Saha, and S. K. Dobscha, “No-Show Rates When Phone Appointment Reminders Are Not Directly Delivered,†Psychiatric Services, vol. 68, no. 11, pp. 1098–1100, Nov. 2017, doi: 10.1176/appi.ps.201700128.

A. Brieden and P. Gritzmann, “Predicting show rates in air cargo transport,†in 2020 International Conference on Artificial Intelligence and Data Analytics for Air Transportation (AIDA-AT), IEEE, Feb. 2020, pp. 1–9. doi: 10.1109/AIDA-AT48540.2020.9049209.

D. Dalalah, U. Ojiako, and M. Chipulu, “Voluntary overbooking in commercial airline reservations,†J Air Transp Manag, vol. 86, p. 101835, Jul. 2020, doi: 10.1016/j.jairtraman.2020.101835.

Shinta Saylindra, Nurul Islami, Tito Warsito, Ira Rachman, and Imam Ozali, “THE UNDERSTANDING OF AIRLINES OVERBOOKING BY SOME AIRLINES AT THE SOEKARNO HATTA INTERNATIONAL AIRPORT,†Advances in Transportation and Logistics Research, vol. 1, pp. 71–87, 2018.

D. Zenkert, “No-show forecast using passenger booking data,†Lund University, 2017.

O. A. C. Dewi, “Revenue management model based on capacity sharing and overbooking in the airline,†Journal of Engineering and Management in Industrial System, vol. 6, no. 2, pp. 86–94, Dec. 2018, doi: 10.21776/ub.jemis.2018.006.02.3.

D. Victor and M. Stevens, “United Airlines passenger is dragged from an overbooked flight,†The New York Times.

M. Somboon and K. Amaruchkul, “Applied Two-Class Overbooking Model in Thailand’s Passenger Airline Data,†The Asian Journal of Shipping and Logistics, vol. 33, no. 4, pp. 189–198, Dec. 2017, doi: 10.1016/j.ajsl.2017.12.002.

J. An, A. Mikhaylov, and S.-U. Jung, “A Linear Programming approach for robust network revenue management in the airline industry,†J Air Transp Manag, vol. 91, p. 101979, Mar. 2021, doi: 10.1016/j.jairtraman.2020.101979.

L. Lin, X. Liu, X. Liu, T. Zhang, and Y. Cao, “A prediction model to forecast passenger flow based on flight arrangement in airport terminals,†Energy and Built Environment, vol. 4, no. 6, pp. 680–688, Dec. 2023, doi: 10.1016/j.enbenv.2022.06.006.

N. Vojtek, B. Petrović, and P. Milošević, “Decision Support System for Predicting the Number of No-Show Passengers in Airline Industry,†Tehnicki vjesnik - Technical Gazette, vol. 28, no. 1, Feb. 2021, doi: 10.17559/TV-20191215144655.

N. M. Asrah, M. E. Nor, S. N. A. Rahim, and W. K. Leng, “Time Series Forecasting of the Number of Malaysia Airlines and AirAsia Passengers,†J Phys Conf Ser, vol. 995, p. 012006, Apr. 2018, doi: 10.1088/1742-6596/995/1/012006.

P. H. K Tissera, A. N. M. R. S. P. llwana, K. T. Waduge, M. A. l. Perera, D. P. Nawinna, and D. Kasthurirathna, “Predictive Analytics Platform for Airline Industry,†in 2020 2nd International Conference on Advancements in Computing (ICAC), IEEE, Dec. 2020, pp. 108–113. doi: 10.1109/ICAC51239.2020.9357244.