Have a personal or library account? Click to login
An Automated Machine Learning Based Decision Support System to Predict Hotel Booking Cancellations Cover

An Automated Machine Learning Based Decision Support System to Predict Hotel Booking Cancellations

Open Access
|Jul 2019

Figures & Tables

dsj-18-913-g1.png
Figure 1

Cancellation ratio per year.

dsj-18-913-g2.png
Figure 2

Cancellation ratio per month.

Table 1

Models’ estimation parameters selection source code.

ParameterR source code
colsample_bytreerunif(1, 0.4, 0.8)
etarunif(1, 0.01, 0.3)
gammarunif(1, 0, 0.2)
lambdarunif(1, 0, 0.5)
max_delta_stepsample(1:5, 1)
max_depthsample(2:4, 1)
min_child_weightsample(1:5, 1)
dsj-18-913-g3.png
Figure 3

H1 bookings status at a moment in time.

dsj-18-913-g4.png
Figure 4

System architecture diagram.

dsj-18-913-g5.png
Figure 5

Daily automation cycle diagram.

dsj-18-913-g6.png
Figure 6

Observations weighting mechanism diagram.

dsj-18-913-g7.png
Figure 7

Prototype’s main screen – Planning.

Table 2

Performance metrics on the 31st of August 2017.

HotelDatasetAccuracyPrecisionF1ScoreAUCSensitivitySpecificity
H1Train0.86460.84840.74100.92270.65770.9510
Test0.84860.82050.70160.88640.61280.9452
H2Train0.87010.88490.84600.94380.81030.9171
Test0.85630.87310.82740.92760.78620.9110
Table 3

A/B testing effective cancellation summary.

HotelGroupCanceledNot canceledTotal% CanceledActions% Actions
H1A4861,4891,97524.6%N/AN/A
B4831,5262,00924.0%1095.4%
H2A1,0433,0604,10325.4%N/AN/A
B1,0253,0864,11124.9%1964.8%
dsj-18-913-g8.png
Figure 8

Cancellation ratio by minimum frequency.

Note: MF threshold levels were selected based on the users’ criteria to select the bookings to contact. The majority of time, users only selected bookings with a MF equal to or greater than 50%.

Table 4

“B” group cancellation results summary.

HotelActionMF ≥ 0% (all bookings)MF ≥ 50%
CanceledNot canceled% CanceledCanceledNot canceled% Canceled
H1No4711,42924.8%12515345.0%
Yes129711.0%6707.9%
H2No1,0102,90525.8%26932545.3%
Yes151817.7%91117.5%
Table A.1

Features description.

FeatureTypeDescription
ADRThirdQuartileDeviationN, ERatio calculated by the division of the booking ADR by the third quartile value, of all bookings of the same distribution channel, same reserved room type, for the same expected week/year of arrival.
AdultsN, INumber of adults
AgentC, IID of agency (if booked via an agency)
BabiesN, INumber of babies
BookingChangesN, EHeuristic created by summing the number of booking changes (amendments) prior to arrival that can indicate cancellation intentions (arrival or departure dates, number of persons, type of meal, ADR, or reserved room type)
ChildrenN, INumber of children
CompanyC, IID of company/corporation (if an account was associated with it)
CustomerTypeC, EType of customer (group, contract, transient, or transient-party); this last category is a heuristic built when the booking is transient but is fully or partially paid in conjunction with other bookings (e.g., small groups, such as families who require more than one room)
DaysInWaitingListN, INumber of days the booking was on a waiting list prior to confirming the availability and being confirmed as a booking
DepositTypeC, ESince hotels had different cancellation and deposit policies, a heuristic was developed to define the deposit type (nonrefundable, refundable, no deposit): payment made in full before the arrival date was considered to be a “nonrefundable” deposit, and a partial payment before arrival was considered to be a “refundable” deposit; otherwise, it was considered to be “no deposit”
DistributionChannelC, IDistribution channel used to make the booking
IsRepeatedGuestC, EBinary value that indicates if the booking holder, at the time of booking creation, was a repeat guest at the hotel (0: no; 1: yes); created by comparing the time of booking with the guest profile creation record
LiveTimeN, ENumber of days from booking creation according to the booking status: for “A” type bookings, it was calculated as the number of days between booking creation and arrival; for “B” bookings, the elapsed number of days between the date of booking creation and the cancellation date was employed; for “C” bookings, the elapsed number of days between the date of creation and the processing date (current date)
MarketSegmentC, IMarket segment to which the booking was classified as
MealC, IID of meal requested by the guest
PreviousCancellationRatioN, ERatio created by the division of the guest’s number of previous cancellations by the guest’s previous number of bookings at the hotel
StaysInWeekendNightsN, EFrom the total length of stay, the number of weekend nights (Saturday and Sunday)
StaysInWeekNightsN, EFrom the total length of stay, the number of weekday nights (Monday through Friday)
TotalOfSpecialRequestsN, ENumber of special requests (e.g., fruit basket and sea view)
WasInWaitingListC, IBinary value that indicates if the booking was entered on a waiting list or directly entered as a booking (0: normal booking; 1: waiting list);

[i] Type legend: C- Categorical, E- Engineered, I-Input, N- Numerical.

Language: English
Submitted on: Dec 5, 2018
Accepted on: Jun 19, 2019
Published on: Jul 8, 2019
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2019 Nuno Antonio, Ana de Almeida, Luis Nunes, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.