Impact of the COVID-19 Pandemic on the Revenue of the Catering Industry: Taiwan as an Example

Abstract

Due to the impact of the COVID-19 pandemic, people have reduced eating out, resulting in a severe drop in the revenue of the catering industry. Health risks have become a major factor affecting the revenue of this industry. Predicting the revenue of the catering industry during the COVID-19 pandemic will not only allow practitioners to adjust their business strategies, but also provide a reference for governments to formulate relief measures. To this end, this study proposes a fuzzy big data analytics approach in which random forests, recursive feature elimination, fuzzy c-means, and deep neural networks are jointly applied. First, random forests and recursive feature elimination are used to select the most influential factors. The data is then divided into clusters by fuzzy c-means. Subsequently, a deep neural network is built for each cluster to make predictions. The prediction results of individual clusters are then aggregated to improve prediction accuracy. The proposed methodology has been applied to forecast the revenue of the catering industry in Taiwan. The results of the experiment showed that the impact of new deaths on the revenue of the catering industry was far greater than that of newly diagnosed COVID-19 cases.

Keywords

big data analytics, fuzzy, catering industry, forecasting, COVID-19

Introduction

This study aims to predict the revenue of the catering industry during the COVID-19 pandemic. The background of this study is described below.

Since 2019, due to the impact of the COVID-19 pandemic, the number of tourists has decreased (T. T. C. Chen et al., 2022), and many areas have prohibited catering businesses from offering indoor dining (T. Chen & Chiu, 2022). At the same time, people go out less and their eating behavior has changed, which seriously affects the operation of the catering industry (Ministry of Finance, 2022; Norris et al., 2021). In Taiwan, affected by the COVID-19 pandemic, the annual revenue of the catering industry declined for the first time in 2019, as shown in Figure 1. Therefore, this study aims to predict the revenue of the catering industry in Taiwan amid the COVID-19 pandemic.

Figure 1.

Annual revenue of the catering industry in Taiwan (Data source: Ministry of Finance, 2022).

The significance of this study is explained below:

This topic was seldom discussed in the past, because the revenue of the catering industry is a time series affected by many factors and full of stochasticity (Li et al., 2020; Mayurnikova et al., 2015; Ugarova et al., 2019; Xie et al., 2008).

There are many factors that may affect the revenue of the catering industry, such as wage levels, labor supply, raw material prices, economic conditions, commodity prices, weather, business tax rates, food safety-related news, etc., as shown in Figure 2. In addition, many factors are constantly fluctuating and can be measured in various ways, leading to a complex big data problem (Nageshwaran et al., 2021).

The revenue of the catering industry has been growing steadily in the past (Ministry of Finance, 2022), which provided enough incentive for practitioners to enter the industry and/or make investment decisions. These practitioners are at a loss as to what to do in the face of the impact of the COVID-19 pandemic.

Furthermore, unlike before, during the COVID-19 pandemic, the revenue of the catering industry is greatly affected by health risks (Lin & Chen, 2022). Therefore, existing methods may not be directly applicable (Wu et al., 2020).

Figure 2.

Factors affecting the revenue of the catering industry during the COVID-19 pandemic.

The contributions of this study are:

This study is one of the first attempts to predict the revenue of the catering industry taking into account the health risks posed by the COVID-19 pandemic.

The prediction results can help the government formulate supplementary measures (such as relief subsidy budget) or tax standards (such as business tax, entertainment tax and other taxes) for the catering industry (Liu et al., 2013; Whitfield & Duffy, 2013).

Relevant forecasts can also provide reference for investors’ investment timing, direction, amount and plan (Goodman et al., 2014).

For catering practitioners, the results of this study provide suggestions for making production, scheduling, business operation, and employment plans (Ren & Lv, 2014).

To predict the revenue of the catering industry during the COVID-19 pandemic, this study proposes a fuzzy big data analytics approach in which random forest (RF), recursive feature elimination (RFE), fuzzy c-means (FCM), and deep neural network (DNN) are applied jointly. In the proposed methodology, first, RF and RFE are used to select the most influential factors for predicting the revenue of the catering industry during the COVID-19 pandemic. The data is then divided into clusters by FCM. Both of these measures reduce the dimensionality or amount of data processed at one time. Subsequently, a DNN is built for each cluster to make predictions. By optimizing the network architectures, the effectiveness of these DNNs in predicting the revenue of the catering industry during the COVID-19 pandemic is improved.

The rest of this paper is organized as follows. Section 2 is dedicated to the literature review. Section 3 presents the fuzzy big data analytics approach proposed in this study. Section 4 details the application of the fuzzy big data analytics approach to forecast the revenue of the catering industry in Taiwan during the COVID-19 pandemic. Finally, Section 5 summarizes this study and provides some directions for future research.

Literature Review

Forecasting in the Catering Industry Amid the COVID-19 Pandemic

Some related literature is reviewed below. Several studies have established regression or time series models, such as autoregressive integrated moving average (ARIMA) and panel data models, to predict changes in sales and revenue in the catering industry (Su, 2020; Yang et al., 2020). Yang et al. (2020) developed a two-way fixed-effects panel data model to predict restaurant sales and stay-at-home orders across US counties. Unexpectedly, with the increase in daily confirmed cases, both the sales and stay-at-home orders of restaurants dropped, especially stay-at-home orders. Oblander and McCarthy (2021) modeled the average order size of a restaurant customer as a homogeneous log-linear function of the customer’s tenure, which is a simple time series model. Panzone et al. (2021) built a multiplicative seasonal autoregressive integrated moving average (SARIMA) model to predict annual restaurant sales in the UK to quantify the impact of the COVID-19 pandemic on the dining industry. Machine learning techniques have also been applied to achieve this goal (De Silva et al., 2021; Hornstein et al., 2021). For example, Xie et al. (2008) constructed a support vector machine (SVM) to predict sales in the catering industry in China. The inputs to the SVM were the sales of past periods. Sun et al. (2021) built a long short-term memory (LSTM) network to predict the sales of a catering business. LSTMs are deep neural networks (DNNs). Like some previous studies, the inputs of the LSTM network were the sales of the previous periods, and the sales of the next periods were predicted accordingly. Tanizaki et al. (2021) compared the performances of several machine learning methods (including gradient boosting regression (GBR), Bayesian linear regression, boosted decision tree, decision forest, and RF) and deep learning methods (including recurrent neural network and LSTM) in predicting the number of customers visiting restaurants during the COVID-19 pandemic.
Based on experimental results, the machine (or deep) learning method that achieved the highest accuracy varied from restaurant to restaurant. Furthermore, the prediction accuracy achieved using deep learning methods was not necessarily higher than that achieved with machine learning methods. In sum, existing methods typically predict future sales or revenue based on past data, assuming no sudden changes will occur. On the contrary, the COVID-19 pandemic has had a severe impact on the global economy, and the economic losses in various industries are incalculable. Under such drastic changes, the forecasting accuracy using existing methods may be far from satisfactory. To address this issue, it is necessary to explore the relationship between health risks posed by the COVID-19 pandemic and the revenue of the catering industry.

Fuzzy Methods for Big Data Forecasting

Fuzzy methods have been widely used to consider various uncertainties in big data prediction tasks (Alvisi & Franchini, 2011; T. Chen et al., 2021; M.-C. Chiu et al., 2020). First, fuzzy sets with range and different possibilities can be used to represent the inputs of fuzzy forecasting methods that may be fluctuating, qualitative or subjective (T. Chen & Wang, 2019; Hadjimichael et al., 2002; Khemavuk & Leenatham, 2021). For example, to predict the demand for a product, Khemavuk and Leenatham (2021) considered qualitative/subjective factors such as product quality, customer satisfaction, and competitive effects. The values of these factors were represented by fuzzy sets and fed into an adaptive network-based fuzzy inference system (ANFIS). To make the production plan of a ubiquitous healthcare system of 3D printing facilities for making dentures, M. C. Chiu and Chen (2022) predicted the printing time of dentures and the transportation time to deliver printed dentures using type-II fuzzy sets.

In addition, fuzzy classifiers are applied to fuzzily divide the collected big data into several possibly overlapping clusters (T. Chen & Wang, 2016; Guha & Veeranjaneyulu, 2019; Wu et al., 2020). In other words, each data point belongs to all clusters, but to varying degrees. For example, T. Chen and Wang (2016) used an artificial neural network (ANN) to estimate the workload of a simulation task on a cloud-based simulation system. To this end, FCM was applied to divide the simulation tasks into multiple clusters, which were processed on different clouds. Guha and Veeranjaneyulu (2019) predicted a firm’s bankruptcy risk by considering its financial performance, in which the firms to be evaluated were classified using FCM.

Furthermore, since the actual value of a predicted target is rarely equal to the predicted value, estimating the target’s range is an alternative. Fuzzy sets are a suitable choice to express this (T. Chen & Chiu, 2021). For example, T. Chen and Chiu (2021) proposed a fuzzy collaborative forecasting method for dynamic random access memory (DRAM) yield forecasting, which was a typical big data problem because of the many influencing factors. Triangular fuzzy numbers (TFNs) were used to represent fuzzy yield forecasts. Rubio et al. (2017) used trapezoidal fuzzy numbers (TrFNs) to predict stock indices in several stock markets, which was also a typical big data problem. For this purpose, a weighted sum of past values was calculated and a new weighting rule was defined. However, a fuzzy forecast should have a narrow range while still containing the actual value (Wang et al., 2021).

Methodology

Theoretical Background

A fuzzy big data analytics approach consists of five main steps: feature selection, pre-classification, DNN training, aggregation, and prediction performance evaluation:

Feature selection: The first step is to perform feature selection to select features that are more relevant for the prediction purpose. There are many existing methods, including variance threshold (i.e., removing features with low variance), univariate feature selection (i.e., correlation analysis) (Gogtay & Thatte, 2017), the RFE method (Yan & Zhang, 2015), L1-based feature selection (Shekar & Dagnew, 2020), tree-based feature selection (Suresh & Bharathi, 2016), sequential feature selection (Aggrawal & Pal, 2020), etc. Among them, the RFE method has received the most attention in big data related applications (Park & Kim, 2020; Ustebay et al., 2018; Yan & Zhang, 2015).

Pre-classification: Pre-classification divides the data into multiple clusters that can be processed at the same time, which improves the processing efficiency and may also improve the prediction accuracy. Traditional classifiers, such as k-means (kM), FCM, self-organizing map (SOM), decision tree, classification and regression tree (CART), and RF, are applicable. However, traditional classifiers may not be able to handle the imbalance between classes. To address this problem, three approaches can be taken (Fernández et al., 2017): data-level processing to rebalance the training set, algorithmic processing to adapt the learning phase to a small cluster, and cost-sensitive processing that takes into account the costs caused by cluster imbalance. FCM, as a traditional classifier, can also solve this problem, because each data point belongs to all clusters, but with different memberships (T. C. T. Chen et al., 2020).

Prediction: Prediction is one of the most critical applications of big data analytics methods (T. C. T. Chen, 2022a). There are many existing big data prediction methods, and different methods are suitable for different purposes, such as the dynamic factor model for predicting the diffusion index, the multi-factor augmented Bayesian shrinkage model for predicting employment, the factor augmented error correction model for predicting bilateral exchange rates, the Bayesian regression model for predicting price index, an artificial neural network (ANN) for predicting household expenditure (Hassani & Silva, 2015), principal component regression for predicting job cycle time, principal component analysis (PCA)-ANN for predicting price index, etc. (T. C. T. Chen, 2022b). Hassani and Silva (2015) divided existing big data forecasting methods into two categories: statistical techniques and data mining techniques.

Aggregation: In a fuzzy big data analytics approach, a data point cannot be absolutely classified into a single cluster. Therefore, the prediction methods of all clusters can be applied to make predictions for the data point. Then, the predictions produced by all prediction methods need to be aggregated. Existing aggregation techniques include weighted average, fuzzy interaction (T. Chen & Lin, 2008; Yolcu et al., 2016), back propagation network (Lin & Chen, 2019), etc.

Prediction performance evaluation: The prediction performance of fuzzy big data analytics methods needs to be evaluated from three aspects. First, efficiency, in terms of execution time, is critical for big data analytics problems. Second, forecasting accuracy can be measured by mean absolute error (MAE), mean absolute percentage error (MAPE), and root mean squared error (RMSE). Third, the average range and hit rate of fuzzy forecasts can be used to evaluate the precision that a fuzzy big data analytics method can achieve.
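The accuracy and precision measures named in the last step can be sketched as follows. This is a minimal illustration rather than the implementation used in this study: the function names are hypothetical, a fuzzy forecast is assumed to be reduced to a (lower, upper) interval, and the hit rate is assumed to be the share of actual values that fall inside their intervals.

```python
import math


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(y - p) for y, p in zip(actual, predicted)) / len(actual)


def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be nonzero)."""
    return 100.0 * sum(abs(y - p) / abs(y) for y, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error."""
    return math.sqrt(sum((y - p) ** 2 for y, p in zip(actual, predicted)) / len(actual))


def average_range(intervals):
    """Average width of the (lower, upper) intervals of the fuzzy forecasts."""
    return sum(upper - lower for lower, upper in intervals) / len(intervals)


def hit_rate(actual, intervals):
    """Share of actual values contained in their forecast interval (assumed definition)."""
    hits = sum(1 for y, (lower, upper) in zip(actual, intervals) if lower <= y <= upper)
    return hits / len(actual)


# Illustrative numbers only, not data from the case study.
actual = [100.0, 200.0]
predicted = [110.0, 190.0]
intervals = [(90.0, 105.0), (150.0, 190.0)]
```

A narrow average range combined with a high hit rate indicates a precise fuzzy forecast; the two measures pull in opposite directions, which is why both are reported.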

Procedure

The implementation process of the proposed methodology comprises the following steps (shown in Figure 3):

Step 1. Feature selection.

Step 2. Data splitting.

Step 3. Pre-classification.

Step 4. Prediction.

Step 5. Aggregation.

Step 6. Prediction performance evaluation.

Figure 3.

Implementation process of the proposed methodology.

Two characteristics of the proposed methodology are fuzzy logic and big data analytics. Among these steps, fuzzy techniques are applied to pre-classification and aggregation, while big data analytics is applied in feature selection, prediction, and aggregation.

The definitions of parameters and variables in the proposed methodology are given in Table 1.

Table 1.

Definitions of Parameters and Variables in the Proposed Methodology.

Parameters/Variables	Definition
$μ_{tk}$	Membership of example t belonging to cluster k
$μ_{t (k)}^{(r)}$	Membership that example t belongs to cluster k after the r-th iteration
$θ_{l_{ξ}}^{h (ξ)}$	Threshold on the $l_{ξ}$ -th node in the ξ-th hidden layer
$θ^{o}$	Threshold on the output node
d	Real number representing the threshold of membership convergence
$e_{tk}$	Distance from example t to the centroid of cluster k
$h_{t l_{ξ}}^{(ξ)}$	Output of example t from the $l_{ξ}$ -th node in the ξ-th hidden layer
$I_{t l_{ξ}}^{h (ξ)}$	Input of example t to the $l_{ξ}$ -th node in the ξ-th hidden layer
$I_{t}^{o}$	Input of example t to the output node
$J_{m}$	Weighted sum of within-cluster variances
$o_{t}$	Output of example t from the output node
$w_{i l_{1}}^{h (1)}$	Connection weight between the i-th input node and the $l_{1}$ node in the first hidden layer
$w_{l_{ξ - 1} l_{ξ}}^{h (ξ)}$	Connection weight between the $l_{ξ - 1}$ -th node in the (ξ–1)-th hidden layer and the $l_{ξ}$ -th node in the ξ-th hidden layer
$w_{l_{ξ}}^{o}$	Connection weight between the $l_{ξ}$ -th node in the ξ-th hidden layer and the output node
$x_{ti}$	The i-th factor of example t
${\bar{x}}_{(k)}$	Centroid of cluster k
$y_{t}$	Revenue at period t; t = 1 ~ T
${\hat{y}}_{t}$	Predicted revenue at period t; t = 1 ~ T

Feature Selection

In the fuzzy big data analytics approach, two of the most widely used feature selection techniques, the RF method with RFE (Abedinia et al., 2017; Darst et al., 2018; Park & Kim, 2020; Ustebay et al., 2018) and correlation analysis (Gogtay & Thatte, 2017), are jointly applied. The former is a post-selection method that selects features to optimize prediction performance, while the latter is a pre-selection method that selects features based on their values.

First, bootstrap sampling (i.e., random sampling with replacement) is used to randomly select samples from the collected data. Due to replacement after random sampling, some data may be selected more than once and some will never be selected. The latter are called out-of-bag (OOB) data. Random samples are used to build/train a forest of multiple decision trees to predict the revenue of the catering industry. The trained decision trees are then applied to make predictions for the OOB data. The predictions produced by all decision trees are averaged, and on this basis the prediction performance is evaluated in terms of mean squared error (MSE) as

MSE = \frac{\sum_{t = 1}^{T} {(y_{t} - {\hat{y}}_{t})}^{2}}{T}

(1)

where $y_{t}$ and ${\hat{y}}_{t}$ indicate the revenue at period t and the predicted revenue, respectively; t = 1 ∼ T. Trial and error can be used to determine the number of trees, that is, adding one tree at a time and choosing the number that gives the best prediction performance. Furthermore, the number of factors (input variables) is determined by removing, each time, the factor that is least important to the prediction performance. The importance of factors can be assessed in a number of ways (Płoński, 2020), for example, by observing the increase in MSE after removing a factor. The RFE deletes factors one by one until the minimum number of factors is reached.
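The bootstrap and OOB mechanism, together with the MSE of Equation (1), can be illustrated with a short sketch. It only shows how the in-bag and OOB indices arise for one tree; the tree-building step is omitted, and the function names, the seed, and the sample size of 21 are illustrative assumptions.

```python
import random


def bootstrap_split(n_samples, rng):
    """Draw n_samples indices with replacement; unselected indices are out-of-bag."""
    in_bag = [rng.randrange(n_samples) for _ in range(n_samples)]
    chosen = set(in_bag)
    out_of_bag = [i for i in range(n_samples) if i not in chosen]
    return in_bag, out_of_bag


def mse(actual, predicted):
    """Mean squared error, Equation (1)."""
    return sum((y - p) ** 2 for y, p in zip(actual, predicted)) / len(actual)


rng = random.Random(42)                     # fixed seed so the sketch is repeatable
in_bag, out_of_bag = bootstrap_split(21, rng)
```

Each tree in the forest would be trained on its own `in_bag` indices and evaluated on its own `out_of_bag` indices, so no separate validation set is needed to estimate the MSE.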

RFE can be combined with cross-validation methods, such as k-fold cross-validation or leave-one-out cross-validation (LOOCV) (Wong, 2015), to reliably assess prediction performance. In the proposed methodology, LOOCV is applied because the collected data will be divided into clusters. Some clusters may be very small. In this way, only one sample is left as validation data, and the rest are used to train/build decision trees. The training and validation process is repeated until all samples have been used as validation data.
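The interplay between recursive elimination and LOOCV can be sketched as below. Two simplifications are assumed and should be read as such: a nearest-neighbour predictor stands in for the random forest so that the example stays self-contained, and the importance of a factor is taken to be the LOOCV MSE obtained after removing it, which is one of the options mentioned above. All names and the toy data are hypothetical.

```python
def loocv_mse(X, y, features, predict):
    """Leave-one-out MSE of `predict` using only the listed feature indices."""
    squared_errors = []
    for i in range(len(X)):
        train_X = [[row[f] for f in features] for j, row in enumerate(X) if j != i]
        train_y = [v for j, v in enumerate(y) if j != i]
        held_out = [X[i][f] for f in features]
        squared_errors.append((y[i] - predict(train_X, train_y, held_out)) ** 2)
    return sum(squared_errors) / len(squared_errors)


def nearest_neighbour(train_X, train_y, x):
    """Stand-in predictor (NOT a random forest): target of the closest training row."""
    dists = [sum((a - b) ** 2 for a, b in zip(row, x)) for row in train_X]
    return train_y[dists.index(min(dists))]


def recursive_elimination(X, y, predict, min_features=1):
    """Drop the least important feature each round; keep the best subset seen."""
    features = list(range(len(X[0])))
    best_mse, best_features = loocv_mse(X, y, features, predict), list(features)
    while len(features) > min_features:
        # LOOCV MSE after removing each remaining feature in turn.
        mse_without = {
            f: loocv_mse(X, y, [g for g in features if g != f], predict)
            for f in features
        }
        # The feature whose removal hurts least is the least important one.
        least_important = min(mse_without, key=mse_without.get)
        features.remove(least_important)
        if mse_without[least_important] < best_mse:
            best_mse, best_features = mse_without[least_important], list(features)
    return best_features, best_mse


# Toy data: the target depends on feature 0 only; feature 1 is unrelated.
X = [[0, 5], [1, 1], [2, 7], [3, 3], [4, 0], [5, 6], [6, 2], [7, 4]]
y = [2 * row[0] for row in X]
selected, score = recursive_elimination(X, y, nearest_neighbour)
```

Because every sample serves once as validation data, the procedure remains usable when only a couple of dozen observations are available, at the cost of refitting the model once per sample and per candidate subset.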

Subsequently, the appropriateness of factors selected using the RF method with RFE is confirmed by performing a correlation analysis between the revenue of the catering industry and each of the selected factors.
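The confirmatory correlation analysis can be sketched with a plain implementation of the Pearson product-moment correlation coefficient. The function name and the sample series are illustrative assumptions and do not reproduce the collected data.

```python
import math


def pearson(x, y):
    """Pearson product-moment correlation coefficient between two series."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    covariance = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return covariance / (spread_x * spread_y)


# Illustrative series: a factor that moves with revenue and one that moves against it.
revenue = [10.0, 12.0, 14.0, 16.0]
factor_up = [1.0, 2.0, 3.0, 4.0]
factor_down = [8.0, 6.0, 4.0, 2.0]
```

A coefficient close to +1 or −1 for a selected factor supports keeping it, whereas a coefficient near zero would suggest that the RF-based importance reflects a nonlinear or interaction effect rather than a direct linear relationship.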

Pre-classification

FCM is applied in the proposed methodology to pre-classify the collected data. FCM classifies collected data by minimizing the following objective function (T. Chen, 2011; Guha & Veeranjaneyulu, 2019; Wu & Chen, 2015):

Min J_{m} = \sum_{k = 1}^{K} \sum_{t = 1}^{T} (μ_{t (k)}^{m} e_{t (k)}^{2})

(2)

where K is the required number of clusters; $μ_{t (k)}$ (written as $μ_{tk}$ in Table 1) represents the membership of example t belonging to cluster k; $e_{t (k)}$ (written as $e_{tk}$ in Table 1) measures the distance from example t to the centroid of cluster k; m∈(1, ∞) is a parameter to increase or decrease the fuzziness. With a higher value of m, the results will become fuzzier. For normal data, m is usually set to 2.0. In FCM, all factors are equally important when classifying the collected data.

The objective function can be optimized according to the following procedure (T. C. T. Chen & Honda, 2020):

Step 1. Generate initial classification results.

Step 2. (Iterations) Obtain the centroid of each cluster as

{\bar{x}}_{(k) i} = \frac{\sum_{t = 1}^{T} (μ_{t (k)}^{m} x_{ti})}{\sum_{t = 1}^{T} μ_{t (k)}^{m}}

(3)

where

μ_{t (k)} = \frac{1}{\sum_{l = 1}^{K} {(\frac{e_{t (k)}}{e_{t (l)}})}^{\frac{2}{m - 1}}}

(4)

e_{t (k)} = \sqrt{\sum_{i = 1}^{n} {(x_{ti} - {\bar{x}}_{(k) i})}^{2}}

(5)

where $x_{ti}$ indicates the i-th factor of example t; $\{ {\bar{x}}_{(k) i} \mid i = 1 \sim n \}$ is the centroid of cluster k.

Step 3. Re-measure the distance of each example to the centroid of every cluster, then recalculate the membership.

Step 4. Stop if the following condition is met. Otherwise, return to Step 2:

max_{k} max_{t} | μ_{t (k)}^{(r)} - μ_{t (k)}^{(r - 1)} | < d

(6)

where $μ_{t (k)}^{(r)}$ is the membership that example t belongs to cluster k after the r-th iteration; d is a real number representing the threshold of membership convergence.

In this study, the optimal number of clusters is determined by varying the number of clusters to maximize prediction accuracy.
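The iterative procedure in Equations (3) to (6) can be sketched as follows. This is a minimal pure-Python illustration under stated assumptions: the initial memberships are drawn at random, distances are floored at a small constant to avoid division by zero, and the function name, default values, and toy data are hypothetical.

```python
import math
import random


def fuzzy_c_means(X, K, m=2.0, d=1e-5, max_iter=300, seed=0):
    """Return (memberships, centroids) for the examples in X."""
    rng = random.Random(seed)
    T, n = len(X), len(X[0])
    # Step 1: random initial memberships, each row normalized to sum to 1.
    mu = []
    for _ in range(T):
        row = [rng.random() + 1e-9 for _ in range(K)]
        total = sum(row)
        mu.append([v / total for v in row])
    centroids = []
    for _ in range(max_iter):
        # Step 2, Equation (3): membership-weighted centroids.
        centroids = []
        for k in range(K):
            w = [mu[t][k] ** m for t in range(T)]
            w_sum = sum(w)
            centroids.append([sum(w[t] * X[t][i] for t in range(T)) / w_sum
                              for i in range(n)])
        # Step 3, Equations (4) and (5): distances, then updated memberships.
        exponent = 2.0 / (m - 1.0)
        new_mu = []
        for t in range(T):
            e = [max(math.dist(X[t], c), 1e-12) for c in centroids]
            new_mu.append([1.0 / sum((e[k] / e[l]) ** exponent for l in range(K))
                           for k in range(K)])
        # Step 4, Equation (6): stop when the largest membership change is below d.
        change = max(abs(new_mu[t][k] - mu[t][k]) for t in range(T) for k in range(K))
        mu = new_mu
        if change < d:
            break
    return mu, centroids


# Toy data with two well-separated groups (not the collected data).
X = [[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]]
memberships, centroids = fuzzy_c_means(X, K=2)
labels = [row.index(max(row)) for row in memberships]
```

The `labels` line mirrors the convention of assigning each example to the cluster with the highest membership when cluster sizes are counted, while the full membership rows are retained for the later aggregation step.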

Prediction

A DNN is built to predict the revenue of the catering industry for each cluster. These DNNs can be configured differently. Each DNN has three to five layers: an input layer, one to three hidden layers, and an output layer, as shown in Figure 4.

Figure 4.

Architecture of the DNN.

Inputs to the DNN are the factors related to predicting the revenue of the catering industry in period t: { $x_{ti}$ | i = 1 ∼ n}, which are propagated through the DNN as follows. First, from the input layer to the first hidden layer, the following operations are performed:

I_{t l_{1}}^{h (1)} = \sum_{i = 1}^{n} (w_{i l_{1}}^{h (1)} \cdot x_{ti})

(7)

n_{t l_{1}}^{h (1)} = I_{t l_{1}}^{h (1)} - θ_{l_{1}}^{h (1)}

(8)

h_{t l_{1}}^{(1)} = \frac{1}{1 + e^{- n_{t l_{1}}^{h (1)}}}

(9)

Between two consecutive hidden layers, the following operations are performed:

I_{t l_{ξ}}^{h (ξ)} = \sum_{l_{ξ - 1} = 1}^{L_{ξ - 1}} (w_{l_{ξ - 1} l_{ξ}}^{h (ξ)} \cdot h_{t l_{ξ - 1}}^{(ξ - 1)}); ξ = 2 ~ 3

(10)

n_{t l_{ξ}}^{h (ξ)} = I_{t l_{ξ}}^{h (ξ)} - θ_{l_{ξ}}^{h (ξ)}; ξ = 2 ~ 3

(11)

h_{t l_{ξ}}^{(ξ)} = \frac{1}{1 + e^{- n_{t l_{ξ}}^{h (ξ)}}}; ξ = 2 ~ 3

(12)

Outputs from the last hidden layer are aggregated at the output node:

I_{t}^{o} = \sum_{l_{3} = 1}^{L_{3}} (w_{l_{3}}^{o} \cdot h_{t l_{3}}^{(3)}),

(13)

then the output is

o_{t} = \frac{1}{1 + e^{- n_{t}^{o}}},

(14)

where

n_{t}^{o} = I_{t}^{o} - θ^{o} .

(15)

$o_{t}$ is unnormalized as

U (o_{t}) = o_{t} (max_{s} y_{s} - min_{s} y_{s}) + min_{s} y_{s} .

(16)

To determine the values of network parameters, the DNN is trained using the Levenberg–Marquardt (LM) algorithm (Suzuki, 2011).
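The forward propagation in Equations (7) to (15) can be sketched as below. The sketch covers inference only; training with the LM algorithm is not shown. The data layout (one weight list per node), the function names, and the rescaling bounds `lower` and `upper`, assumed to be the bounds used when the revenue was normalized to [0, 1], are illustrative assumptions.

```python
import math


def sigmoid(z):
    """Logistic activation used in Equations (9), (12), and (14)."""
    return 1.0 / (1.0 + math.exp(-z))


def forward(x, hidden_layers, output_layer):
    """Propagate one example through the network.

    hidden_layers: list of (weights, thresholds) pairs, one per hidden layer,
                   where weights[l] holds the incoming weights of node l.
    output_layer:  (weights, threshold) of the single output node.
    """
    h = x
    for weights, thresholds in hidden_layers:
        # Weighted sum minus threshold, then sigmoid (Equations (7)-(12)).
        h = [sigmoid(sum(w * v for w, v in zip(node_w, h)) - theta)
             for node_w, theta in zip(weights, thresholds)]
    out_w, out_theta = output_layer
    # Equations (13)-(15): aggregate the last hidden layer at the output node.
    return sigmoid(sum(w * v for w, v in zip(out_w, h)) - out_theta)


def unnormalize(o, lower, upper):
    """Map a normalized output in [0, 1] back to the original scale."""
    return o * (upper - lower) + lower


# One hidden layer with three nodes and all-zero weights: every node outputs 0.5.
hidden = [([[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]], [0.0, 0.0, 0.0])]
output = ([1.0, 1.0, 1.0], 1.5)
o = forward([0.2, 0.4], hidden, output)
```

Because the number of hidden layers is simply the length of `hidden_layers`, the same function covers the one-, two-, and three-hidden-layer configurations considered for each cluster.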

Aggregation

The DNNs of all clusters can be applied to predict the revenue of the catering industry. Let the revenue predicted by the k-th DNN be denoted by $o_{t} (k)$ . The membership of period t in cluster k is $μ_{tk}$ . Then, the weighted sum method is applied to aggregate the predictions of all DNNs:

o_{t} (aggregated) = \sum_{k = 1}^{K} (μ_{tk} o_{t} (k))

(17)
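Equation (17) can be illustrated with a short sketch. How the membership of a new period is obtained is not spelled out above; the sketch assumes it is computed from the distances to the cluster centroids using Equation (4). The function names and the numbers are hypothetical.

```python
import math


def memberships(x, centroids, m=2.0):
    """Membership of example x in each cluster, following Equation (4)."""
    e = [max(math.dist(x, c), 1e-12) for c in centroids]   # floor avoids division by zero
    exponent = 2.0 / (m - 1.0)
    return [1.0 / sum((e_k / e_l) ** exponent for e_l in e) for e_k in e]


def aggregate(mu, predictions):
    """Weighted sum of the cluster-wise predictions, Equation (17)."""
    return sum(w * o for w, o in zip(mu, predictions))


# An example halfway between two centroids receives equal memberships.
mu = memberships([1.0, 0.0], [[0.0, 0.0], [2.0, 0.0]])
forecast = aggregate(mu, [100.0, 200.0])
```

Since the memberships from Equation (4) sum to one, the weighted sum behaves as a weighted average, so the aggregated forecast always lies between the smallest and largest cluster-wise predictions.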

Case Study

Background

The proposed methodology was applied to forecast the monthly revenue of the Taiwanese catering industry during the COVID-19 pandemic, that is, from January 2020 to January 2022. Therefore, the data collected includes monthly revenues for 25 months.

Factors that could impact the revenue of the catering industry during the COVID-19 pandemic were used as inputs to the proposed methodology. As shown in Table 2, there are a total of 23 input variables, named X1 to X23, including government statistics on the catering industry and COVID-19 statistics. In this study, 85% of the collected data was used as training set and the rest was reserved for testing/evaluation.

Table 2.

Factors That May Affect the Revenue of the Catering Industry During the COVID-19 Pandemic.

Symbol	Factor
X1	Newly confirmed COVID-19 cases
X2	New death count
X3	Cumulative number of people that have received their first dose of COVID-19 vaccine
X4	Cumulative number of people that have received their second dose of COVID-19 vaccine
X5	Cumulative number of releases from isolation
X6	Number of new releases from isolation
X7	Epidemic alert level
X8	Number of transactions processed by the credit card processing center
X9	Gross monthly income per capita in the catering industry (NTD)
X10	Number of employees employed in the catering industry
X11	Gross monthly income per capita (NTD)
X12	Total working hours
X13	Total number of restaurants
X14	Unemployment rate
X15	Consumer price index (NTD)
X16	Import price basic sub-index (NTD)
X17	Export price basic sub-index (NTD)
X18	Number of visitors to Taiwan
X19	Prosperity leading indicator
X20	Wholesale price index (NTD)
X21	Prosperity coincident indicator
X22	Prosperity lagging indicator
X23	Prosperity countermeasure signal

Application of the Proposed Methodology: Model Building

The importance of each factor was first assessed using RFs. As can be seen from Figure 5, the importance of gross monthly income per capita in the catering industry (X9) was the highest, while the importance of the number of people that have received the second dose of COVID-19 vaccine (X4) was the lowest. The former was 98 times more important than the latter.

Figure 5.

Importance of each factor evaluated using the RF method.

Then, LOOCV was used for cross-validation, and the factors with the lowest importance level were excluded one by one. After eliminating a factor, the prediction accuracy was re-evaluated. Finally, the number of factors that achieved the highest prediction accuracy was taken, as shown in Figure 6. In this case, six factors were used as input to the DNNs, as shown in Table 3.

Figure 6.

Determining the optimal number of factors.

Table 3.

Factors Used as Inputs to the DNNs.

Symbol	Factor	Importance
X9	Gross monthly income per capita in the catering industry (NTD)	5.68× $10^{13}$
X14	Unemployment rate	3.83× $10^{14}$
X2	New death count	2.55× $10^{14}$
X11	Gross monthly income per capita (NTD)	2.06× $10^{14}$
X6	Number of new releases from isolation	2.14× $10^{14}$
X12	Total working hours	2.43× $10^{14}$

To confirm whether these six factors do have an impact on the revenue of the catering industry, the Pearson product-moment correlation coefficient between each factor and revenue was calculated, as shown in Table 4. Clearly, these six factors were highly correlated with the revenue of the catering industry.

Table 4.

Pearson Product-Moment Correlation Coefficient Between Each Factor and the Revenue.

Factor	Pearson product-moment correlation coefficient	Interpretation
X9	.677	Highly positively correlated
X14	−.772	Highly negatively correlated
X2	−.631	Highly negatively correlated
X11	.509	Highly positively correlated
X6	−.590	Highly negatively correlated
X12	.654	Highly positively correlated

The collected data was then divided into two to four clusters using FCM. The clustering results are summarized in Figure 7. Each example belongs to all clusters, but with different memberships. To count the number of examples in each cluster, each example was assigned to the cluster in which it had the highest membership. The results are summarized in Table 5. The optimal number of clusters was determined by changing the number of clusters to optimize prediction accuracy.

Figure 7.

Clustering results: (a) two clusters, (b) three clusters, and (c) four clusters.

Table 5.

Number of Examples in Each Cluster.

No. of examples.	2 Clusters	3 Clusters	4 Clusters
Cluster #1	17	17	10
Cluster #2	4	2	2
Cluster #3	-	2	1
Cluster #4	-	-	8

For each cluster, a DNN was constructed to predict the revenue of the catering industry from the values of six factors based on the training data belonging to that cluster. The training algorithm was the LM algorithm. The configuration of the DNN was optimized by varying the number of hidden layers and the nodes in these layers to minimize RMSE and MAE. For example, the collected data was divided into two clusters, and for the first cluster, a DNN with three hidden layers was constructed. Then, the numbers of nodes in the hidden layers were changed to optimize the prediction performance. The results are summarized in Table 6. Finally, the optimal numbers of nodes in the hidden layers were 6, 4, and 5, respectively. The optimal configuration of the DNN is shown in Figure 8.

Table 6.

Number of Nodes in Each Hidden Layer and the Prediction Performance.

No. of nodes in hidden layer #1	No. of nodes in hidden layer #2	No. of nodes in hidden layer #3	RMSE	MAE
1	1	1	0.1683	0.1345
		2	0.1660	0.1325
		3	0.1442	0.1161
		4	0.1688	0.1343
		5	0.1612	0.1289
		6	0.1376	0.1120
	2	1	0.1413	0.1155
		2	0.1469	0.1172
		3	0.1551	0.1237
		4	0.1359	0.1112
		5	0.1408	0.1167
		6	0.1339	0.1094
⋮	⋮	⋮	⋮	⋮
6	4	1	0.1334	0.1078
		2	0.1351	0.1097
		3	0.1244	0.1005
		4	0.1182	0.0948
		5	0.1109	0.0902
		6	0.1210	0.0972
	5	1	0.1276	0.1044
		2	0.1196	0.0966
		3	0.1200	0.0980
		4	0.1183	0.0949
		5	0.1224	0.0988
		6	0.1304	0.1030
	6	1	0.1275	0.1036
		2	0.1233	0.0992
		3	0.1282	0.1040
		4	0.1224	0.0980
		5	0.1190	0.0948
		6	0.1192	0.0959

Figure 8.

Optimal configuration of the DNN.
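The search over hidden-layer sizes can be sketched as an exhaustive enumeration. The range of one to six nodes per hidden layer is inferred from Table 6 and is an assumption, as are the function names. In place of actually training a DNN, the evaluation function below simply looks up a few RMSE values copied from Table 6 and treats every other configuration as unevaluated.

```python
from itertools import product


def best_configuration(evaluate, max_nodes=6, n_hidden=3):
    """Try every combination of node counts and keep the one with the lowest RMSE."""
    best_cfg, best_rmse = None, float("inf")
    for cfg in product(range(1, max_nodes + 1), repeat=n_hidden):
        score = evaluate(cfg)
        if score < best_rmse:
            best_cfg, best_rmse = cfg, score
    return best_cfg, best_rmse


# Stand-in for "train a DNN with this configuration and return its RMSE":
# a few rows of Table 6; configurations not listed here count as unevaluated.
table6_rmse = {
    (1, 1, 1): 0.1683,
    (6, 4, 4): 0.1182,
    (6, 4, 5): 0.1109,
    (6, 4, 6): 0.1210,
    (6, 5, 2): 0.1196,
}
best_cfg, best_rmse = best_configuration(lambda cfg: table6_rmse.get(cfg, float("inf")))
```

With three hidden layers and six candidate sizes per layer the enumeration covers 216 configurations, which is affordable for a data set of this size but would call for a coarser or randomized search on larger problems.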

Application of the Proposed Methodology: Evaluation

The trained DNNs of all clusters were then used to predict the revenue of the catering industry for the test data, and the predictions of all DNNs were aggregated using the weighted sum method. The predicted results are summarized in Figure 9.

Figure 9.

Forecasting results.

Discussion

Based on the experimental results, the following observations were made:

Among factors reflecting health risks, the number of newly diagnosed COVID-19 cases was far less important than the number of new deaths. This was not surprising, since the former was distorted by government manipulation of such cases and the need for people to evade PCR testing to avoid post-diagnosis isolation.

The accuracy of predicting the revenue of the catering industry in Taiwan using the proposed methodology was evaluated as

MAE = 1,756,984 (NTD)

MAPE = 2.9%

RMSE = 1,907,823 (NTD)

The MAPE using the proposed methodology was only 2.9%, showing very good prediction accuracy. Therefore, the revenue forecast provided a reliable basis for the government and practitioners to take relevant actions.

As expected, high unemployment and new deaths during the COVID-19 pandemic led to a lower revenue in December 2021.

The effects of big data analytics techniques, such as feature selection and data clustering, on prediction performance were also analyzed. The results are summarized in Table 7.

To further elaborate the effectiveness of the proposed methodology, several existing methods including multiple linear regression (MLR) (Su, 2020; Yang et al., 2020), RF (Tanizaki et al., 2021), RFE+RF (Park & Kim, 2020; Ustebay et al., 2018), DNN (Sun et al., 2021), and RFE+DNN (Abedinia et al., 2017; Darst et al., 2018; Lin et al., 2019; Suganya & Shanthi, 2012; Wang & Chen, 2019) were also applied to the collected data for comparison. The configuration of each DNN has been optimized. Table 8 compares the prediction performances using various methods. When the collected data was divided into two clusters and each DNN had a single hidden layer, the prediction accuracy measured by MAE, MAPE or RMSE was optimized using the proposed methodology. Compared with the baseline method, MLR, the proposed methodology reduced the prediction error by up to about 90% (in terms of MAPE).
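The size of the improvement over the MLR baseline can be checked with a short calculation on the figures reported in Table 8 for RFE + MLR and for RFE + FCM (2 clusters) + DNN (1 hidden layer). The function name is hypothetical, and the relative reduction is assumed to be measured against the baseline value.

```python
def relative_reduction(baseline, proposed):
    """Percentage reduction of an error measure relative to the baseline."""
    return 100.0 * (baseline - proposed) / baseline


# (baseline, proposed) pairs taken from Table 8.
table8 = {
    "RMSE": (1.83e7, 1.91e6),
    "MAE": (1.36e7, 1.76e6),
    "MAPE": (28.96, 2.87),
}
reductions = {name: round(relative_reduction(b, p), 1) for name, (b, p) in table8.items()}
```

The reduction is largest for MAPE and slightly smaller for RMSE and MAE, which is why the improvement is stated as an upper bound rather than a single figure.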

Figure 10 shows the prediction accuracy of RF, measured by RMSE, as the number of trees varied. The prediction accuracy did not always improve with the number of trees. The best accuracy was obtained with 7 trees, giving a minimum RMSE of 7.75 × 10⁶, which was still worse than that of the proposed methodology.

Table 7.

Effects of Big Data Analytics Techniques on Prediction Performance (2 Clusters and 2 Hidden Layers).

Applied big data analytics techniques	RMSE (NTD)	MAE (NTD)	MAPE
None	$4.73 \times 10^{6}$	$3.40 \times 10^{6}$	7.53%
Feature selection	$4.05 \times 10^{6}$	$3.52 \times 10^{6}$	6.78%
Feature selection + data clustering	$3.99 \times 10^{6}$	$3.36 \times 10^{6}$	5.34%

Table 8.

Comparison of the Prediction Performance Using Various Methods.

Forecasting method	RMSE (NTD)	MAE (NTD)	MAPE
RFE + Multiple linear regression	$1.83 \times 10^{7}$	$1.36 \times 10^{7}$	28.96%
RF	$7.67 \times 10^{6}$	$7.20 \times 10^{6}$	13.94%
RFE + RF	$6.50 \times 10^{6}$	$5.34 \times 10^{6}$	11.13%
DNN (1 hidden layer)	$1.09 \times 10^{7}$	$9.60 \times 10^{6}$	16.80%
DNN (2 hidden layers)	$4.73 \times 10^{6}$	$3.40 \times 10^{6}$	7.53%
DNN (3 hidden layers)	$5.09 \times 10^{6}$	$4.27 \times 10^{6}$	8.52%
RFE + DNN (1 hidden layer)	$7.78 \times 10^{6}$	$4.55 \times 10^{6}$	6.16%
RFE + DNN (2 hidden layers)	$4.05 \times 10^{6}$	$3.52 \times 10^{6}$	6.78%
RFE + DNN (3 hidden layers)	$5.25 \times 10^{6}$	$4.46 \times 10^{6}$	7.23%
RFE + FCM (2 clusters) + DNN (1 hidden layer)	$1.91 \times 10^{6}$	$1.76 \times 10^{6}$	2.87%
RFE + FCM (2 clusters) + DNN (2 hidden layers)	$3.99 \times 10^{6}$	$3.36 \times 10^{6}$	5.34%
RFE + FCM (2 clusters) + DNN (3 hidden layers)	$6.80 \times 10^{6}$	$6.31 \times 10^{6}$	11.43%
RFE + FCM (3 clusters) + DNN (1 hidden layer)	$2.13 \times 10^{6}$	$1.90 \times 10^{6}$	2.87%
RFE + FCM (3 clusters) + DNN (2 hidden layers)	$3.65 \times 10^{6}$	$2.78 \times 10^{6}$	6.11%
RFE + FCM (3 clusters) + DNN (3 hidden layers)	$5.24 \times 10^{6}$	$4.50 \times 10^{6}$	9.19%

Figure 10.

Prediction accuracy using RF when the number of trees varied.

Conclusions

In mid-May 2021, Taiwan issued a level 3 pandemic alert due to the COVID-19 pandemic, which was not officially lifted until July 26. During the more than 2-month-long lockdown, the catering industry was banned from offering indoor dining, and foreign tourists were few and far between. As a result, people went out less and the catering industry saw a severe drop in revenue, forcing many restaurants to close. Predicting the revenue of the catering industry during the COVID-19 pandemic will not only allow practitioners to adjust their business strategies, but also provide a reference for governments to formulate relief measures. To this end, this study proposes a fuzzy big data analytics approach. The proposed methodology combines RF, RFE, and FCM to select relevant factors and cluster the collected data, and then builds a DNN for each cluster to make predictions.

The effectiveness of the fuzzy big data analytics approach was tested using real data from January 2020 to January 2022. According to the experimental results,

Compared with the baseline MLR method, the fuzzy big data analytics approach improved the prediction accuracy by up to 90% in terms of MAE, MAPE, and RMSE.

Compared with other big data analytics methods, the proposed methodology also enhanced the prediction performance by reducing the RMSE by 16%.

In theory, a DNN with more hidden layers can better fit a more complex nonlinear function and thereby improve prediction performance. The experimental results did not bear this out: as Table 8 shows, the proposed methodology performed best when each DNN had a single hidden layer, and adding hidden layers increased the RMSE, MAE, and MAPE.

Likewise, dividing the collected data into two clusters gave the best prediction accuracy. More clusters did not necessarily improve the prediction performance.

The following managerial implications are derived from the experimental results:

After the outbreak of COVID-19, the factors with the greatest impact on the revenue of the catering industry in Taiwan were gross monthly income per capita in the catering industry, unemployment rate, new deaths, gross monthly income per capita, the number of new releases from isolation, and total working hours. Many of these are related to the health risks posed by COVID-19. The government and enterprises should try to mitigate the related shocks.

The number of new deaths was far more important than the number of newly confirmed COVID-19 cases, as the latter was subject to government manipulation and to people evading PCR testing to avoid post-diagnosis isolation.

The proposed methodology can be applied to predict another index of the catering industry or another industry that also suffers from the COVID-19 pandemic. Additionally, factors affecting the revenue of the catering industry are likely to continue to change as COVID-19 develops. The same analysis needs to be done again in the near future.

Acronyms

Acronym	Meaning
ANFIS	Adaptive network-based fuzzy inference system
ANN	Artificial neural network
ARIMA	Autoregressive integrated moving average
CART	Classification and regression tree
COVID	Coronavirus disease
DNN	Deep neural network
DRAM	Dynamic random access memory
FCM	Fuzzy c-means
GBR	Gradient boosting regression
kM	k-means
LM	Levenberg–Marquardt
LOOCV	Leave-one-out cross-validation
LSTM	Long short-term memory
MAE	Mean absolute error
MAPE	Mean absolute percentage error
MSE	Mean squared error
NTD	New Taiwan dollars
OOB	Out-of-bag
PCA	Principal component analysis
PCR	Polymerase chain reaction
RF	Random forest
RFE	Recursive feature elimination
RMSE	Root mean squared error
SARIMA	Seasonal autoregressive integrated moving average
SOM	Self-organizing map
SVM	Support vector machine
TFN	Triangular fuzzy number
TrFN	Trapezoidal fuzzy number

Footnotes

Acknowledgements

Not available.

Author Contributions

All authors contributed equally to the writing of this paper.

Ethical Approval

Not required.

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

Guarantor

Not required.

ORCID iD

Tin-Chih Toly Chen

Data Availability Statement

Data cannot be shared openly but are available on request from authors.

References

Abedinia, Amjady, & Zareipour (2017). A new feature selection technique for load and price forecast of electrical power systems. IEEE Transactions on Power Systems, 32(1), 62–74.

Aggrawal & Pal (2020). Sequential feature selection and machine learning algorithm-based patient’s death events prediction and diagnosis in heart disease. Computer Science, 1(6), 344.

Alvisi & Franchini (2011). Fuzzy neural networks for water level and discharge forecasting with uncertainty. Environmental Modelling & Software, 26(4), 523–537.

Chen (2011). Applying the hybrid fuzzy c-means-back propagation network approach to forecast the effective cost per die of a semiconductor product. Computers & Industrial Engineering, 61(3), 752–759.

Chen & Chiu, M. C. (2021). An interval fuzzy number-based fuzzy collaborative forecasting approach for DRAM yield forecasting. Complex & Intelligent Systems, 7, 111–122.

Chen & Chiu, M. C. (2022). A fuzzy collaborative intelligence approach to group decision-making: A case study of post-COVID-19 restaurant transformation. Cognitive Computation, 14(2), 531–546.

Chen, T. C. T. (2022a). Big data analytics for semiconductor manufacturing. In Production planning and control in semiconductor manufacturing: Big data analytics and Industry 4.0 applications (pp. 1–19). Springer International Publishing.

Chen, T. C. T. (2022b). Cycle time prediction and output projection. In Production planning and control in semiconductor manufacturing: Big data analytics and Industry 4.0 applications (pp. 41–62). Springer International Publishing.

Chen, T. C. T., & Honda (2020). Fuzzy collaborative forecasting and clustering: Methodology, system architecture, and applications. Springer International Publishing.

Chen, T. C. T., Wang, Y. C., & Lin, C. W. (2020). A fuzzy collaborative forecasting approach considering experts’ unequal levels of authority. Applied Soft Computing, 94, 106455.

Chen & Lin, Y. C. (2008). A fuzzy-neural system incorporating unequally important expert opinions for semiconductor yield forecasting. International Journal of Uncertainty Fuzziness and Knowledge-Based Systems, 16(01), 35–58.

Chen, T. T. C., H. C., & Hsu, K. W. (2022). A fuzzy analytic hierarchy process-enhanced fuzzy geometric mean-fuzzy technique for order preference by similarity to ideal solution approach for suitable hotel recommendation amid the COVID-19 pandemic. Digital Health, 8, 20552076221084457.

Chen & Wang, Y. C. (2016). Estimating simulation workload in cloud manufacturing using a classifying artificial neural network ensemble approach. Robotics and Computer-Integrated Manufacturing, 38, 42–51.

Chen & Wang, Y. C. (2019). An advanced fuzzy approach for modeling the yield improvement of making aircraft parts using 3D printing. International Journal of Advanced Manufacturing Technology, 105, 4085–4095.

Chen, Wang, Y. C., & Chiu, M. C. (2021). A type-II fuzzy collaborative forecasting approach for productivity forecasting under an uncertainty environment. Journal of Ambient Intelligence and Humanized Computing, 12, 2751–2763.

Chiu, M.-C., Chen, T. C. T., & Hsu, K.-W. (2020). Modeling an uncertain productivity learning process using an interval fuzzy methodology. Mathematics, 8, 998.

Chiu, M. C., & Chen, T. T. (2022). A ubiquitous healthcare system of 3D printing facilities for making dentures: Application of type-II fuzzy logic. Digital Health, 8, 20552076221092540.

Darst, B. F., Malecki, K. C., & Engelman, C. D. (2018). Using recursive feature elimination in random forest to account for correlated variables in high dimensional data. BMC Genetics, 19(Suppl 1), 65–66.

De Silva, Enticott, Barton, Forbes, Saha, & Nikam (2021). Use and performance of machine learning models for type 2 diabetes prediction in clinical and community care settings: Protocol for a systematic review and meta-analysis of predictive modeling studies. Digital Health, 7, 20552076211047390.

Fernández, del Río, Chawla, N. V., & Herrera (2017). An insight into imbalanced big data classification: Outcomes and challenges. Complex & Intelligent Systems, 3, 105–120.

Gogtay, N. J., & Thatte, U. M. (2017). Principles of correlation analysis. Journal of the Association of Physicians of India, 65(3), 78–81.

Goodman, T. H., Neamtiu, Shroff, & White, H. D. (2014). Management forecast quality and capital investment decisions. Accounting Review, 89(1), 331–365.

Guha & Veeranjaneyulu (2019). Prediction of bankruptcy using big data analytics based on fuzzy c-means algorithm. IAES International Journal of Artificial Intelligence (IJ-AI), 8(2), 168–174.

Hadjimichael, Kuciauskas, A. P., Tag, P. M., Bankert, R. L., & Peak, J. E. (2002). A meteorological fuzzy expert system incorporating subjective user input. Knowledge and Information Systems, 4, 350–369.

Hassani & Silva, E. S. (2015). Forecasting with big data: A review. Annals of Data Science, 2, 5–19.

Hornstein, Forman-Hoffman, Nazander, Ranta, & Hilbert (2021). Predicting therapy outcome in a digital mental health intervention for depression and anxiety: A machine learning approach. Digital Health, 7, 20552076211060659.

Khemavuk & Leenatham (2021). A conceptual model for uncertainty demand forecasting by artificial neural network and adaptive neuro-fuzzy inference system based on quantitative and qualitative data. International Journal of Operations and Quantitative Management, 26(4), 285–302.

Lin, Y. C., & Chen (2019). An advanced fuzzy collaborative intelligence approach for fitting the uncertain unit cost learning process. Complex & Intelligent Systems, 5, 303–313.

Lin, Y. C., & Chen, T. C. T. (2022). An intelligent system for assisting personalized COVID-19 vaccination location selection: Taiwan as an example. Digital Health, 8, 20552076221109062.

Lin, Y. C., Wang, Y. C., Chen, T. C. T., & Lin, H. F. (2019). Evaluating the suitability of a smart technology application for fall detection using a fuzzy collaborative intelligence approach. Mathematics, 7(11), 1097.

(2020). Impact of the “NCP” on the catering industry through the SARS perspective based on the perspective of big data [Conference session]. International Conference on Applications and Techniques in Cyber Security and Intelligence (pp. 429–434).

Liu, Ren, Choi, T. M., Hui, C. L., & S. F. (2013). Sales forecasting for fashion retailing service industry: A review. Mathematical Problems in Engineering, 2013, 738675.

Mayurnikova, L. A., Krapiva, T. V., Davydenko, N. I., & Samoylenko, K. V. (2015). Analysis and prospects of catering market in regions. Food Processing: Techniques and Technology, 1, 141–147.

Ministry of Finance. (2022). Financial statistics database query. https://web02.mof.gov.tw/njswww/WebMain.aspx?sys=100&funid=defjspf2

Nageshwaran, Harris, R. C., & Guerche-Seblain, C. E. (2021). Review of the role of big data and digital technologies in controlling COVID-19 in Asia: Public health interest vs. privacy. Digital Health, 7, 20552076211002953.

Norris, C. L., Taylor Jr., & Taylor, D. C. (2021). Pivot! How the restaurant industry adapted during COVID-19 restrictions. International Hospitality Review, 35, 132–155.

Oblander, E. S., & McCarthy (2021). How has COVID-19 impacted customer relationship dynamics at restaurant food delivery businesses. 23, 2139.

Panzone, L. A., Larcom, & She, P. W. (2021). Estimating the impact of the first COVID-19 lockdown on UK food retailers and the restaurant sector. Global Food Security, 28, 100495.

Park & Kim (2020). Predicting the variables that determine university (re-)entrance as a career development using support vector machines with recursive feature elimination: The case of South Korea. Sustainability, 12(18), 7365.

Płoński (2020). Random forest feature importance computed in 3 ways with Python. https://mljar.com/blog/feature-importance-in-random-forest/

Ren (2014). A study of practice of collaborative process-based supply chain in Chinese catering industry [Conference session]. 11th International Conference on Service Systems and Service Management (pp. 1–6).

Rubio, Bermúdez, J. D., & Vercher (2017). Improving stock index forecasts by using a new weighted fuzzy-trend time series method. Expert Systems with Applications, 76, 12–20.

Shekar, B. H., & Dagnew (2020). L1-regulated feature selection and classification of microarray cancer data using deep learning [Conference session]. Proceedings of 3rd International Conference on Computer Vision and Image Processing, 2 (pp. 227–242).

C. Y. (2020). Passenger flow forecast of catering business based on autoregressive integrated moving average and smoothing index prediction model [Conference session]. International Signal Processing, Communications and Engineering Management Conference (pp. 53–57).

Suganya & Shanthi (2012). Fuzzy c-means algorithm-A review. International Journal of Scientific and Research Publications, 2(11), 1.

Sun, Feng, Zhao, Cao, & Yao (2021). Deep learning based customer preferences analysis in Industry 4.0 environment. Mobile Networks and Applications, 26(6), 2329–2340.

Suresh & Bharathi, C. R. (2016). Sentiment classification using decision tree based feature selection. International Journal of Control Theory and Applications, 9(36), 419–425.

Suzuki (2011). Artificial neural networks: Industrial and control engineering applications. Intech.

Tanizaki, Kozuma, & Shimmura (2021). Forecasting the number of customers visiting restaurants using machine learning and statistical method. IFIP Advances in Information and Communication Technology, 632, 189–197.

Ugarova, Bolkhovitina, E. N., & Davydenko, N. I. (2019). The development of the regional catering market: The case of the Altai region [Conference session]. International Conference on Sustainable Development of Cross-Border Regions: Economic, Social and Security Challenges (pp. 165–169).

Ustebay, Turgut, & Aydin, M. A. (2018). Intrusion detection system with recursive feature elimination by using random forest and deep learning classifier [Conference session]. 2018 International Congress on Big Data, Deep Learning and Fighting Cyber Terrorism (pp. 71–76).

Wang, Y. C., & Chen, T. C. (2019). A partial-consensus posterior-aggregation FAHP method: Supplier selection problem as an example. Mathematics, 7(2), 179.

Wang, Y. C., Tsai, H. R., & Chen (2021). A selectively fuzzified back propagation network approach for precisely estimating the cycle time range in wafer fabrication. Mathematics, 9(12), 1430.

Whitfield, R. I., & Duffy, A. H. B. (2013). Extended revenue forecasting within a service industry. International Journal of Production Economics, 141(2), 505–518.

Wong, T. T. (2015). Performance evaluation of classification algorithms by k-fold and leave-one-out cross validation. Pattern Recognition, 48(9), 2839–2846.

H. C., & Chen (2015). CART–BPN approach for estimating cycle time in wafer fabrication. Journal of Ambient Intelligence and Humanized Computing, 6(1), 57–67.

H. C., Wang, Y. C., & Chen, T. C. T. (2020). Assessing and comparing COVID-19 intervention strategies using a varying partial consensus fuzzy collaborative intelligence approach. Mathematics, 8(10), 1725.

Xie & Ding (2008). Forecasting the retail sales of China’s catering industry using support vector machines [Conference session]. 7th World Congress on Intelligent Control and Automation (pp. 4458–4462).

Yang, Liu, & Chen (2020). COVID-19 and restaurant demand: Early effects of the pandemic and stay-at-home orders. International Journal of Contemporary Hospitality Management, 32(12), 3809–3834.

Yan & Zhang (2015). Feature selection and analysis on correlated gas sensor data with recursive feature elimination. Sensors and Actuators B Chemical, 212, 353–363.

Yolcu, O. C., Yolcu, Egrioglu, & Aladag, C. H. (2016). High order fuzzy time series forecasting method based on an intersection operation. Applied Mathematical Modelling, 40(19–20), 8750–8765.