Online Optimization of Collaborative Web Service QoS Prediction Based on Approximate Dynamic Programming

Abstract

More recently, with the increasing demand of web services on the World Wide Web used in the Internet of Things (IoTs), there has been a growing interest in the study of efficient web service quality evaluation approaches based on prediction strategies to obtain accurate quality-of-service (QoS) values. However, it is obvious that the web service quality changes significantly under the unpredictable network environment. Such changes impose very challenging obstacles to web service QoS prediction. Most of the traditional web service QoS prediction approaches are implemented only using a set of static model parameters with the help of designer's a priori knowledge. Unlike the traditional QoS prediction approaches, our algorithm in this paper is realized by incorporating approximate dynamic programming- (ADP-) based online parameter tuning strategy into the QoS prediction approach. Through online learning and optimization, the proposed approach provides the QoS prediction with automatic parameter tuning capability, and prior knowledge or identification of the prediction model is not required. Therefore, the near-optimal performance of QoS prediction can be achieved. Experimental studies are carried out to demonstrate the effectiveness of the proposed ADP-based prediction approach.

1. Introduction

Recently, with the increasing presence and adoption of cloud computing, the new idea of “anything as a service (XaaS)” is becoming more and more popular. XaaS enables the consumers to use the software with the form of “Use and Not Have”. Therefore, it plays an important role in the applications of Internet of Things (IoTs). However, with the emergence of a huge number of cloud services, it is more and more difficult to choose an appropriate service in accordance with demand from the users. A number of web service composition and web service selection approaches have been proposed. It has led to the development of the service of computing (SOC) [1–4].

Obviously, only considering the services from the function has been unable to meet the requirements from users. Then, the service recommendation based on nonfunctional indexes (e.g., quality of service (QoS)) has become one of the attractive research fields in SOC [5, 6]. QoS represents the real user experience of a cloud service. Generally speaking, the QoS data from the user or server includes the availability, response time, throughput, delay, and delay variation and loss. While recommending a service based on the QoS data, one of the biggest problems is that the QoS data we have is not complete [7–12]. Actually, the QoS values of web services can be collected from the server side or the client side. At the server side, QoS values are usually provided and collected by the service providers. Here, we only focus on the QoS values measured at the client side. Due to the influence of the unpredictable network connections and complex user application environment on the Internet, QoS values vary widely at the client side. Thus, the web service evaluation is conducted for obtaining detailed and accurate QoS values at the client side. However, because there are huge amounts of web services on today's Internet, it will take too much time to evaluate all the web service candidates for service users. Therefore, it might be difficult or even impractical to fulfil the above web service evaluation task at the client side.

To obtain accurate web service QoS values on condition that there are no sufficient service evaluations at the client side, some effective approaches with the help of prediction strategies are widely studied. The traditional QoS prediction algorithms are simple, and they are generally implemented by carrying out the arithmetic average operation for QoS values. Here, they use the average QoS values to predict the unknown QoS values. The major drawback of such methods is that they ignore some personalized factors, which may lead to a low prediction accuracy. In view of it, the collaborative filtering based approaches for making personalized QoS value prediction for the service users have been proposed. Specifically, the collaborative filtering technique is developed to automatically predict QoS values of the current user by collecting information from other similar users or items. In general, the collaborative filtering based QoS prediction approaches can be categorized into three major groups. The first group is the user-based collaborative filtering method using Pearson correlation coefficient (PCC), namely, UPCC. It is a very classical method. The QoS value prediction is implemented by employing similar users [13, 14]. The second group is the item-based collaborative filtering method using PCC, namely, IPCC. It is widely used in industrial companies (e.g., Amazon). This approach employs similar web services (i.e., items) for the QoS value prediction [15]. The third group is the probabilistic matrix factorization (PMF) based collaborative prediction. This idea was proposed by Salakhutdinov and Mnih [16], where the user preference matrix is fitted by a product of two lower-rank matrices. This method may perform well on some large, sparse, and imbalanced data sets. Furthermore, through the use of fusion techniques [17, 18], some advanced methods are also developed. For instance, Zheng et al. proposed a neighborhood-integrated matrix factorization (NIMF) approach by fusing the neighborhood-based and model-based collaborative filtering approaches to improve the prediction accuracy [19].

Although those approaches mentioned above play important roles in applications, there are also some limitations. Most of the QoS prediction approaches depend on designers’ a priori knowledge about the prediction model parameters. It is obvious that the information about a prediction model or more specifically precise knowledge about a prediction system is quite difficult to obtain in some practical applications. Then, model parameter identification through experiments is usually needed. However, it is time consuming for some large-scale experiments. This practical limitation imposes challenging obstacles to the applicability of those QoS prediction methods. For instance, in [19], a series of experiments were conducted for the purpose of parameter identification used in QoS prediction. But it was computationally intensive for the scale of web service QoS data set. Furthermore, it was not suitable for online parameter tuning under complex network environment on the Internet.

In this paper, an optimal online parameter tuning methodology based on approximate dynamic programming (ADP) is proposed to improve the QoS prediction approach, and the experiment is conducted on a large-scale web service QoS data set that has some characteristics of big data. Because of the fast adaptability and approximation capabilities of neural network (NN) in its model-free reinforcement learning scheme, ADP using NNs is a powerful tool for computing optimal solution of a multistage dynamic decision process while avoiding the “curse of dimensionality” [20–23]. Under the ADP scheme with actor-critic architecture, a critic network is designed for approximating cost function and an action network is designed for generating optimal actions [24]. We propose a model-free online ADP learning algorithm for parameter tuning used in collaborative web service QoS prediction. By designing an ADP architecture and defining an appropriate reinforcement signal, the ADP tunes the QoS prediction model parameter optimally to achieve the satisfactory QoS prediction. In addition, for big data applications with large-scale data sets, some available QoS prediction approaches are time consuming in implementing model parameter identification by trial and error with painstakingly handcrafted exhaustion method. In contrast, the ADP-based method has automatic model parameter tuning capability and it may not be very computationally intensive for the scale of web service QoS data set. In this way, the ADP-based method may be a considerable alternative to big data analytics in this case.

Different from the traditional QoS prediction approaches, our methodology has the following advantages: (1) the optimal parameter tuner used in prediction approach can be implemented online; (2) no prior knowledge or identification of the prediction model is required; (3) the near-optimal performance may be achieved while the parameters of prediction model can adapt to the changes of network environment; and (4) the proposed structure and the associated algorithm can be also extended to applications for other types of QoS prediction model.

This paper is organized as follows. In Section 2, we introduce a collaborative QoS prediction framework using PMF model. An ADP-based parameter tuner for QoS prediction model is proposed in Section 3. Experiment results are presented in Section 4. Conclusion is provided in Section 5.

2. QoS Prediction Model and Parameter Analysis

2.1. Collaborative QoS Prediction Model

A collaborative QoS prediction framework is shown in Figure 1, where the service users are encouraged to share their individually observed past web service QoS information. In this collaborative framework, a service user will obtain the QoS prediction service from the centralized server only if he/she contributes some QoS values. Meanwhile, more web service QoS values are contributed by a service user; more user features can then be mined from those contributed data. In this way, higher QoS value prediction accuracy can therefore be achieved. It is the essence of this collaborative framework [19].

Figure 1

The proposed collaborative QoS prediction framework.

Furthermore, after providing the local QoS values to the server, the service users can get the prediction results via the following three phases. In the first phase, the system calculates the users’ similarities using PCC and determines a set of top-K similar users (i.e., neighbors) for the current user. In the second phase, by employing those neighbors’ information mentioned above, the system designs a collaborative filtering model to predict the missing web service QoS values in the user-item matrix, where each element in this matrix is the value of a certain QoS property of a web observed by a service. In the last phase, an ADP-based parameter tuner is proposed to adjust the key parameters of the above collaborative filtering model to find out the near-optimal results. Details of this phase are presented in the next section.

There is an $m \times n$ user-item matrix R, where m and n are the number of service users and web services, respectively. Here, we use algorithm PCC to compute the similarities between different service users. Generally, because PCC considers the differences in the user value style, it can achieve high accuracy [13]. According to the computation approach of PCC, the similarity between two service users i and k can be expressed as

\begin{matrix} PCC (i, k) = \frac{\sum_{j \in B} ‍ (R_{i j} - {\bar{R}}_{i}) (R_{k j} - {\bar{R}}_{k})}{{(\sum_{j \in B} ‍ {(R_{i j} - {\bar{R}}_{i})}^{2})}^{1 / 2} {(\sum_{j \in B} ‍ {(R_{k j} - {\bar{R}}_{k})}^{2})}^{1 / 2}}, \end{matrix}

(1)

where B is the subset of web services that are invoked by both service user i and user k. And

R_{i j}

is the element in the matrix R, which is the value of a certain client-side QoS property of web service j observed by service user i. In addition,

{\bar{R}}_{i}

and

{\bar{R}}_{k}

are the average QoS values of different web services observed by service users i and k, respectively. With those PCC values computed in 1, we can identify a set of top-K similar users. For a service user i, a set of similar users

T (i)

is defined as [19]

\begin{matrix} T (i) = \{k | k \in top- K (i), PCC (i, k) > 0, i \neq k\}, \end{matrix}

(2)

where top-

K (i)

represents a set of top-K similar users for the user i.

Then, a neighborhood-integrated matrix factorization approach is employed to make prediction in the user-item matrix [19]. For an $m \times n$ user-item matrix R, it is fitted by a matrix $X = U^{T} V$ using the matrix factorization algorithm, where $U \in R^{l \times m}$ , $V \in R^{l \times n}$ , and l represents the rank of the matrix X [16]. With this factorization of the matrix X, the web service QoS values from a user can be predicted by making a linear combination of the factor vectors in U. Actually, each column of V can be regarded as a linear predictor for a web service. Therefore, the prediction can be implemented by minimizing the following sum-of-squared-errors objective function E with quadratic regularization terms [19]:

\begin{array}{l} \min E (U, V) \\ = \frac{1}{2} \sum_{i = 1}^{m} ‍ \sum_{j = 1}^{n} ‍ (R_{i j} - (α U_{i}^{T} V_{j} + (1 - α) \sum_{k \in T (i)} ‍ S_{i k} U_{k}^{T} V_{j})) \\ + \frac{λ_{U}}{2} {‖U‖}^{2} + \frac{λ_{V}}{2} {‖V‖}^{2}, \end{array}

(3)

where

α \in [0,1]

is a weight coefficient designed to balance the usages of information from user and from the user's neighbors,

λ_{U}

and

λ_{V}

are two parameters used to avoid overfitting, and

‖\cdot‖

represents the Frobenius norm. In addition,

S_{i k}

represents the normalized similarity between users i and k, where it is defined as

\begin{matrix} S_{i k} = \frac{PCC (i, k)}{\sum_{k \in T (i)} ‍ PCC (i, k)} . \end{matrix}

(4)

By applying a gradient descent rule for $U_{i}$ and $V_{j}$ , the objective function $E (U, V)$ is minimized and a local minimum can be found.

2.2. Parameter Analysis

In function 3, the weight coefficient α bounded within $[0,1]$ determines how much the objective function relies on the users themselves and their similar users, where the QoS prediction can be made only by using the information from the current user when $α = 1$ , and the missing QoS value is predicted purely by using the information from those similar users (i.e., the current user's neighbors) when $α = 0$ .

In addition, the top-K value determines the number of similar users employed in collaborative QoS prediction. Generally speaking, a too large top-K value may hurt the prediction accuracy, because it means that some dissimilar users may be involved in the prediction computation. Meanwhile, a too small top-K value may be often computationally debilitating.

Therefore, an appropriate α and a top-K value may help to improve the QoS prediction accuracy with low computational complexity. Here, we design an ADP-based parameter tuner which aims to find the satisfactory parameters α and top-K through online optimization.

3. ADP-Based Parameter Tuner

3.1. System Architecture

For this QoS prediction problem, those available collaborative filtering approaches (e.g., [19]) are developed in accordance with the designer's experience for parameter setting used in system model. And a set of static parameters lack adaptability. Thus, for an unpredictable network environment, the prediction performance is unsatisfactory only by using predefined static parameters.

The QoS prediction framework using our ADP-based parameter tuner is illustrated in Figure 2, which aims to address the above issue in the design of QoS prediction model. Here, the proposed ADP-based tuner is employed to update two key parameters in the QoS prediction model, that is, α and top-K value.

Figure 2

The QoS prediction framework with ADP-based parameter tuner.

In Figure 2, the vector $X ≜ (x_{1}, x_{2}) = (α, k)$ from the QoS prediction model is taken as the input for the ADP-based tuner. Two regulatory signals $(Δ α, Δ k)$ are generated through modulation transform for u which is the output of the ADP-based tuner, where k represents the the top-K value. And $(Δ α, Δ k)$ are employed to update the parameters of the original QoS prediction model by

\begin{matrix} α (t) = α (t - 1) + Δ α (t), \\ k (t) = k (t - 1) + Δ k (t) . \end{matrix}

(5)

Moreover, the ADP-based parameter tuner in Figure 2 is implemented in accordance with actor-critic architecture of ADP [21]. It is illustrated in Figure 3, where the solid lines are signal flow and the dashed lines are the paths for NNs’ parameter tuning. As mentioned above, there are two NNs in the ADP-based tuner. One is designed as an action network to take the system state $X (t)$ as input to output the control signal $u (t)$ at time t. And the other is designed as a critic network to take $X (t)$ and $u (t)$ as inputs to generate $J (t)$ . The discount factor is $γ \in (0,1)$ . In the ADP-based tuner, $r (t)$ is a reinforcement signal which is provided from the QoS prediction model. Specifically, for the prediction problem, the reinforcement signal can be defined as a function of the measure of prediction results at time t (e.g., mean absolute error (MAE)). A small MAE will be encouraged. Actually, $r (t)$ is generated to indicate effectiveness of the control $u (t)$ and speed up the convergence.

Figure 3

Schematic diagram for implementation of ADP-based parameter tuner.

Considering the reinforcement signal $r (t)$ as an instantaneous cost function, the cost function can be defined as

\begin{matrix} R (t) = \sum_{i = 1}^{\infty} ‍ γ^{i - 1} r (t + i) . \end{matrix}

(6)

The optimal parameter tuning problem is to find a control policy to minimize the cost function. So the optimal cost function can be defined as

\begin{matrix} R^{*} (t) = \min_{u (t)} \{r (t + 1) + γ R^{*} (t + 1)\} . \end{matrix}

(7)

Here, the output of the critic network (i.e., $J (t)$ ) is to approximate the cost function $R (t)$ . When the critic network is well trained, $J (t)$ should satisfy the equation $J (t - 1) = r (t) + γ J (t)$ or $γ J (t) - [J (t - 1) - r (t)] = 0$ in accordance with 7. And the output of the action network (i.e., $u (t)$ ) is to minimize the difference between the approximate function $J (t)$ and the desired objective, denoted by $U_{c} (t)$ which is set to 0 without loss of generality in this paper. Under this actor-critic architecture with convergence guarantees, a near-optimal control policy is achieved.

In addition, the regulatory signals are generated by using modulation transform for $u (t) ≜ (u_{1} (t), u_{2} (t))$ as follows:

\begin{matrix} Δ α (t) = M_{1} u_{1} (t), \\ Δ k (t) = M_{2} u_{2} (t), \end{matrix}

(8)

where

M_{1}, M_{2} \in R^{+}

are modulation factors. Then, we provide the implementation details for our ADP-based parameter tuner.

3.2. The Action Network and the Critic Network

In our implementation, the feedforward NN with one hidden layer is used to design the action network and the critic network. The action network is illustrated in Figure 4, where $(x_{1} (t), x_{2} (t)) = (α (t), k (t))$ are the inputs, $(u_{1} (t), u_{2} (t))$ are the outputs, $w_{a_{i j}}^{(1)}$ is the weight connecting the jth input node to the ith hidden node, and $w_{a_{i k}}^{(2)}$ is the weight connecting the ith hidden node to the kth output node. In addition, $v (t)$ is the input vector of the output nodes and $h (t)$ and $g (t)$ are the input and output vectors of the hidden nodes, respectively. Let $φ (\cdot)$ be a hyperbolic tangent threshold function used in the hidden layer, and it can be defined as

\begin{matrix} φ (x) = \frac{1 - e^{- x}}{1 + e^{- x}} . \end{matrix}

(9)

Figure 4

The action network.

In the action network, the approximate error and the objective function to be minimized are defined as

\begin{matrix} e_{a} (t) = J (t) - U_{c} (t), \\ E_{a} (t) = \frac{1}{2} e_{a}^{2} (t) . \end{matrix}

(10)

The critic network is illustrated in Figure 5, where $w_{c i j}^{(1)}$ is the weight connecting the jth input node to the ith hidden node and $w_{c i}^{(2)}$ is the weight connecting the ith hidden node to the output node. Moreover, $q (t)$ and $p (t)$ are the input and output vectors of the hidden nodes, respectively. Similarly, the error and the objective function to be minimized can be expressed as follows:

\begin{matrix} e_{c} (t) = γ J (t) - [J (t - 1) - r (t)], \\ E_{c} (t) = \frac{1}{2} e_{c}^{2} (t) . \end{matrix}

(11)

Figure 5

The critic network.

3.3. Weight Online Learning Rules

In our proposed online learning and optimization strategy, the feature of not requiring prior knowledge means that the action network and the critic network in ADP-based parameter tuner can both be randomly initialized toward their network weights in the initialization phase of collaborative QoS prediction. Once a system state is observed, an action will be subsequently produced through the computation for the combination of those weights in the action network. A good action will be encouraged, while a bad action will be punished.

The weight online learning rule for the action network is a gradient-based adaptation designed as

\begin{matrix} w_{a} (t + 1) = w_{a} (t) - l_{a} (t) \frac{\partial E_{a} (t)}{\partial w_{a} (t)}, \\ \frac{\partial E_{a} (t)}{\partial w_{a} (t)} = \frac{\partial E_{a} (t)}{\partial J (t)} \frac{\partial J (t)}{\partial u (t)} \frac{\partial u (t)}{\partial w_{a} (t)}, \end{matrix}

(12)

where

w_{a} (t)

is the weight vector in the action network and

l_{a} (t) > 0

represents the learning rate of the action network at time t.

Here, the output $u (t)$ can be written as

\begin{matrix} u_{k} (t) = φ (v_{k} (t)), k = 1,2, \\ v_{k} (t) = \sum_{i = 1}^{N_{h a}} ‍ w_{a_{i k}}^{(2)} (t) g_{i} (t), k = 1,2, \\ g_{i} (t) = φ (h_{i} (t)), i = 1, \dots, N_{h a}, \\ h_{i} (t) = \sum_{j = 1}^{2} ‍ w_{a_{i j}}^{(1)} (t) x_{j} (t), i = 1, \dots, N_{h a}, \end{matrix}

(13)

where

N_{h a}

is the number of hidden nodes in the action network.

The weight online learning rule for the critic network is also a gradient-based adaptation designed as

\begin{matrix} w_{c} (t + 1) = w_{c} (t) - l_{c} (t) \frac{\partial E_{c} (t)}{\partial w_{c} (t)}, \\ \frac{\partial E_{c} (t)}{\partial w_{c} (t)} = \frac{\partial E_{c} (t)}{\partial J (t)} \frac{\partial J (t)}{\partial w_{c} (t)}, \end{matrix}

(14)

where

w_{c} (t)

is the weight vector in the critic network and

l_{c} (t) > 0

represents the learning rate of the critic network at time t.

Then, the output $J (t)$ is written as

\begin{matrix} J (t) = \sum_{i = 1}^{N_{h c}} ‍ w_{c i}^{(2)} p_{i} (t), \\ p_{i} (t) = φ (q_{i} (t)), i = 1, \dots, N_{h c}, \\ q_{i} (t) = \sum_{j = 1}^{2} ‍ w_{c i j}^{(1)} x_{j} (t) + \sum_{j = 3}^{4} ‍ w_{c i j}^{(1)} (t) u_{j - 2} (t), i = 1, \dots, N_{h c}, \end{matrix}

(15)

where

N_{h c}

is the number of hidden nodes in the critic network.

In the implementation of ADP-based tuner, it should be noted that the weight normalization is performed in both networks as follows:

\begin{matrix} w_{c} (t + 1) = \frac{w_{c} (t) + Δ w_{c} (t)}{‖w_{c} (t) + Δ w_{c} (t)‖}, \\ w_{a} (t + 1) = \frac{w_{a} (t) + Δ w_{a} (t)}{‖w_{a} (t) + Δ w_{a} (t)‖} . \end{matrix}

(16)

Finally, the ADP-based parameter tuner algorithm is summarized in Algorithm 1. Here, $N_{c}$ and $N_{a}$ are the internal cycles of the critic network and the action network, respectively. In addition, $T_{c}$ and $T_{a}$ are the internal training error thresholds for the critic network and the action network, respectively.

Algorithm 1: ADP-based parameter tuner.

(1) initialize the weights of the critic network and the action network arbitrarily;

(2) for each trail (from 1 to maximum trail) do

(3) initialize $X (t) = (α (t), k (t))$ ;

(4) for each step t in a trail do

(5) send $X (t)$ to the action network and get the output $u (t) = (u_{1} (t), u_{2} (t))$ ;

(6) send $X (t)$ and $u (t)$ to the critic network, then get the output $J (t)$ ;

(7) calculate $Δ α (t) \leftarrow M_{1} u_{1} (t)$ and $Δ k (t) \leftarrow M_{2} u_{2} (t)$ ;

(8) take $X (t + 1) = (α (t + 1), k (t + 1))$ , where $α (t + 1) \leftarrow α (t) + Δ α (t)$ and $k (t + 1) \leftarrow k (t) + Δ k (t)$ ;

(9) check the boundedness for $α (t + 1) \in (0,1)$ and $k (t + 1) \in (0$ , Max_User);

(10) send $α (t + 1)$ and $k (t + 1)$ to the QoS prediction model;

(11) get reinforcement signal $r (t + 1) = MAE$ ;

(12) repeat to update weight of the critic network $w_{c} (t)$

(13) until $E_{c} (t) < T_{c}$ or the maximum iteration number $N_{c}$ is met;

(14) repeat to update weight of the action network $w_{a} (t)$

(15) until $E_{a} (t) < T_{a}$ or the maximum iteration number $N_{a}$ is met;

(16) $X (t) \leftarrow X (t + 1)$ and $J (t - 1) \leftarrow J (t)$ ;

(17) end for

(18) end for

(19) return prediction result and MAE.

4. Experiment Results and Discussions

4.1. Data Set Description

To test the performance of our approach in real-world, we use a common data set collected from $339$ service users (in $30$ countries) on $5, 825$ web services (in $73$ countries) [25, 26]. This experiment focuses on the investigation for the response time of different web services and service users. Response time is one of the representative QoS values, which is defined as the time duration between sending a request and receiving a corresponding response for a service user.

In this experiment, there is a total of $1,974,67 5$ real-world web service invocation data. After processing those web service invocation data, there is a $339 \times 5,8 25$ user-item matrix, where each element in this matrix represents the response time value observed by a user on a web service [19].

4.2. Algorithm Settings

The parameters used in the experiments are summarized in Table 1, where the notations are defined as follows:

$l_{a} (0)$ : initial learning rate of the action network,

$l_{c} (0)$ : initial learning rate of the critic network,

$l_{a} (t)$ : learning rate of the action network at time t which is decreased by $0.05$ every $5$ time steps until it reaches $l_{a} (f)$ and stays thereafter,

$l_{c} (t)$ : learning rate of the critic network at time t which is decreased by $0.05$ every $5$ time steps until it reaches $l_{c} (f)$ and stays thereafter,

$N_{a}$ : internal cycle of the action network,

$N_{c}$ : internal cycle of the critic network,

$T_{a}$ : internal training error threshold for the action network,

$T_{c}$ : internal training error threshold for the critic network.

Table 1

Summary of parameters used in ADP-based tuner for QoS prediction.

Parameter	$l_{a} (0)$	$l_{c} (0)$	$l_{a} (f)$	$l_{c} (f)$	$N_{h a}$	$N_{a}$	$N_{c}$	$T_{a}$	$T_{c}$	$N_{h c}$
Value	0.3	0.3	0.005	0.005	6	100	50	0.005	0.05	6

Here, the weights in the action and the critic networks are trained using their internal cycles or their internal training error threshold.

4.3. Experiment Results for Case 1

4.3.1. Statement of Evaluation

In this experiment, the mean absolute error (MAE) and root-mean-squared error (RMSE) metrics are used to measure the prediction quality of our approach in the comparative study. MAE is defined by

\begin{matrix} MAE = \frac{1}{N} \sum_{i, j} ‍ |R_{i j} - {\hat{R}}_{i j}|, \end{matrix}

(17)

where

{\hat{R}}_{i j}

represents the predicted QoS value of web service j observed by service user i and N is the number of predicted values. And RMSE is defined by

\begin{matrix} RMSE = {(\frac{1}{N} \sum_{i, j} ‍ {(R_{i j} - {\hat{R}}_{i j})}^{2})}^{1 / 2} . \end{matrix}

(18)

Here, the MAE is to measure how close predicted QoS values are to the observed QoS values. Compared to the MAE, the RMSE provides an effective way to severely punish large errors.

4.3.2. Performance Comparison

We adopt the proposed optimal online parameter tuning methodology based on ADP to achieve the QoS prediction model parameters identification for α and k. The values of α and k obtained by using our approach are similar to those achieved by trial and error with method of exhaustion in [19]. Figures 6 and 7 show the trajectories of α and k under the ADP-based parameter tuner for a successful learning trial, respectively. It can be seen that α and k are gradually tuned through the use of our tuner. After $170$ time steps, α and k almost reach steady states, which indicates that the learning process of ADP has converged and near-optimal α and k are obtained.

Figure 6

A typical trajectory of α during a successful learning trial for the ADP-based parameter tuner.

Figure 7

A typical trajectory of k during a successful learning trial for the ADP-based parameter tuner.

Moreover, to evaluate the prediction accuracy, we compare our approach with the following approaches: (i)

UPCC (user-based collaborative filtering method using PCC): it only employs similar users for the QoS value prediction [13, 14];

(ii)

IPCC (item-based collaborative filtering method using PCC): it only employs similar web services for the QoS value prediction [15];

(iii)

UIPCC: it employs both similar users and similar web services for the QoS value prediction [27];

(iv)

NMF (nonnegative matrix factorization): it is also a collaborative filtering method based on matrix factorization; unlike other matrix factorization methods, it enforces the constraint that the factorized factors must be nonnegative [28];

(v)

NIMF (neighborhood-integrated matrix factorization): it fuses the neighborhood-based and model-based collaborative filtering approaches for the QoS value prediction [19].

The experimental results are shown in Table 2. From Table 2, we can observe that our method can almost obtain better prediction accuracy (with smaller MAE and RMSE values) than UPCC, IPCC, UIPCC, and NMF methods for response time. But the performance of our method is slightly poorer than the NIMF method, which is due to the fact that the NIMF method can get the globally optimal solutions of α and k through global search in [19], while our method obtains the near-optimal solutions through online parameter tuning methodology based on ADP without any designers’ a priori knowledge. However, the NIMF method needs to conduct an exhaustive search for the optimal parameters mindlessly in a large parameters space by trial and error; our method can automatically tune the parameters through online learning and optimization. In this regard, our method may reduce the computing time and improve the computational efficiency for prediction implementation. Moreover, when the matrix density is $20 %$ , the MAE and RMSE values are smaller than those values when the matrix density is $10 %$ , since denser matrix can provide more information for predicting the missing values.

Table 2

Performance comparison.

Methods	Matrix density = 10%		Matrix density = 20%
Methods	MAE	RMSE	MAE	RMSE
UPCC	0.5655	1.3326	0.5516	1.3114
IPCC	0.7596	1.6133	0.7624	1.6257
UIPCC	0.5654	1.3309	0.5053	1.2486
NMF	0.6754	1.5354	0.6771	1.5241
NIMF	0.4854	1.2745	0.4357	1.1678
Our method	0.5632	1.3199	0.5314	1.2533

4.4. Experiment Results for Case 2

To be more complex, we design a changeful network environment by adding some random disturbance to those $1,974,675$ web service invocation results of the data set used in this experiment. If some similar users’ network environment changes, those similar users may not be worthy of trust. It is obvious that the initial parameters of prediction model are unreasonable now. With the online optimization of ADP-based tuner, our method will automatically adjust parameters of prediction model to conform with the changes in the new network environment. In this case, we make QoS predictions for a service user under the changed network environment. Here, we compare our ADP-based prediction method with the NIMF method [19].

In this case, we consider a kind of disturbance for similar users. In the experiment, we artificially add a series of unrelated users to replace part of similar users with the purpose of reducing the PCC value among similar users. As our ADP-based method perceives the changes happening in similar users, it will adjust the number of similar users employed in the method by reducing the top-K value (i.e., the value k in our approach) and reduce the usage of information from similar users by increasing the value α which appeared in 3.

In Figure 8, we check the response time of the first 500 missing web services observed by one service user under the changed network environment. In this figure, the red line with spots represents the actual QoS value in every missing entry, and the blue line with stars represents the prediction data obtain by using the NIMF method with the fixed parameters of prediction model. In addition, the green line with rhombus represents the predictions results obtained by using our ADP-based online method with optimized parameters of prediction model. We can find a big error between the actual data and the prediction data using NIMF method in a changed network environment. Meanwhile, the prediction method with the ADP-based parameter tuner can achieve the satisfactory QoS prediction performance.

Figure 8

The comparison of prediction results.

5. Conclusion

In this paper, we propose an ADP-based parameter tuner for QoS prediction model using collaborative filtering algorithm. Due to its online adaptability and approximation capabilities through successive iterations, this ADP-based parameter tuner shows satisfactory performance in a large-scale experiment studied in this paper. Moreover, our approach can easily adapt to the changes of network environment without requiring prior knowledge or identification of the prediction model. Then, our method may not be computationally intensive for the scale of web service QoS data set in a complex and changeful network environment. Furthermore, with the help of the flexible structure and learning algorithm of ADP, the proposed method can be extended to other types of QoS prediction models and achieve near-optimal performance. Our experiment studies validate the effectiveness of the proposed ADP-based parameter tuner.

Footnotes

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This work was jointly supported by the National Natural Science Foundation of China under Grants 61174103 and 61272357, the National Key Technologies R&D Program of China under Grant 2015BAK38B01, and the Aerospace Science Foundation of China under Grant 2014ZA74001.

References

Ardagna

Pernici

Adaptive service composition in flexible processes

IEEE Transactions on Software Engineering 2007 33 6 369 384

10.1109/tse.2007.1011

2-s2.0-34248633749

Zhang

L. J.

Zhang

Cai

Services Computing 2007

Berlin, Germany

Springer

Jula

Sundararajan

Othman

Cloud computing service composition: a systematic literature review

Expert Systems with Applications 2014 41 8 3809 3824

10.1016/j.eswa.2013.12.017

2-s2.0-84891771378

Zhang

Hansen

K. M.

Ingstrup

A hybrid approach to self-management in a pervasive service middleware

Knowledge-Based Systems 2014 67 143 161

10.1016/j.knosys.2014.06.002

Shua

Chub

Chen

Supporting QoS-based discovery for visualization web services

International Journal of Distributed Sensor Networks 2009 5 1 19

10.1080/15501320802508295

Dong

A QoS driven web service composition method based on ESGA (elitist selection genetic algorithm) with an improved initial population selection strategy

International Journal of Distributed Sensor Networks 2009 5 1 54

10.1080/15501320802540900

Zhang

Lin

K.-J.

Efficient algorithms for Web services selection with end-to-end QoS constraints

ACM Transactions on the Web 2007 1 1, article 6

10.1145/1232722.1232728

2-s2.0-34248525624

Cardellini

Casalicchio

Grassi

Iannucci

Presti

F. L.

Mirandola

MOSES: a framework for QoS driven runtime adaptation of service-oriented systems

IEEE Transactions on Software Engineering 2012 38 5 1138 1159

10.1109/tse.2011.68

2-s2.0-84866424247

Zheng

H. Y.

Zhao

W. L.

Yang

Bouguettaya

QoS analysis for web service compositions with complex structures

IEEE Transactions on Services Computing 2013 6 3 373 386

10.1109/tsc.2012.7

2-s2.0-84883757301

10.

Zheng

Zhang

Lyu

M. R.

Investigating QoS of real-world web services

IEEE Transactions on Services Computing 2014 7 1 32 39

10.1109/TSC.2012.34

2-s2.0-84894498693

11.

Zhao

X. C.

Wen

Z. C.

X. M.

QoS-aware web service selection with negative selection algorithm

Knowledge and Information Systems 2014 40 2 349 373

2-s2.0-84875985403

10.1007/s10115-013-0642-x

12.

Mehdi

Bouguila

Bentahar

Probabilistic approach for QoS-aware recommender system for trustworthy web service selection

Applied Intelligence 2014 41 2 503 524

10.1007/s10489-014-0537-x

2-s2.0-84901575799

13.

Breese

J. S.

Heckerman

Kadie

Empirical analysis of predictive algorithms for collaborative filtering

Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence

July 1998

Madison, Wis, USA

43 52

14.

Shao

Zhang

Wei

Zhao

Xie

Mei

Personalized QoS prediction for web services via collaborative filtering

Proceedings of the IEEE International Conference on Web Services (ICWS ′07)

July 2007

Salt Lake City, Utah, USA

IEEE

439 446

10.1109/icws.2007.140

2-s2.0-46849102374

15.

Resnick

Iacovou

Suchak

Bergstrom

Riedl

GroupLens: an open architecture for collaborative filteringof netnews

Proceedings of the ACM Conference on Computer Supported Cooperative Work

October 1994

Chapel Hill, NC, USA

175 186

10.1145/192844.192905

16.

Salakhutdinov

Mnih

Probabilistic matrix factorization

Proceedings of Conference on Advances in Neural Information Processing Systems

December 2007

Vancouver, Canda

1257 1264

17.

Yang

Zhou

Study on selection strategy of web service based on fusion of subjective and objective evaluation for QoS

Journal of Information & Computational Science 2013 10 13 4213 4223

10.12733/jics20102100

2-s2.0-84886284773

18.

Luo

Chang

A novel data fusion scheme using grey model and extreme learning machine in wireless sensor networks

International Journal of Control, Automation, and Systems 2015 13 5

10.1007/s12555-014-0309-8

19.

Zheng

Lyu

M. R.

King

Collaborative web service Qos prediction via neighborhood integrated matrix factorization

IEEE Transactions on Services Computing 2013 6 3 289 299

10.1109/tsc.2011.59

2-s2.0-84883778327

20.

Barto

Powell

Wunsch

Handbook of Learning and Approximate Dynamic Programming 2004

Hoboken, NJ, USA

John Wiley & Sons

21.

Wang

Y.-T.

On-line learning control by association and reinforcement

IEEE Transactions on Neural Networks 2001 12 2 264 276

10.1109/72.914523

2-s2.0-0035273403

22.

Lewis

F. L.

Liu

D. R.

Reinforcement Learning and Approximate Dynamic Programming for Feedback Control 2012

Hoboken, NJ, USA

Wiley-IEEE

23.

Liu

Wei

Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems

IEEE Transactions on Neural Networks and Learning Systems 2014 25 3 621 634

10.1109/tnnls.2013.2281663

2-s2.0-84897594646

24.

Werbos

P. J.

Approximate dynamic programming for real-time control and neural modeling

Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches 1992 15

New York, NY, USA

Van Nostrand-Reinhold

493 525

25.

Zheng

Zhang

Lyu

M. R.

Distributed QoS evaluation for real-world Web services

Proceedings of the IEEE 8th International Conference on Web Services (ICWS ′10)

July 2010

Miami, Fla, USA

83 90

2-s2.0-77957282463

10.1109/icws.2010.10

26.

http://www.wsdream.net/dataset.html

27.

Zheng

Lyu

M. R.

King

QoS-aware web service recommendation by collaborative filtering

IEEE Transactions on Services Computing 2011 4 2 140 152

10.1109/tsc.2010.52

2-s2.0-81455141552

28.

Lee

D. D.

Seung

H. S.

Learning the parts of objects by non-negative matrix factorization

Nature 1999 401 6755 788 791

10.1038/44565

2-s2.0-0033592606