Abstract
More recently, with the increasing demand of web services on the World Wide Web used in the Internet of Things (IoTs), there has been a growing interest in the study of efficient web service quality evaluation approaches based on prediction strategies to obtain accurate quality-of-service (QoS) values. However, it is obvious that the web service quality changes significantly under the unpredictable network environment. Such changes impose very challenging obstacles to web service QoS prediction. Most of the traditional web service QoS prediction approaches are implemented only using a set of static model parameters with the help of designer's a priori knowledge. Unlike the traditional QoS prediction approaches, our algorithm in this paper is realized by incorporating approximate dynamic programming- (ADP-) based online parameter tuning strategy into the QoS prediction approach. Through online learning and optimization, the proposed approach provides the QoS prediction with automatic parameter tuning capability, and prior knowledge or identification of the prediction model is not required. Therefore, the near-optimal performance of QoS prediction can be achieved. Experimental studies are carried out to demonstrate the effectiveness of the proposed ADP-based prediction approach.
1. Introduction
Recently, with the increasing presence and adoption of cloud computing, the new idea of “anything as a service (XaaS)” is becoming more and more popular. XaaS enables the consumers to use the software with the form of “Use and Not Have”. Therefore, it plays an important role in the applications of Internet of Things (IoTs). However, with the emergence of a huge number of cloud services, it is more and more difficult to choose an appropriate service in accordance with demand from the users. A number of web service composition and web service selection approaches have been proposed. It has led to the development of the service of computing (SOC) [1–4].
Obviously, only considering the services from the function has been unable to meet the requirements from users. Then, the service recommendation based on nonfunctional indexes (e.g., quality of service (QoS)) has become one of the attractive research fields in SOC [5, 6]. QoS represents the real user experience of a cloud service. Generally speaking, the QoS data from the user or server includes the availability, response time, throughput, delay, and delay variation and loss. While recommending a service based on the QoS data, one of the biggest problems is that the QoS data we have is not complete [7–12]. Actually, the QoS values of web services can be collected from the server side or the client side. At the server side, QoS values are usually provided and collected by the service providers. Here, we only focus on the QoS values measured at the client side. Due to the influence of the unpredictable network connections and complex user application environment on the Internet, QoS values vary widely at the client side. Thus, the web service evaluation is conducted for obtaining detailed and accurate QoS values at the client side. However, because there are huge amounts of web services on today's Internet, it will take too much time to evaluate all the web service candidates for service users. Therefore, it might be difficult or even impractical to fulfil the above web service evaluation task at the client side.
To obtain accurate web service QoS values on condition that there are no sufficient service evaluations at the client side, some effective approaches with the help of prediction strategies are widely studied. The traditional QoS prediction algorithms are simple, and they are generally implemented by carrying out the arithmetic average operation for QoS values. Here, they use the average QoS values to predict the unknown QoS values. The major drawback of such methods is that they ignore some personalized factors, which may lead to a low prediction accuracy. In view of it, the collaborative filtering based approaches for making personalized QoS value prediction for the service users have been proposed. Specifically, the collaborative filtering technique is developed to automatically predict QoS values of the current user by collecting information from other similar users or items. In general, the collaborative filtering based QoS prediction approaches can be categorized into three major groups. The first group is the user-based collaborative filtering method using Pearson correlation coefficient (PCC), namely, UPCC. It is a very classical method. The QoS value prediction is implemented by employing similar users [13, 14]. The second group is the item-based collaborative filtering method using PCC, namely, IPCC. It is widely used in industrial companies (e.g., Amazon). This approach employs similar web services (i.e., items) for the QoS value prediction [15]. The third group is the probabilistic matrix factorization (PMF) based collaborative prediction. This idea was proposed by Salakhutdinov and Mnih [16], where the user preference matrix is fitted by a product of two lower-rank matrices. This method may perform well on some large, sparse, and imbalanced data sets. Furthermore, through the use of fusion techniques [17, 18], some advanced methods are also developed. For instance, Zheng et al. proposed a neighborhood-integrated matrix factorization (NIMF) approach by fusing the neighborhood-based and model-based collaborative filtering approaches to improve the prediction accuracy [19].
Although those approaches mentioned above play important roles in applications, there are also some limitations. Most of the QoS prediction approaches depend on designers’ a priori knowledge about the prediction model parameters. It is obvious that the information about a prediction model or more specifically precise knowledge about a prediction system is quite difficult to obtain in some practical applications. Then, model parameter identification through experiments is usually needed. However, it is time consuming for some large-scale experiments. This practical limitation imposes challenging obstacles to the applicability of those QoS prediction methods. For instance, in [19], a series of experiments were conducted for the purpose of parameter identification used in QoS prediction. But it was computationally intensive for the scale of web service QoS data set. Furthermore, it was not suitable for online parameter tuning under complex network environment on the Internet.
In this paper, an optimal online parameter tuning methodology based on approximate dynamic programming (ADP) is proposed to improve the QoS prediction approach, and the experiment is conducted on a large-scale web service QoS data set that has some characteristics of big data. Because of the fast adaptability and approximation capabilities of neural network (NN) in its model-free reinforcement learning scheme, ADP using NNs is a powerful tool for computing optimal solution of a multistage dynamic decision process while avoiding the “curse of dimensionality” [20–23]. Under the ADP scheme with actor-critic architecture, a critic network is designed for approximating cost function and an action network is designed for generating optimal actions [24]. We propose a model-free online ADP learning algorithm for parameter tuning used in collaborative web service QoS prediction. By designing an ADP architecture and defining an appropriate reinforcement signal, the ADP tunes the QoS prediction model parameter optimally to achieve the satisfactory QoS prediction. In addition, for big data applications with large-scale data sets, some available QoS prediction approaches are time consuming in implementing model parameter identification by trial and error with painstakingly handcrafted exhaustion method. In contrast, the ADP-based method has automatic model parameter tuning capability and it may not be very computationally intensive for the scale of web service QoS data set. In this way, the ADP-based method may be a considerable alternative to big data analytics in this case.
Different from the traditional QoS prediction approaches, our methodology has the following advantages: (1) the optimal parameter tuner used in prediction approach can be implemented online; (2) no prior knowledge or identification of the prediction model is required; (3) the near-optimal performance may be achieved while the parameters of prediction model can adapt to the changes of network environment; and (4) the proposed structure and the associated algorithm can be also extended to applications for other types of QoS prediction model.
This paper is organized as follows. In Section 2, we introduce a collaborative QoS prediction framework using PMF model. An ADP-based parameter tuner for QoS prediction model is proposed in Section 3. Experiment results are presented in Section 4. Conclusion is provided in Section 5.
2. QoS Prediction Model and Parameter Analysis
2.1. Collaborative QoS Prediction Model
A collaborative QoS prediction framework is shown in Figure 1, where the service users are encouraged to share their individually observed past web service QoS information. In this collaborative framework, a service user will obtain the QoS prediction service from the centralized server only if he/she contributes some QoS values. Meanwhile, more web service QoS values are contributed by a service user; more user features can then be mined from those contributed data. In this way, higher QoS value prediction accuracy can therefore be achieved. It is the essence of this collaborative framework [19].

The proposed collaborative QoS prediction framework.
Furthermore, after providing the local QoS values to the server, the service users can get the prediction results via the following three phases. In the first phase, the system calculates the users’ similarities using PCC and determines a set of top-K similar users (i.e., neighbors) for the current user. In the second phase, by employing those neighbors’ information mentioned above, the system designs a collaborative filtering model to predict the missing web service QoS values in the user-item matrix, where each element in this matrix is the value of a certain QoS property of a web observed by a service. In the last phase, an ADP-based parameter tuner is proposed to adjust the key parameters of the above collaborative filtering model to find out the near-optimal results. Details of this phase are presented in the next section.
There is an
Then, a neighborhood-integrated matrix factorization approach is employed to make prediction in the user-item matrix [19]. For an
By applying a gradient descent rule for
2.2. Parameter Analysis
In function 3, the weight coefficient α bounded within
In addition, the top-K value determines the number of similar users employed in collaborative QoS prediction. Generally speaking, a too large top-K value may hurt the prediction accuracy, because it means that some dissimilar users may be involved in the prediction computation. Meanwhile, a too small top-K value may be often computationally debilitating.
Therefore, an appropriate α and a top-K value may help to improve the QoS prediction accuracy with low computational complexity. Here, we design an ADP-based parameter tuner which aims to find the satisfactory parameters α and top-K through online optimization.
3. ADP-Based Parameter Tuner
3.1. System Architecture
For this QoS prediction problem, those available collaborative filtering approaches (e.g., [19]) are developed in accordance with the designer's experience for parameter setting used in system model. And a set of static parameters lack adaptability. Thus, for an unpredictable network environment, the prediction performance is unsatisfactory only by using predefined static parameters.
The QoS prediction framework using our ADP-based parameter tuner is illustrated in Figure 2, which aims to address the above issue in the design of QoS prediction model. Here, the proposed ADP-based tuner is employed to update two key parameters in the QoS prediction model, that is, α and top-K value.

The QoS prediction framework with ADP-based parameter tuner.
In Figure 2, the vector
Moreover, the ADP-based parameter tuner in Figure 2 is implemented in accordance with actor-critic architecture of ADP [21]. It is illustrated in Figure 3, where the solid lines are signal flow and the dashed lines are the paths for NNs’ parameter tuning. As mentioned above, there are two NNs in the ADP-based tuner. One is designed as an action network to take the system state

Schematic diagram for implementation of ADP-based parameter tuner.
Considering the reinforcement signal
Here, the output of the critic network (i.e.,
In addition, the regulatory signals are generated by using modulation transform for
3.2. The Action Network and the Critic Network
In our implementation, the feedforward NN with one hidden layer is used to design the action network and the critic network. The action network is illustrated in Figure 4, where

The action network.
In the action network, the approximate error and the objective function to be minimized are defined as
The critic network is illustrated in Figure 5, where

The critic network.
3.3. Weight Online Learning Rules
In our proposed online learning and optimization strategy, the feature of not requiring prior knowledge means that the action network and the critic network in ADP-based parameter tuner can both be randomly initialized toward their network weights in the initialization phase of collaborative QoS prediction. Once a system state is observed, an action will be subsequently produced through the computation for the combination of those weights in the action network. A good action will be encouraged, while a bad action will be punished.
The weight online learning rule for the action network is a gradient-based adaptation designed as
Here, the output
The weight online learning rule for the critic network is also a gradient-based adaptation designed as
Then, the output
In the implementation of ADP-based tuner, it should be noted that the weight normalization is performed in both networks as follows:
Finally, the ADP-based parameter tuner algorithm is summarized in Algorithm 1. Here,
(1) initialize the weights of the critic network and the action network arbitrarily; (2) (3) initialize (4) (5) send (6) send (7) calculate (8) take (9) check the boundedness for (10) send (11) get reinforcement signal (12) (13) (14) (15) (16) (17) (18) (19) return prediction result and MAE.
4. Experiment Results and Discussions
4.1. Data Set Description
To test the performance of our approach in real-world, we use a common data set collected from
In this experiment, there is a total of
4.2. Algorithm Settings
The parameters used in the experiments are summarized in Table 1, where the notations are defined as follows:
Summary of parameters used in ADP-based tuner for QoS prediction.
Here, the weights in the action and the critic networks are trained using their internal cycles or their internal training error threshold.
4.3. Experiment Results for Case 1
4.3.1. Statement of Evaluation
In this experiment, the mean absolute error (MAE) and root-mean-squared error (RMSE) metrics are used to measure the prediction quality of our approach in the comparative study. MAE is defined by
Here, the MAE is to measure how close predicted QoS values are to the observed QoS values. Compared to the MAE, the RMSE provides an effective way to severely punish large errors.
4.3.2. Performance Comparison
We adopt the proposed optimal online parameter tuning methodology based on ADP to achieve the QoS prediction model parameters identification for α and k. The values of α and k obtained by using our approach are similar to those achieved by trial and error with method of exhaustion in [19]. Figures 6 and 7 show the trajectories of α and k under the ADP-based parameter tuner for a successful learning trial, respectively. It can be seen that α and k are gradually tuned through the use of our tuner. After

A typical trajectory of α during a successful learning trial for the ADP-based parameter tuner.

A typical trajectory of k during a successful learning trial for the ADP-based parameter tuner.
Moreover, to evaluate the prediction accuracy, we compare our approach with the following approaches:
UPCC (user-based collaborative filtering method using PCC): it only employs similar users for the QoS value prediction [13, 14]; IPCC (item-based collaborative filtering method using PCC): it only employs similar web services for the QoS value prediction [15]; UIPCC: it employs both similar users and similar web services for the QoS value prediction [27]; NMF (nonnegative matrix factorization): it is also a collaborative filtering method based on matrix factorization; unlike other matrix factorization methods, it enforces the constraint that the factorized factors must be nonnegative [28]; NIMF (neighborhood-integrated matrix factorization): it fuses the neighborhood-based and model-based collaborative filtering approaches for the QoS value prediction [19].
The experimental results are shown in Table 2. From Table 2, we can observe that our method can almost obtain better prediction accuracy (with smaller MAE and RMSE values) than UPCC, IPCC, UIPCC, and NMF methods for response time. But the performance of our method is slightly poorer than the NIMF method, which is due to the fact that the NIMF method can get the globally optimal solutions of α and k through global search in [19], while our method obtains the near-optimal solutions through online parameter tuning methodology based on ADP without any designers’ a priori knowledge. However, the NIMF method needs to conduct an exhaustive search for the optimal parameters mindlessly in a large parameters space by trial and error; our method can automatically tune the parameters through online learning and optimization. In this regard, our method may reduce the computing time and improve the computational efficiency for prediction implementation. Moreover, when the matrix density is
Performance comparison.
4.4. Experiment Results for Case 2
To be more complex, we design a changeful network environment by adding some random disturbance to those
In this case, we consider a kind of disturbance for similar users. In the experiment, we artificially add a series of unrelated users to replace part of similar users with the purpose of reducing the PCC value among similar users. As our ADP-based method perceives the changes happening in similar users, it will adjust the number of similar users employed in the method by reducing the top-K value (i.e., the value k in our approach) and reduce the usage of information from similar users by increasing the value α which appeared in 3.
In Figure 8, we check the response time of the first 500 missing web services observed by one service user under the changed network environment. In this figure, the red line with spots represents the actual QoS value in every missing entry, and the blue line with stars represents the prediction data obtain by using the NIMF method with the fixed parameters of prediction model. In addition, the green line with rhombus represents the predictions results obtained by using our ADP-based online method with optimized parameters of prediction model. We can find a big error between the actual data and the prediction data using NIMF method in a changed network environment. Meanwhile, the prediction method with the ADP-based parameter tuner can achieve the satisfactory QoS prediction performance.

The comparison of prediction results.
5. Conclusion
In this paper, we propose an ADP-based parameter tuner for QoS prediction model using collaborative filtering algorithm. Due to its online adaptability and approximation capabilities through successive iterations, this ADP-based parameter tuner shows satisfactory performance in a large-scale experiment studied in this paper. Moreover, our approach can easily adapt to the changes of network environment without requiring prior knowledge or identification of the prediction model. Then, our method may not be computationally intensive for the scale of web service QoS data set in a complex and changeful network environment. Furthermore, with the help of the flexible structure and learning algorithm of ADP, the proposed method can be extended to other types of QoS prediction models and achieve near-optimal performance. Our experiment studies validate the effectiveness of the proposed ADP-based parameter tuner.
Footnotes
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
This work was jointly supported by the National Natural Science Foundation of China under Grants 61174103 and 61272357, the National Key Technologies R&D Program of China under Grant 2015BAK38B01, and the Aerospace Science Foundation of China under Grant 2014ZA74001.
