Abstract
In this article, the optimal linear quadratic control problem is considered for the wireless sensor and actuator network with stochastic network-induced delays and packet dropouts. Considering the event-driven relay nodes, the optimal solution is obtained, which is a function of the current plant state and all past control signals. It is shown that the optimal control law is the same for all locations of the controller placement. Since the perfect plant state information is available at the sensor, the optimal controller should be collocated with the sensor. In addition, some issues such as the plant state noise and suboptimal solution are also discussed. The performance of the proposed scheme is investigated by an application of the load frequency control system in power grid.
Keywords
Introduction
Networked control systems (NCSs) in which the shared communication medium is used for the connection between the plant and the controller have recently attracted much attention due to their potential applications in various areas such as power grids, dc motors, and robotic networks.1,2 In such systems, the network-induced delay and the packet dropout introduced by the communication network cause performance degradation and potential system instability.
In the literature, network-induced delays have been modeled in various forms such as constant delays, 3 short stochastic delays, 4 and long stochastic delays. 5 The typical approaches to modeling, analysis, and synthesis of NCSs with delays are summarized in Hespanha et al. 6 and Zhang et al. 7 On the other hand, the packet dropout can be modeled either as a stochastic process 8 or a deterministic one. 3 If the packet containing the current state information is dropped, the typical solution is to either predict the lost state, 9 or use the previous state values, 3 or simply set the lost state to zero. 10 With full state information, the optimal control problem can be solved. In particular, the optimal linear quadratic (LQ) controller is developed for the NCS in Nilsson et al. 4 and Hu and Zhu 5 with stochastic network-induced delays.
Compared with the traditional NCS, wireless NCSs, especially wireless sensor and actuator networks (WSANs), offer architectural flexibility and additional degrees of freedom. 11 Several new standards have recently been introduced for multi-hop WSANs, for example, WirelessHART, 12 ISA-100. 13 Due to the flexibility of placing the controller in multi-hop WSANs, two significant challenges arise: the optimal placement of the controller node and the optimal control law design, which have been studied by a significant amount of works on WSANs. In particular, assuming the packets to be arbitrarily long, the problem of optimal controller placement in multi-hop WSANs is addressed in Robinson and Kumar, 14 and it is shown that placing the controller at the actuator node achieves better performance than at the sensor node under certain conditions. Over the lossy networks with packet dropouts, the optimal control problem is decomposed into an LQ optimal regulator design and state estimation problem in Gupta et al., 15 and then the optimal control algorithm is obtained by a backward recursion for every time step. Assuming the sensor and the actuator can exchange data with the controller independently, the LQ Gaussian optimal control problem is addressed in Garone et al. 16 for both finite and infinite horizon cases. The LQ Gaussian problem has also been studied in Lin et al. 17 for systems with packet losses but without acknowledgment signals of status of control packet losses. In Xin et al., 18 the necessary and sufficient conditions under which the closed-loop WSAN is guaranteed to be stable for arbitrary controller location are studied in both one-hop and multi-hop network cases. Given the predesigned control strategy, the location distribution for the optimal controller placement is found in Quevedo et al., 19 where the actuator node is assumed to have the capability to broadcast the plant state information to the intermediate nodes over parallel unreliable links. However, only the effect of the packet dropout is considered in all the above works, and in this case the optimal control law is linear with the plant state (or estimated state). On the other hand, in practice, the network-induced delay and the packet dropout are the two most important characteristics that should be jointly studied in wireless NCSs.6,7 Taking the delay into consideration, not only the plant state but also the previous control signals should be used in the design of the optimal controller.4,5,20,21
In this article, we focus on the general case in which both the stochastic network-induced delays and the packet dropouts are considered, and the optimal control problem is investigated for a linear WSAN with event-driven relay nodes. Using the quadratic cost function, the optimal solution is derived as a function that is linear with the current plant state and all past control signals, and the corresponding suboptimal solution is also proposed. Then, the optimal controller location is deserved to be at the sensor since the perfect plant state is available. The performance of the proposed algorithm is assessed using the application of load frequency control (LFC) in power grid systems.
The remainder of this article is organized as follows. The system model and problem formulation are given in section “System model and problem formulation.” We then derive the optimal controller with stochastic delays and packet dropouts in section “Optimal control law design.” Section “Extension to the long delay case” extends the results to the case of long stochastic network-induced delays. Numerical results and conclusions are given in sections “Simulation results” and “Conclusion,” respectively.
System model and problem formulation
In general, one sensor (or relay) node is selected as the controller to maintain the system stability in WSANs. As shown in Figure 1, node 5 is selected as the controller to coordinately generate control signals which are fed back to the plant. Actually, the WSAN considered in Figure 1 can be simplify converted to be the equivalent wireless multi-hop WSAN as shown in Figure 2,14,19 where the plant, sensor, actuator, and a number of relay nodes together form a closed-loop NCS.

The structure of the WSAN.

The wireless multi-hop WSAN.
We assume that the plant is a continuous-time linear time-invariant system, the sensor is time-driven with a constant sampling period
For a given controller located at the
where
Note that, from equations (1) and (2), we can see that the S/A delay and packet dropout are independent of the controller node location, that is, unrelated to the parameter
In this section, we first assume that the S/A delay is smaller than one sampling period, that is,
where
Then, the corresponding discrete-time version of equation (1) is given as [4]
where
If the packet dropout is also considered,
where
If the consecutive packet dropout happens, iteratively using equation (5), the general formulation of
which means that
Substituting equations (6) into (4), the discrete-time control process with delays and packet dropouts can be expressed as
where
which is determined by the S/A delay
Note that there are many works to model the discrete-time control process with packet dropouts. However, most of them ignore the network-induced delay or directly set the actuator input to be zero when the packet dropout happens, which is the typical way to simplify the analysis. Here, considering the event-driven controller and actuator, the plant state loss and the control signal loss can result in no signal (i.e., no event) arriving at the controller and the actuator, respectively. In this case, the actuator input remains the same as the last control signal, but cannot be directly set to zero. In addition, the effect of network-induced delays is also considered. Thus, we formulate the discrete-time control process as in equation (7).
In this article, we focus on the optimal control problem of the WSAN with stochastic network-induced delays and packet dropouts. Using a quadratic cost function as in Robinson and Kumar, 14 Xin et al., 18 and Quevedo et al., 19 the design of the optimal control is to minimize the cost function, that is
where
Optimal control law design
In this section, the optimal control law is derived for the WSAN in equation (9) subject to the constraint equation (7).
Optimal control law derivation
Define
Then, equation (7) can be rewritten as
where the time-varying coefficient matrices
where
Then, the optimization problem in equation (9) can be rewritten as
where
Lemma 1
Assume that function
Define
We next derive the expressions for the optimal control law.
L = N
When
where
L=N–1
When
where
Then, the optimal control law minimizing equation (18) with respect to
where
At the same time, substituting
where
which is symmetric, that is,
When
We can see that
where
Here
From equations (25) and (26), we observe that the optimal control law
Some discussions
In this subsection, some issues are discussed for design of the optimal controller.
Optimal location for the controller
The design of the optimal controller is obtained in equation (25) and we observe that
State noise
The disturbance usually exists at the plant. If the state noise is considered, equation (4) can be rewritten as
where
Using equations (27) instead of (4) as the dynamic process for design of the optimal controller, similar to that in subsection “Optimal control law derivation,” we can derive the same optimal control law as equation (25), and the value of
where
The cost function in equation (9) can be derived as
Suboptimal solution
From equations (25) and (26), a drawback with the optimal control law is the complicated matrix
which means that at most the effect of
Using equation (29), it is equivalent to set
where
and the related parameters
From the definition of
Extension to the long delay case
In this section, we extend the result with short network-induced delay in section “Optimal control law design” to the case when the delay is larger than one sampling period.
Here, we assume that the S/A network-induced delay is bounded that
where
Note that

Signal timings.

An example of arrivals in
Based on equations (6) and (33), the discrete-time control process with long delays is given as
where
and
We observe that equations (7) and (35) have the same form. Then, using
Simulation results
In this section, we provide simulation studies on the application of the proposed optimal controller scheme with stochastic network-induced delays and packet dropouts to the LFC system in power grid.24,25 The typical LFC system is composed of speed governor, turbine, generator, and LFC controller (included in the wireless sensor and actuator network), which is illustrated in Figure 5. The objective is to optimally adjust LFC controller
where

WSAN-based LFC system for power grid.
The deviation of frequency
where
The system state vector can be defined as
The sensor is required to measure the four-dimensional corresponding system state, and then send to the LFC controller. Once the state signal has arrived at the controller, a new control signal is calculated and sent to the actuator node.
Then, the linear dynamic control model can be described as
where
Based on equations (37) and (38) and Figure 5, we obtain the system parameters
In the simulation, a 4-hop wireless network composed of the sensor, actuator, and three relay nodes is chosen for simulation. For each hop, it is assumed that the packet dropout follows the Bernoulli distribution with
In addition, we set the initial value of the plant state
and the sampling period and duration are chosen as T = 0.05 and N = 50, respectively.
Figure 6 shows the performance comparison when the controller is located at different nodes such as the sensor (the first node) and relay nodes (the second, third, and fourth nodes). We observe that the cost can be reduced when the controller collocates with the sensor, which indicates that the imperfect plant state information caused by the network-induced delays and packet dropouts introduces performance degradation, and the sensor is the optimal controller placement. In addition, the system performance is degraded significantly with the network-induced delay increasing. This reveals that the effect of delays should be considered in the design of the optimal control law.

Performance comparison for various controller locations.
Figure 7 shows the performance comparison for three cases: our proposed scheme with the optimal controller location, the approach in Robinson and Kumar 14 where the controller collocates with the actuator, and the LQ optimal control strategy proposed in Garone et al. 16 ignoring the network-introduced delay. We can observe that our proposed scheme with the optimal controller location outperforms the other two schemes in the sense that it has a lower total cost. It can also be seen that the cost of the approach proposed in Garone et al. 16 without considering the effect of network-induced delay increases more rapidly than the others. This is because this scheme cannot effectively deal with the network-induced delay so that both network-induced delays and packet losses should be investigated.

Performance comparison of the proposed scheme with existing approaches.
The performance of the suboptimal approach is shown in Figure 8, where the controller is assumed to be collocated with the sensor. It can be seen that when we truncate the effect of packet dropouts, the performance degradation is significantly decreased with larger

Performance comparison for suboptimal control design.
In Figure 9, the effect of the plant state noise is investigated. Here the covariance matrix of the noise is set as

The effect of plant state noise.
Conclusion
We have considered the design of the optimal control for the WSAN in the presence of stochastic network-induced delays and packet dropouts. In particular, the optimal control solution is derived as a linear function of the current plant state and all past control signals, and the optimal controller should be collocated with the sensor. Other issues such as the state noise and suboptimal solution are also discussed, and the corresponding performances are investigated by the LFC system in power grid.
Footnotes
Handling Editor: Antonio Lazaro
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by National Natural Science Foundation of China under Grant Nos 61701010, 61601330, and 61671029, and the Foundation of Beijing Municipal Commission of Education KM201610005004.
