Research on unmanned combat aerial vehicle robust maneuvering decision under incomplete target information

Abstract

This article investigates the problem of designing a novel maneuvering decision-making method for the unmanned combat aerial vehicle. The design objective is to promote the real-time ability of decision-making method and solve the problem of uncertainty caused by incomplete target information. On the basis of statistics theory, a robust maneuvering decision method with self-adaptive target intention prediction is proposed. The robustness design is embedded in the membership function of the situation parameters. The reachable set theory and adaptive adjustment mechanism of the target state weight are used in the target intention prediction to promote the real-time ability. Simulations are conducted under the condition that the enemy aircraft perform both non-maneuvering and combat maneuvering. The results verify the good properties of the decision-making method, which can extend the survival time of the unmanned combat aerial vehicle when the enemy aircraft attacks, and short the taking position and attack time of the unmanned combat aerial vehicle when the enemy aircraft evades.

Keywords

Decision-making statistical analysis uncertainty robust design real time

Introduction

Autonomous air combat decision is a mechanism regarding how the unmanned combat aerial vehicle (UCAV) choose tactical plan or maneuver action in real time during the process of air combat. The pros and cons of the mechanism reflect the intelligent level of the UCAV decision. Air combat decision can be taken as a system, and the input of the system is all kinds of parameters related to air combat, such as platform parameters, weapon parameters, and situation parameters. Decision-making process is the information processing mechanism within the system. The output of the system is the decision results, namely, tactical plan or some certain maneuvers.

Recently, the commonly utilized decision-making methods include expert system theory,^1–3 differential games,^4–6 rough set,^7–9 Bayesian network,^10–12 and swarm intelligence algorithm.^13–15 Literature¹⁶ studied the autonomous decision-making method of the unmanned aerial vehicle (UAV) under uncertain environment in intelligence, surveillance, and reconnaissance (ISR) task. However, the operators on the ground can also feed back the effect after the implementation of decision to the UAV in the loop. Literature¹⁷ utilized fuzzy theory as a decision mechanism. The flight attitude of the enemy plane in next moment was predicted in the decision-making process, so as to decide the optimal flight tactical action of our plane. Literature¹⁸ deeply studied the influence diagram and utilized multistage influence diagram to model the stand-alone air combat. Literature¹⁹ studied tactical decision-making of the underwater vehicle to avoid torpedo. A method of tactical decision based on fuzzy logic was described. Furthermore, literature¹⁹ utilized Python language to script the decision-making process and carried out experiments on engineering application simulation platform. Literature²⁰ utilized fuzzy logic and Bayesian network to construct a situation assessment system, which included pilot mental model, aircraft platform model, sensor model, and data processing algorithm. Li and Xiaoguang²¹ studied the autonomous decision-making under the condition of close combat of the UCAV. According to the “variable value” theory in the machine game theory, the traditional payment function in the differential game model was improved, so that the decision result would be more reasonable. Literature²² reviewed the methods of air combat decision; the characteristics of application are compared between various methods such as differential game, influence diagram, and expert system in air combat decision. The development trend of air combat decision-making method in the future is predicted. Literature²³ put forward an autonomous air combat maneuvering decision method based on tentative control input. Furthermore, on the basis of current aircraft status and tentative input parameters, artificial neural network was utilized to predict the state of the fighter plane. In literature,²⁴ based on excitation the idea that “pilots could predict maneuver of the enemy plane through the visual sense in air combat within sight distance,” a method for predicting maneuver of the enemy plane on the basis of image of the enemy planes was proposed. However, the premise of this method was that the airborne equipment can provide the image with certain precision, and it needed neural network to carry out the estimation. Furthermore, whether it can meet the real-time constraints is yet to be verified.

The main motivation of this work is to propose a robust and fast maneuvering decision-making method for the UCAV. The contributions of this article are as follows. (1) The basic maneuver library is extended so that the UCAV can perform the maneuvers that other typical maneuver libraries cannot achieve. (2) By embedding the robustness design into the membership function of air combat situation, the influences brought by uncertain target information could be overcome. (3) The reachable set theory and adaptive adjustment mechanism of the target state weight are used in the mechanism of target intention prediction, so that the real-time ability of the method is promoted and the UCAV can timely react to the rapid combat maneuvering of the enemy.

Establishment of the UCAV air combat model

UCAV particle model

For the high-level maneuvering decision, the particle model of the UCAV platform can meet the requirements. Model parameters are presented in Figure 1. The particle model is shown as below

\dot{x} = V \cos γ \sin ψ

(1)

\dot{y} = V \cos γ \cos ψ

(2)

\dot{z} = V \sin γ

(3)

where x, y, and z indicate the positions of the UCAV in the inertial coordinate system; $\overset{\cdot}{x}$ , $\overset{\cdot}{y}$ , and $\overset{\cdot}{z}$ refer to the speed component in three axis directions, respectively; $γ$ means the angle between the speed and horizontal planes, namely, the track angle; and $ψ$ indicates the angle between the velocity projection in the horizontal plane and y-axis direction, namely, course angle. The derivative $\overset{\cdot}{V}, \overset{\cdot}{γ}, and \overset{\cdot}{ψ}$ expressions of $V, γ, and ψ$ are as shown as below

\overset{\cdot}{V} = g (n_{x} - \sin γ)

(4)

\overset{\cdot}{γ} = \frac{g}{V} (n_{z} \cos ϕ - \cos γ)

(5)

\overset{\cdot}{ψ} = \frac{g n_{z} \sin ϕ}{V \cos γ}

(6)

In equations (4)–(6), the first controlled variable $n_{x}$ indicates the overload along the speed direction, representing the platform thrust. The second controlled variable $n_{z}$ indicates the overload along the longitudinal direction and is also referred to as the normal overload. The third controlled variable $ϕ$ is the roll angle of the velocity vector. Assuming that the velocity vector is consistent with the body axis direction, $ϕ$ can be used to represent the controlled variable of rolling platform. When the control commands $n_{x_{com}}, n_{z_{com}}, and ϕ_{com}$ are input into the system, the dynamic delay effect existed. Thus, the modeling is conducted on the delay effect, which can be shown as below

n_{x} = \frac{1}{1 + τ_{x} s} n_{x_{com}}

(7)

n_{z} = \frac{1}{1 + τ_{z} s} n_{z_{com}}

(8)

ϕ = \frac{ω_{n}^{2}}{s^{2} + 2 ω_{n} ξ s + ω_{n}^{2}} ϕ_{com}

(9)

where $τ_{x}$ and $τ_{z}$ are the delay time constant of $n_{x}$ and $n_{z}$ , respectively. $ω_{n}$ refers to the natural oscillation frequency. $ξ$ means the damping coefficient.

Figure 1.

Definition diagram of particle model parameters.

Basic maneuver library

The establishment of the maneuver library of the unmanned combat aircraft can draw lessons from tactical actions when fighter pilot conducts air combat. According to the common way of air combat maneuvering, NASA scholars²⁵ have designed seven typical flight maneuvers at a constant speed, as shown in Figure 2: (1) continued stable flight, (2) maximum acceleration flight, (3) maximum deceleration flight, (4) maximum G-force left-turn flight, (5) maximum G-force right-turn flight, (6) maximum G-force upward flight, and (7) maximum G-force downward flight. For the constraint of maximum overload, only the physical structure limit of the UCAV platform needs to be considered. Without the capacity which constraints the pilot’s body, the maneuvering performance of the UCAV can be fully realized in air combat.

Figure 2.

Typical maneuver library.²⁵

However, the maneuver contained in the maneuver library is single and can be moved only in two separate planes without considering the actual situation of the air combat. As shown in Figure 3, this article expands the maneuver based on the problems in the above typical maneuver. The extended maneuver library contains the top-right, top-left, bottom-left, and bottom-right maneuver, and maneuver in each direction includes three classes of state, including acceleration state, deceleration state, and uniform state. By extending the maneuver library, the UCAV can perform the maneuvers such as oblique loop, chandelle, and drum maneuver which other typical maneuver libraries cannot achieve.

Figure 3.

Extended maneuver library: (a) principle diagram and (b) MATLAB simulation diagram.

Robust design of air combat situation function

According to the presentation of advantages and disadvantages of air combat situation in the literature,¹⁸ as shown in Figure 4, the air combat situation can be further divided into equilibration, advantages, disadvantages, and badness to each other; the purpose of air combat decision is to transform any situation into advantageous situation. Four parameters can be utilized to characterize the current air combat situation in the process of air combat:¹⁸ $Θ = (A, \vec{R}, V, H)$ , where A represents the direction of the UCAV and the enemy aircraft under the current situation, $\vec{R}$ stands for the distance vector of the UCAV and the enemy aircraft, V refers to the UCAV speed, and H represents the UCAV flight height. In order to enhance the robustness of decision results, the robust design is conducted on the membership function of air combat situation parameters.

Figure 4.

Air combat situation diagrams.

The membership function $η_{A} (α)$ of position parameter is defined as follows

η_{A} (α) = \frac{α}{π^{2}}, α = α_{e} α_{u}; α_{e}, α_{u} \in [0, π]

(10)

\vec{R} = [\begin{matrix} x_{u} - x_{e}, & y_{u} - y_{e}, & z_{u} - z_{e} \end{matrix}]

(11)

{\vec{V}}_{u} = [\begin{matrix} V_{u} \cos γ_{u} \sin ψ_{u} \\ V_{u} \cos γ_{u} \cos ψ_{u} \\ V_{u} \sin γ_{u} \end{matrix}]

(12)

{\begin{matrix} α_{u} = \arccos (\frac{\vec{R} \times {\vec{V}}_{u}}{‖ \vec{R} ‖ \times ‖ {\vec{V}}_{u} ‖}) \\ α_{e} = \arccos (\frac{\vec{R} \times {\vec{V}}_{e}}{‖ \vec{R} ‖ \times ‖ {\vec{V}}_{e} ‖}) \end{matrix}

(13)

In the above equation, subscript u means the UCAV, subscript e means the enemy aircraft, and the definition of $α_{e}$ and $α_{u}$ is shown in Figures 5 and 6. When both $α_{e}$ and $α_{u}$ approach $π$ , the membership function value of the position parameter reaches the maximum. It means that the UCAV is in the stern attack situation at the enemy.

Figure 5.

Orientation variable membership function.

Figure 6.

Position and distanced diagram.

When the distance between the enemy aircraft and the UCAV is less than the missile attack distance, in order to make the UCAV decision result have certain robustness, the membership function $η_{R} (R)$ of the distance parameter can be defined as

η_{R} (R) = {\begin{matrix} 1 & R \leq R_{g} \\ e^{(- \frac{{(R - R_{g})}^{2}}{2 σ^{2}})}, & R > R_{g} \end{matrix}

(14)

In the above equation, $R = ‖ \vec{R} ‖$ , $R_{g}$ shows the weapon attack distance of the UCAV, and $σ$ refers to the standard deviation.

The membership function $η_{V} (V)$ of the speed parameter is defined as

η_{V} (V) = \frac{V_{u}}{V_{*}} e^{(- \frac{2 | V_{u} - V_{*} |}{V_{*}})}

(15)

In the above equation, $V_{*}$ means the best attack speed of the UCAV to attack the enemy aircraft, and the values are shown as follows

V_{*} = {\begin{matrix} V_{e} + (V_{max} - V_{e}) (1 - e^{(\frac{R_{g} - R}{R_{g}})}), & R > R_{g} \\ V_{e}, & R \leq R_{g} \end{matrix}

where $V_{max}$ means the maximum speed and $V_{e}$ means the speed of the enemy aircraft.

Same as the membership function of distance parameter, in order to enhance the robustness of the UCAV decision, the membership function $η_{H} (Δ z)$ of height parameter can be defined as

η_{H} (Δ z) = {\begin{matrix} 1, & h_{s} \leq Δ z \leq h_{s} + σ_{h} \\ e^{(- \frac{{(Δ z - h_{s})}^{2}}{2 σ_{h}^{2}})}, & Δ z < h_{s} \\ e^{(- \frac{{(Δ z - h_{s} - σ_{h})}^{2}}{2 σ_{h}^{2}})}, & Δ z > h_{s} + σ_{h} \end{matrix}

(16)

In the above equation, $h_{s}$ represents the best attack height of the UCAV to attack the enemy aircraft, $Δ z = z_{u} - z_{e}$ stands for the height difference between the UCAV and the enemy aircraft, and $σ_{h}$ is the standard deviation of best attack height.

As can be seen from the membership function of the above four parameters, when the four membership function values are gradually approaching 1, the UCAV is in the taking position and attack situation. If they are approaching 0, the UCAV is in the attacked situation.

In conclusion, the general situation assessment function of the UCAV is represented as follows

f (η_{A}, η_{R}, η_{V}, η_{H}) = \sum_{i = 1}^{4} w_{i} η_{x}

(17)

where $w_{i}$ means the weight with subscript $x \in {A, R, V, H}$

UCAV robust maneuvering decision method based on the statistics theory

The UCAV MIN-MAX decision method based on the fuzzy logic was put forward by the literature¹⁸ and is further improved in this article. The decision method used in the literature cannot ensure that the membership functions of four situation parameters keep increasing monotonically during the decision process, and eventually converge to 1.

Aiming at the problems of the timeliness and accuracy of information, we improve the membership function of situation evaluation during the decision process on the basis of the MIN-MAX decision method, in order to make the situation function have a certain insensitivity to air combat situation change and then turn into the robustness of decision result. However, statistical theory is applied to make the maneuvering decision process fully consider the combined action of situation parameters during the decision process. Thus, it can be ensured that tUCAV will eventually reach the situation dominant area in the autonomous air combat. Based on the statistical theory, the specific process of decision method is shown as follows:

Step 1. Based on the information of the UCAV and the enemy aircraft at the current time t, send the control command of all the actions in the maneuver library to the particle model for maneuver trial.

Step 2. All the possible locations of the UCAV in the next stage are obtained through step 1; resolve the situation at each position. Thus, the set can be obtained as below

Q_{i}^{t + Δ t} = {η_{A}^{i, t + Δ t} (α), η_{R}^{i, t + Δ t} (R), η_{H}^{i, t + Δ t} (z_{u}), η_{V}^{i, t + Δ t} (V)}

where i means the serial number of maneuver, and the membership value set of the situation parameters conforms to all the maneuvers as below

\begin{array}{l} Q^{t + Δ t} = {Q_{1}^{t + Δ t}, Q_{2}^{t + Δ t}, \dots, Q_{i}^{t + Δ t}, \dots, Q_{n}^{t + Δ t}}, \\ i = 1, 2, 3, \dots, n \end{array}

Step 3. Solve the mean and standard deviation of $Q_{i}^{t + Δ t}$ corresponding to the maneuver i. Choosing the maneuver with higher mean can ensure that the membership value of situation parameters converges to the dominant area. Choosing the maneuver with small standard deviation can make the degree of membership at all air combat situation parameters gather as much as possible, thus obtaining the binary array that consists of $m_{i}^{t + Δ t}$ and $s_{i}^{t + Δ t}$

{MS}_{i}^{t + Δ t} = (m_{i}^{t + Δ t}, s_{i}^{t + Δ t})

For $i = 1, 2, 3, \dots, n$ , the set $M Q^{t + Δ t}$ is constituted, $M Q^{t + Δ t} = {{MS}_{i}^{t + Δ t}}$ . The element with the maximum expectation is chosen, and its corresponding maneuver is the executed tactical action. If the number of the maximal element is greater than 1, the corresponding maneuver of the corresponding smallest standard deviation in these elements is chosen as the executed tactical action.

Step 4. Update time, and return to step 1.

However, from the numerical simulation experiment, we find that when targets fly with non-maneuvering, the UCAV maneuvering decision can find out the maneuver with robustness and the optimality in time. When targets fly with maneuvering, the above method still cannot response to the enemy maneuver behavior timely. In order to solve this problem, we put forward a prediction method of adaptive variable weight to predict the state of the enemy aircraft on the basis of the above decision method. The statistical principle decision method is adopted for maneuver on the basis of the prediction results, to further enhance the real-time performance of the algorithm.

The UCAV robust maneuvering decision based on the reachable set theory at prediction target state

Reachable set calculation method of the enemy aircraft state

Reachable set is the set of all states in the system under specific constraints.²⁶ It can be divided into forward reachable set and backward reachable set. This article chooses the forward reachable set according to the characteristics of the air combat. Assuming that the dynamics equation of the enemy aircraft motion is expressed by the following equation

\overset{\cdot}{x} = f (x, u, d)

(18)

where x means the state of the enemy aircraft, u refers to the controlled variable of the enemy aircraft, and d shows the disturbance of the enemy aircraft system. We assume that the enemy aircraft system works under the ideal condition.

It is unrealistic to calculate the forward reachable set at one point using all the control variables of the enemy aircraft because of the real time and high dynamics of air combat. Thus, considering the aims of reachable set is to predict the state of the enemy, there is no need to traverse all the values of controlled variables. In this article, we select some representative element such as maximum acceleration, maximum deceleration, and maximum overload for calculation.

Adaptive variable weight more multi-state prediction method

The UCAV mainly focuses on three aspects of state prediction of the enemy aircraft. The first aspect is the probability for the enemy aircraft to continue to fly under the current state. The second aspect is the probability for the enemy aircraft to detect the UCAV and take attack strategy. The third is the probability that the UCAV cannot estimable the strategy of enemy aircraft for the next moment. In terms of the three aspects as above, it is proposed to utilize the weight instead of probability and design the adaptive adjustment mechanism of the weight.

Let the enemy aircraft’s forward reachable set be $Frs {x_{n}}$ , $i = 1, 2, \dots, n$ , which corresponds to the first aspect, and the state closest to the current state in the Frs is extracted and expressed as $x_{l}^{keep}$ . Corresponding to the second aspect, according to the membership function of position parameter (see equation (10)), the elements in the Frs are evaluated, the membership evaluation value of each element is recorded in the set ${μ_{n}}$ , the element with the lowest value is selected, and the corresponding element in the reachable set is denoted as $x_{m}^{threat}$ . Corresponding to the third area, the following formula is used to obtain the equivalent element of $x_{*}^{eq}$

x_{*}^{eq} = \frac{\sum_{i = 1}^{n} x_{i} * μ_{i}}{\sum_{i = 1}^{n} μ_{i}}

(19)

Three extracted state elements $(x_{l}^{keep}, x_{m}^{threat}, and x_{*}^{eq})$ are brought into the following formula to predict the state of the enemy aircraft

x^{pre} = ω_{1} x_{*}^{eq} + ω_{2} x_{l}^{keep} + ω_{3} x_{m}^{threat}

(20)

where $x^{pre}$ represents the predictive state of the enemy aircraft at $t_{k + n Δ t}$ and $ω$ stands for the weight, $ω_{1} + ω_{2} + ω_{3} = 1$ .

Assuming that the state of the enemy at previous time ( $t_{k - 1}$ time) is $x_{t_{k - 1}}$ and the state of the current time ( $t_{k}$ time) is $x_{t_{k}}$ , the prediction state sub-item that get from the forward reachable set Frs of the enemy aircraft at $t_{k - 1}$ time is $x_{t_{k}, l}^{keep}, x_{t_{k}, m}^{threat}$ . The calculation formula of their close degree and the current state $x_{t_{k}}$ is

\begin{matrix} λ_{max} = max {f_{A} (x_{t_{k}}^{*}, x_{t_{k}}), f_{R} (x_{t_{k}}^{*}, x_{t_{k}}), f_{H} (x_{t_{k}}^{*}, x_{t_{k}}), f_{V} (x_{t_{k}}^{*}, x_{t_{k}})} \\ λ_{min} = min {f_{A} (x_{t_{k}}^{*}, x_{t_{k}}), f_{R} (x_{t_{k}}^{*}, x_{t_{k}}), f_{H} (x_{t_{k}}^{*}, x_{t_{k}}), f_{V} (x_{t_{k}}^{*}, x_{t_{k}})} \end{matrix}

(21)

where

\begin{matrix} f_{A} (x_{t_{k}}^{*}, x_{t_{k}}) = | η_{A}^{x_{t_{k}}^{*}} (α) - η_{A}^{x_{t_{k}}} (α) |, \\ f_{R} (x_{t_{k}}^{*}, x_{t_{k}}) = | η_{R}^{x_{t_{k}}^{*}} (R) - η_{R}^{x_{t_{k}}} (R) | \\ f_{H} (x_{t_{k}}^{*}, x_{t_{k}}) = | η_{H}^{x_{t_{k}}^{*}} (Δ z) - η_{H}^{x_{t_{k}}} (Δ z) |, \\ f_{V} (x_{t_{k}}^{*}, x_{t_{k}}) = | η_{V}^{x_{t_{k}}^{*}} (V) - η_{V}^{x_{t_{k}}} (V) | \end{matrix}

(22)

In the above two equations, * denotes keep and threat; $f_{A} (x_{t_{k}}^{*}, x_{t_{k}})$ , $f_{R} (x_{t_{k}}^{*}, x_{t_{k}})$ , $f_{H} (x_{t_{k}}^{*}, x_{t_{k}})$ , and $f_{V} (x_{t_{k}}^{*}, x_{t_{k}})$ denote the close degree of the prediction state of $x_{t_{k}, l}^{keep}$ or $x_{t_{k}, m}^{threat}$ with the current state of $x_{t_{k}}$ for four situation parameters, namely, the four situation parameters with absolute value of the membership degree difference under two states.

Let the maximum and minimum values of the close degree at the prediction state of $x_{t_{k}, l}^{keep}$ or $x_{t_{k}, m}^{threat}$ with the current state $x_{t_{k}}$ are $λ_{t_{k}, max}^{keep}$ , $λ_{t_{k}, min}^{keep}$ , $λ_{t_{k}, max}^{threat}$ , and $λ_{t_{k}, min}^{threat}$ . Let the threshold value of the close degree is $q_{λ}$ , the self-adaptive adjustment strategy of weight $ω$ is as follows

{\begin{matrix} ω_{2} = {\begin{matrix} ω_{2} + Δ ω, λ_{t_{k}, max}^{keep} \leq q_{λ} \\ ω_{2} - Δ ω, (λ_{t_{k}, min}^{keep} > q_{λ}) \cap (ω_{2} \geq Δ ω) \\ ω_{2}, others \end{matrix} \\ ω_{3} = {\begin{matrix} ω_{3} + Δ ω, λ_{t_{k}, max}^{threat} \leq q_{λ} \\ ω_{3} - Δ ω, (λ_{t_{k}, min}^{threat} > q_{λ}) \cap (ω_{3} \geq Δ ω) \\ ω_{3}, others \end{matrix} \\ ω_{1} = {\begin{matrix} ω_{1} - Δ ω, ((λ_{t_{k}, max}^{threat} \leq q_{λ}) \cup (λ_{t_{k}, max}^{keep} \leq q_{λ})) \\ \cap (ω_{1} \geq Δ ω) \\ ω_{1} + Δ ω, (λ_{t_{k}, min}^{threat} > q_{λ}) \cap (λ_{t_{k}, min}^{keep} > q_{λ}) \\ ω_{1}, others \end{matrix} \end{matrix}

(23)

where $Δ ω$ denotes the unit increment of weight.

In order to guarantee the normalization of the weight, the weight is normalized after the formula is updated, and then is substituted for the enemy aircraft state prediction. When the enemy aircraft flies under the current state, the $ω_{2}$ value will increase. When the enemy aircraft continues to make maneuvering preparation to conduct of attack on the UCAV, the $ω_{3}$ value will rise. When the enemy aircraft does not fly under the current state and does not conduct the attack maneuvering, the $ω_{1}$ value will increase. By observing the change trend of weight values, the state and intention of the enemy aircraft can be judged.

Decision process

Based on the multi-state prediction of adaptive variable weight of the enemy aircraft, combined with the statistics principle, the UCAV maneuvering decision method based on the combination of reachable set target state prediction and the principle of statistics is put forward. The decision process is shown in Figure 7.

Figure 7.

Flow chart of maneuvering decision based on the target state prediction.

Time complexity analysis

The time complexity of our method can be estimated as follows:

In the part of initialization, the time complexity is $T (n) = O (1)$ .

In the part of prediction, the time complexity is $T (n) = O (n^{2})$ .

In the part of decision-making, the time complexity is $T (n) = O (n^{2})$ .

In other parts, the computational complexity is rather simple, which can be neglected.

To summarize, the overall time complexity of our method is $T (n) = O (n^{2})$ .

Simulation analysis

This section carries out the simulation comparison between the UCAV maneuvering decision method based on the MIN-MAX maneuvering and the UCAV maneuvering decision method based on the multi-state prediction of reachable set target. The performance of the proposed method is verified in this article.

The settings of general simulation parameter are presented as follows: $τ_{x} = 0.17$ , $τ_{z} = 1.17$ , $ξ = 0.7$ , the best attack distance $R_{g}$ of the UCAV to the enemy is set as 2500 m, and the standard deviation $σ$ is 500 m. The maximum and minimum flight speed of the enemy aircraft and the UCAV is 406 and 90 m/s, respectively. The best height difference $h_{s}$ of the UCAV to attack the enemy is 0 m. the standard deviation $σ_{h}$ is 100 m. the minimum and maximum flight height limit is 1000 and 20,000 m, respectively. Without considering the threat of the enemy air defense firepower on the ground, the weight is $w_{1} = w_{2} = w_{3} = w_{4} = 0.25$ and controlled variables are $n_{x_{com}} \in [0, 2]$ , $n_{z_{com}} \in [0, 10]$ , and $ϕ_{com} \in [- π, π]$ . When the UCAV simulation starts, the flight action is set as the straight and level flight. The condition of the UCAV taking position and attack mode is when $η_{A} (α) \geq 0.9, η_{R} (R) \geq 0.9, η_{H} (Δ z) \geq 0.9, and η_{V} (V) \geq 0.8$ are true, the simulation terminates immediately.

Considering the radar-cross section (RCS) from broadside and tail of stealth target is larger, the superior situation description in the modified membership function of situation parameters is changed to be when $α_{u} \geq 150^{o} \cap α_{e} \geq 90^{o}$ , the UCAV reaches the advantageous situation of position. The achieved condition of the UCAV taking position in the simulation is modi-fied as $η_{A} (α) \geq 0.625, η_{R} (R) \geq 0.8, η_{H} (Δ z) \geq 0.7, and η_{V} (V) \geq 0.5$ .

The initial value of the enemy aircraft prediction state weight is set as $ω_{1} = 0.3$ , $ω_{2} = 0.4$ , $ω_{3} = 0.3$ . The attacking mode is set to flank attack and stern attack. When $α_{e} \leq 45^{o}$ and the distance between both is $R \leq 4 km$ , the UCAV falls into the attack range of the enemy. The simulation analysis is conducted by setting the enemy aircraft under non-maneuvering and maneuvering conditions.

Case 1: UCAV encounters the enemy from ahead, and the enemy aircraft flies straightly

The initial flight status of the enemy aircraft is as follows: position (3000, 3000, 3000), speed of 204 m/s, track angle 0°, and course angle −135°. Initial flight state of the UCAV is as follows: the location (0, 0, 2700) m, speed of 250 m/s, track angle of 0°, and course angle of 45°. In order to verify the proposed method under the situation where the enemy flies without maneuvering, we assume that the enemy aircraft does not react to the attack action.

As shown in Figures 8 and 9, the black line refers to the trajectory prediction of the enemy. The red and blue lines alternate in the figure means the trajectories of both aircrafts correspond to each time quantum.

Figure 8.

Decision simulation based on the proposed method.

Figure 9.

Simulation of the decision based on the MIN-MAX method.¹⁸

Figures 10 and 11 show the membership function curve of situation parameters under the non-maneuvering condition of the enemy aircraft. The UCAV can reach the conditions of taking position in 18 s with the proposed method. By contrast, it takes longer time (22 s) to reach the taking position with the MIN-MAX maneuvering decision method.

Figure 10.

Membership function curve of the proposed method.

Figure 11.

Membership function curve of the MIN-MAX maneuvering decision method.¹⁸

As can be seen in Figure 12, with the proposed maneuvering decision method in this article, the UCAV achieves the direction advantage at 18 s. The four situation parameters have exceeded the conditions of threshold setting, realizing the position taking and the simulation is terminated. By contrast, with the decision method based on the MIN-MAX maneuvering, the UCAV realizes the direction alignment at 20 s. Combining Figure 11 with Figure 12, we can see that the membership parameter of height fails to meet the threshold condition until 25 s. By comparison, results verify the rapidity and superiority of the proposed method in this article.

Figure 12.

Comparison of relative direction situation of the two decision methods: (a) maneuvering decision method based on the target state prediction and (b) maneuvering decision method based on the MIN-MAX maneuvering.

Figure 13 shows the trend of weight change with the adaptive adjustment ability in the target state prediction decision method. As can be seen from the figure, the weight $ω_{2}$ value increases constantly, while the other two weight values decrease continuously. It means that the UCAV has recognized that the enemy keeps flying in the current mode without any attacking maneuvering. It is consistent with the set constant linear flight mode of the enemy aircraft. Thus, the judgment of the UCAV is accurate.

Figure 13.

The eight change map.

Case 2: The UCAV encounters the enemy from ahead, both side carry out the attacking maneuvering

The initial flight state of the enemy aircraft is set as follows: position (2500, 2500, 3000) m, speed of 204 m/s, track angle 0°, and course angle 90°. The UCAV initial flight state is as follows: position (2500, 1500, 2700) m, speed of 250 m/s, track angle 0°, and course angle 90°. Considering the simulation duration constraints, the initial position of the enemy aircraft is set to be closer to that of the UCAV, and the simulation results of the two methods are shown in the two figures as follows.

The three-dimensional (3D) simulation results of Figures 14 and 15 show that the maneuvering decision method based on the target state prediction makes the combat time of the UCAV with the enemy aircraft longer than the decision method based on the MIN-MAX maneuvering. The black line refers to the trajectory prediction of the enemy. The red and blue lines alternate in the figure means the trajectories of both aircrafts correspond to each time quantum.

Figure 14.

Simulation results based on the proposed method.

Figure 15.

Simulation results based on the MIN-MAX maneuvering decision method.¹⁸

Figure 16 shows the comparison of the situation parameter change curve with membership functions based on the two methods. The thick curve in the figure shows the results of the decision method based on the MIN-MAX maneuvering. At the beginning of the simulation, there is no big difference between two decision methods, and the UCAV is not in the passive situation. However, as time goes on, the enemy aircraft begins to carry out the attacking maneuvering, making drastic changes in situation. The decision method based on the MIN-MAX maneuvering stops the simulation at 3 s, and the decision method based on the prediction simulation of target state stops the simulation at 7 s. In the simulation process, although the membership degree of the position parameter shows a short increasing trend, the general trend reduces constantly. It shows that the UCAV situation gets worse, and the conclusion can be verified from Figure 17.

Figure 16.

Comparison of the membership function curve.

Figure 17.

Comparison of the relative direction situation experienced by two decision methods.

Figure 17 shows the comparison of the relative direction of the situation. The thick line in the figure is the corresponding direction situation curve of the decision method based on the MIN-MAX maneuvering. According to the constraints set by the simulation, the decision result of decision method based on the MIN-MAX maneuvering makes the relative direction situation of the UCAV worse at 3 s. The UCAV enters into the attack direction of the enemy aircraft. The result of the decision method based on the target state prediction is slightly better than the former one, and the subsequent decision result makes this trend to enlarge. However, the UCAV falls into the attack range of the enemy aircraft due to the platform gap at 7 s.

Figure 18 shows the trend of weight change in the target state prediction decision method. As can be seen from the figure, the weight $ω_{2}$ decreases constantly means the enemy is maneuvering constantly, $ω_{3}$ increased first and then decreased means the enemy carried out an attacking maneuvering at the beginning and then carried out a non-attacking maneuvering.

Figure 18.

Weight value change trend of target state prediction.

Case 3. The enemy evade and the UCAV attack

When the enemy aircraft (such as the stealth bombers) with stealth performance attacks the ground target, it would try to avoid entanglement with fighter jets, and often chooses the quickest and smallest maneuvering to remain hidden and avoid attack. Under this context, the initial conditions of the simulation are set up as follows: the initial position of the enemy aircraft (2000, 2000, and 3000), speed of 204 m/s, track angle of 0°, and course angle of 90°. The initial flight state of the UCAV is as follows: location (0, 0, and 2900) m, speed of 200 m/s, track angle 0°, and course angle 0°. The simulation results are shown in Figures 19 and 20.

Figure 19.

Decision simulation result based on the proposed method.

Figure 20.

Simulation result based on the MIN-MAX maneuvering decision method.¹⁸

From the 3D simulation results, both the methods can make the UCAV turn right and climb to attack the enemy aircraft, as the enemy aircraft will choose the appropriate maneuver in real time according to the situation; during the process of simulation, there is a slight difference between the enemy aircraft tracks in Figures 19 and 20. According to Figure 19, there is little difference between the predicted state of the enemy and the actual state of the enemy. It shows that the state prediction of the enemy aircraft by the method is accurate.

According to the membership function curve of situation parameters as shown in Figures 21 and 22, it takes only 8 s to realize the taking attack position with the proposed method, while it needs longer time (10 s) using the MIN-MAX maneuvering decision method. The global combat situation based on the proposed method is much better than that based on the MIN-MAX maneuvering decision method.

Figure 21.

Membership function curve based on the proposed method.

Figure 22.

Membership function curve based on the MIN-MAX maneuvering decision method.¹⁸

Conclusion

This article proposes a novel maneuvering decision-making method that combines with a mechanism of target intention prediction. The basic maneuver library is extended so that the UCAV can perform the maneuvers that other typical maneuver libraries cannot achieve. The method is capable of overcoming the uncertainty which is brought by the incomplete information of the enemy. The reachable set theory and adaptive adjustment mechanism of the target state weight are used in the target intention prediction so that the real-time ability is promoted. Simulations verify that the method can effectively forecast the general location of the enemy, short the time of taking position and attacking of the UCAV when the enemy aircraft evades, and extend the UCAV survival time when the enemy aircraft attacks.

In future works, first, the presentation of this article has preliminarily assumed that all the flight states of the enemy aircraft are accurately measurable. However, the noise will inevitably appear in the system, so the noise-perturbed situation is needed for applications. Second, since the state of the enemy is hard to be acquired, the reachable set calculation of the enemy aircraft in the article is based on the parameters of the UCAV. Thus, difference may exist during the reachable set calculation process, so there is still a huge space for development of the reachable set calculation method and this will be one of our future studies.

Footnotes

Academic Editor: Gang Chen

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

References

Guierrez

Vachtsevanos

Heck

. An approach to the adaptive mode transition control of unmanned aerial vehicles. In: Proceedings of the American control conference, Denver, CO, 4–6 June 2003. New York: IEEE.

Zhang

Wei

Autonomous tactical decision-making of UCAVs in air combat. Dianguang Kongzhi 2012; 19: 92–96.

Gao

SY.

Research on expert system and decision support system for multiple air combat tactical maneuvering. Syst Eng: Theory Pract 1999; 19: 76–79.

Oshman

Arad

Differential-game-based guidance law using target orientation observations. IEEE T Aero Elec Sys 2006, 42: 316–326.

Leitmann

(ed.). Multicriteria decision making and differential games. New York: Springer, 2013.

Huang

Ding

Zhang

. Automation-assisted capture-the-flag: a differential game approach. IEEE T Contr Syst T 2015; 23: 1014–1028.

Jie

Linping

Changqiang

A synthesized tactical gray-rough decision-making method for UCAV based on extended incomplete information. Acta Armament 2010; 31: 1279–1284.

Liu

Meng

. Research on beyond visual range target allocation and multi-aircraft collaborative decision-making. In: Proceedings of the 25th Chinese control and decision conference (CCDC), Guiyang, China, 25–27 May 2013, pp.586–590. New York: IEEE.

Eisa

Improving group decision support systems using rough set [J]. Int J Comput Appl 2013; 69(2).

10.

Kochenderfer

Amato

Reynolds

HJD

. Decision making under uncertainty: theory and application. Cambridge, MA: MIT Press, 2015.

11.

Duan

Zhao

UCAV situation assessment based on fuzzy rules and dynamic ant colony-Bayesian network. CAAI Trans Intell Syst 2013; 2: 7.

12.

Chen

Zhang

Cao

Autonomous intelligent decision-making system based on Bayesian SOM neural network for robot soccer. Neurocomputing 2014; 128: 447–458.

13.

Ernest

Carroll

Schumacher

. Genetic fuzzy based artificial intelligence for unmanned combat aerial vehicle control in simulated air combat missions. J Def Manag 2016; 6: 144.

14.

Duan

Wei

Dong

Multiple UCAVs cooperative air combat simulation platform based on PSO, ACO, and game theory. IEEE Aero El Sys Mag 2013; 28: 12–19.

15.

Zhang

Zhou

. Decision-making for air combat maneuvering based on variable weight pseudo-parallel genetic algorithm [J]. Flight Dynamics 2012; 30: 470–474.

16.

Holsapple

Chandler

Baker

. Autonomous decision making with uncertainty for an urban intelligence, surveillance and reconnaissance (ISR) scenario. In: Proceedings of the AIAA guidance, navigation and control conference, Honolulu, HI, 18–21 August 2008, pp.1–14. Reston, VA: AIAA.

17.

Sun

Tsai

Lee

. The study on intelligent advanced fighter air combat decision support system. In: Proceedings of the IEEE international conference on information reuse and integration, Waikoloa, HI, 16–18 September 2006, pp.39–44. New York: IEEE.

18.

Virtanen

Karelahti

Raivio

Modeling air combat by a moving horizon influence diagram game. J Guid Control Dynam 2006; 29: 1080–1091.

19.

Jo Son

Wan Kim

. Torpedo evasion simulation of underwater vehicle using fuzzy logic based tactical decision making in script tactics manager. Expert Syst Appl 2012; 39: 7995–8012.

20.

Narayana Rao

Sudesh

Kashyap

. Situation and threat assessment in BVR combat. In: Proceedings of the AIAA guidance, navigation, and control conference, Portland, OR, 8–11 August 2011, pp.1–6. Reston, VA: AIAA.

21.

Xiaoguang

Research on close air combat modeling of differential games for unmanned combat air vehicles. Acta Armament 2012; 33: 1210–1216.

22.

Weight

Li.

Analysis on the air combat strategies for fighterst. Acta Armament 2013; 30: 48–52.

23.

Dong

Trial input method and own-aircraft state prediction in autonomous air combat. J Aircraft 2012; 49: 947–954.

24.

Dong

Huang

Visual perception-based target aircraft movement prediction for autonomous air combat. J Aircraft 2014; 52: 538–552.

25.

Austin

Carbone

Hinz

. Game theory for automated maneuvering during air-to-air combat. J Guid Control Dynam 1990; 13: 1143–1149.

26.

Yinping

. Reachable set estimation for neural networks with polytopic uncertainties [D]. Tianjin University, Tian Jin, China, 2012.