A new algorithm for considering green communication and excellent sensing performance in cognitive radio networks

Abstract

Multi-node cooperative sensing can effectively improve the performance of spectrum sensing. Multi-node cooperation will generate a large number of local data, and each node will send its own sensing data to the fusion center. The fusion center will fuse the local sensing results and make a global decision. Therefore, the more nodes, the more data, when the number of nodes is large, the global decision will be delayed. In order to achieve the real-time spectrum sensing, the fusion center needs to quickly fuse the data of each node. In this article, a fast algorithm of big data fusion is proposed to improve the real-time performance of the global decision. The algorithm improves the computing speed by reducing repeated computation. The reinforcement learning mechanism is used to mark the processed data. When the same environment parameter appears, the fusion center can directly call the nodes under the parameter environment, without having to conduct the sensing operation again. This greatly reduces the amount of data processed and improves the data processing efficiency of the fusion center. Experimental results show that the algorithm in this article can reduce the computation time while improving the sensing performance.

Keywords

Cognitive networks node selection spectrum sensing machine learning

Introduction

Developments in wireless communication technology have increased the need for spectrum resources, which are currently limited.^1,2 To address this problem, cognitive radio networks have been proposed to improve existing spectrum resources.^3,4 In this context, spectrum sensing technology is the basic link between cognitive radio networks.⁵

Sensing nodes need to quickly and accurately perform spectrum sensing in order to efficiently utilize the idle frequency band without interfering with the primary user.⁶ This presents another issue; due to the impacts of path loss, shadow fading, and hidden terminals, it is difficult for a single sensor node to accurately detect the primary user’s status.^7,8 Nevertheless, cooperative sensing can effectively overcome these impacts by fusing detection information from multiple nodes in different geographical locations.⁹ In centralized cooperative sensing, there is a special data fusion center in the cognitive network. This collects local perception results from the nodes participating in cooperative perception, judges the current usage of authorized bands, and then broadcasts the decision results in the network or directly controls and schedules the perception nodes.¹⁰ The local sensing result collection process will increase the communication overhead when a large number of nodes are participating.^11,12 However, too many cooperative cognitive users (sensing nodes) will cause vast communication overhead. A proposed review method addressed this problem by examining the observed values of perceived nodes and only allowing nodes containing sufficient information to send their decision values (0 or 1) to the fusion center.¹³ This method reduces communication overhead but also reduces sensing performance. Aiming at the excessive overhead created by equal gain fusion,^14,15 a double threshold method is used to perform cooperative spectrum sensing in which each node adopts double threshold detection and sends the detected value directly to the fusion center, which then makes the judgment. Combined with the judgments of each node and its own judgment, the fusion center makes two judgments to determine whether the primary user exists. This employs a combination of soft and hard fusion methods, but performs two operations in the fusion center, thus increasing computational power. Furthermore, a hierarchical cooperative spectrum detection method has been proposed to solve the problem of excessive cooperative sensing overhead.^16,17 Here, when nodal observation values are between two thresholds, the region between these two thresholds is evenly divided into four parts; four different regions are thus quantized by 2 bits. The sensing nodes then send 2 bits of information to the fusion center. When compared with the equal gain fusion method, this reduces communication overhead. However, the sensing performance of hard fusion decreases.¹⁸

Spectrum sensing performance directly affects the throughput of cognitive users,¹⁹ and multi-node cooperative spectrum sensing is a common method to improve the performance of spectrum sensing.²⁰ However, when multi-node participates in cooperative sensing, the sensing data will increase greatly, and the fusion center cannot process a large number of data in time, which will cause delayed decision, which will affect the security of the main user or the throughput of cognitive users. In order to make a decision in time, a large number of data in the fusion center needs to be processed quickly, which requires the selection of some node data to reduce the number of processed data. Therefore, in order to achieve the real-time spectrum sensing, the fusion center needs to quickly fuse the data of each node. In this article, a fast algorithm of big data fusion is proposed to improve the real-time performance of the global decision. The algorithm improves the computing speed by reducing repeated computation. The reinforcement learning mechanism is used to mark the processed data. When the same environment parameter appears, the fusion center can directly call the nodes under the parameter environment, without having to conduct the sensing operation again. This greatly reduces the amount of data processed and improves the data processing efficiency of the fusion center. Experimental results show that the algorithm in this article can reduce the computation time while improving the perceived performance.

System model

This study designed an analog cognitive radio system consisting of a primary user (PU) and 16 nodes (cognitive users).²¹ Each node communicates with the fusion center through a channel, while the fusion center fuses information from each node to determine whether the primary user channel is idle. The system model is illustrated in Figure 1.

Figure 1.

Simulation scenario for spectrum sensing.

Derivation of the optimum local detection threshold

For spectrum sensing, every SU independently performs an energy detection process. The signal received by an SU is determined as follows²²

x (m) = hs (m) + v (m), m = 0, 1, \dots, M - 1

(1)

where $s (m)$ is a PU signal, $v (m)$ is the additive white Gaussian noise with zero mean and variance $σ_{v}^{2}$ , $m$ represents the serial number of the sampling point, $M$ is the number of samples, and h is channel gain. Suppose PU is absent (i.e. $s (m) = 0$ ); the hypothesis of free channel is then denoted by $H_{0}$ , whereas the hypothesis of busy channel is denoted by $H_{1}$ , as follows²³

\begin{matrix} H_{0} : x (m) = v (m), m = 0, 1, \dots, M - 1 \\ H_{1} : x (m) = hs (m) + v (m), m = 0, 1, \dots, M - 1 \end{matrix}

(2)

Assuming E is the average collected energy of an SU and it is expressed as follows²⁴

E = \frac{1}{M} \sum_{m = 0}^{M - 1} {| x (m) |}^{2}

(3)

If each SU can make its local decision according to single threshold $λ_{o}$ with probabilities of detection $P_{d}$ and false alarm $P_{f}$ , the equation is as follows²⁵

P_{d} = P {E \geq λ_{o} | H_{1}} = Q (\frac{λ_{o} - (σ_{s}^{2} + σ_{v}^{2})}{(σ_{s}^{2} + σ_{v}^{2}) / \sqrt{M / 2}})

(4)

P_{f} = P {E \geq λ_{o} | H_{0}} = Q (\frac{λ_{o} - σ_{v}^{2}}{σ_{v}^{2} / \sqrt{M / 2}})

(5)

where $σ_{s}^{2}$ is signal power and $σ_{v}^{2}$ is noise power; the complementary cumulative distribution function $Q (x)$ will be described as follows²⁶

Q (x) = \frac{1}{\sqrt{2 π}} \int_{x}^{\infty} e^{\frac{- t^{2}}{2}} dt

(6)

Assuming the presence probability of a PU is $P (H_{1}) = β$ , $0 < β < 1$ and the absence probability of the PU is $P (H_{0}) = 1 - β$ , then the probability of error detection ( $P_{e}$ ) is as follows²⁷

P_{e} = P (H_{1}) (1 - P_{d}) + P (H_{0}) P_{f} = β (1 - P_{d}) + (1 - β) P_{f}

(7)

where $P_{e}$ is the quadratic function of the $λ_{o}$ , we can derive $\partial^{2} P_{e} / \partial^{2} λ_{o} > 0$ . The optimal threshold $λ_{o}$ is obtained by $\partial P_{e} / \partial λ_{0} = 0$ and it is expressed as follows²⁸

λ_{o} = \frac{σ_{v}^{2} (1 + γ) [1 + \sqrt{1 + \frac{4 (2 + γ)}{M \cdot γ} \ln [\frac{(1 - β) (1 + γ)}{β}]}]}{2 + γ}

(8)

It is easy to obtain optimal threshold $λ_{o}$ if signal power $σ_{s}^{2}$ , noise power $σ_{v}^{2}$ and the SNR of the SU’s receiving terminal are known.

Estimating the optimal threshold

If a sensing cycle sampling point is $M = 2^{L}$ and L is a positive integer, M can be divided into the two following equal sections: (1) the previous $M / 2$ sampling points can be expressed with $x_{1}$ ; (2) the later $M / 2$ sampling points can be expressed with $x_{2}$ . $x_{1}$ and $x_{2}$ are given as follows

x_{1} = (x (0), x (1), \dots, x (M_{1} - 1))

(9)

x_{2} = (x (M_{1}), x (M_{1} + 1), \dots, x (M - 1))

(10)

where $M_{1} = M / 2$ , the average energy of each section is expressed as follows

E_{1} = \frac{1}{M_{1}} \sum_{m = 0}^{M_{1} - 1} {| x (m) |}^{2}

(11)

E_{2} = \frac{1}{M_{1}} \sum_{m = M_{1}}^{M - 1} {| x (m) |}^{2}

(12)

If $E_{1} < E_{2}$ , $x_{1}$ denotes AWGN, $x_{2}$ denotes the sum of signal and AWGN (i.e. $E_{1} = σ_{v}^{2}$ ), $E_{2} = σ_{v}^{2} + σ_{s}^{2}$ . If $E_{1} > E_{2}$ , $x_{2}$ denotes AWGN.

Let $E_{1}$ denote estimated noise power ${\hat{σ}}_{v}^{2}$ and $E_{2}$ denote the estimated sum of signal and noise powers ${\hat{σ}}_{v}^{2} + {\hat{σ}}_{s}^{2}$ ; estimated signal power is then ${\hat{σ}}_{s}^{2} = E_{2} - E_{1}$ . As such, the estimated SNR of received signal $x (m)$ is expressed as follows

\hat{γ} = \frac{{\hat{σ}}_{s}^{2}}{{\hat{σ}}_{v}^{2}} = \frac{E_{2} - E_{1}}{E_{1}}

(13)

The estimation of optimal thresholds ${\hat{λ}}_{o}$ is expressed as follows

{\hat{λ}}_{o} = \frac{{\hat{σ}}_{v}^{2} (1 + \hat{γ}) [1 + \sqrt{1 + \frac{4 (2 + \hat{γ})}{M \cdot \hat{γ}} \ln [\frac{(1 - β) (1 + \hat{γ})}{β}]}]}{2 + \hat{γ}}

(14)

Here, it should be noted that the conditions for the solution of equation (2) should be satisfied according to the following equation

1 + \frac{4 (2 + γ)}{M \cdot γ} \ln [\frac{(1 - β) (1 + γ)}{β}] \geq 0

(15)

Transform equation (15) type to the following

(1 + γ) \cdot e^{\frac{M \cdot γ}{4 \cdot (2 + γ)}} \geq \frac{β}{1 - β}

(16)

When the number of sampling points is $M = 1024$ and the SNR $γ = - 20 dB$ , after mathematical derivation, the value of $β$ must satisfy $β \in (0, 0.731)$ , and the equation (16) can be established. The range of $β$ values can be extended if the number of sampling points increases. When $M = 10^{5}$ and $γ = - 30 dB$ , then $β \in (0, 0.999)$ . This means that during a sensing cycle and when there is a sufficient number of sampling points (even in the case of low SNR and when the probability of the primary user signal is uncertain), equation (15) is established and there are solutions to equation (14).

Adaptive double energy thresholds

To avoid error judgments due to SNR variations in a received end, lower threshold $λ_{l}$ and upper threshold $λ_{h}$ are set based on optimal threshold $λ_{o}$ (Figure 2).²⁹ In Figure 2, d is the distance between $λ_{o}$ and lower threshold $λ_{l}$ or $λ_{o}$ and upper threshold $λ_{h}$ . The following is thus obtained

{\begin{matrix} λ_{l} = λ_{o} - d \\ λ_{h} = λ_{o} + d \end{matrix}

(17)

Figure 2.

Double threshold settings.

For weak PU signal detection, threshold $λ_{o}$ should be decreased. However, it should not be lower than $σ_{v}^{2}$ . Noise would otherwise be detected as a PU signal. However, $λ_{o}$ should not be larger than $σ_{v}^{2} + σ_{s}^{2}$ . The PU signal would otherwise miss detection. To assure a lower probability of false alarm and a higher probability of detection, we thus put a limiting range on threshold $λ_{o}$ , as follows

σ_{v}^{2} \leq λ_{o} - d

(18)

λ_{o} + d \leq σ_{v}^{2} + σ_{s}^{2}

(19)

Here, equations (18) and (19) gives the following

0 \leq d \leq \frac{1}{2} σ_{s}^{2}

(20)

Different from conventional double threshold settings, we introduced control parameter $ε$ to accurately fine-tune the double thresholds and define the following

d = ε σ_{s}^{2}

(21)

According to equation (20), $ε$ satisfies the following

0 \leq ε \leq 0.5

(22)

When $ε = 0$ , it is equivalent to a single threshold case. Thus, $λ_{l}$ and $λ_{h}$ can be rewritten as follows

{\begin{matrix} λ_{l} = λ_{o} - ε σ_{s}^{2} \\ λ_{h} = λ_{o} + ε σ_{s}^{2} \end{matrix}

(23)

Parameter $ε$ is an impact factor for double thresholds.

Quantization and coding based on adaptive double energy thresholds

We considered the cognitive radio network as shown in Figure 1. Here, each node communicates with the fusion center. This study assumed that the channel between the node and fusion center was perfect. $λ_{l, i}$ , $λ_{l, i}$ , $λ_{0, i}$ and $w_{i}$ ( $i = 0, 1, \dots, N - 1$ ) denote the upper threshold, lower threshold, optimal threshold, and weight of node $E_{i}$ , respectively.

Calculating bode weights

Assume $E_{i}$ is the average energy collected by ith node. First, if $E_{i}$ is not lower than upper threshold $λ_{h, i}$ and its weight equals 1, then the SU decides that the PU is present. Next, if $E_{i}$ is not larger than lower threshold $λ_{l, i}$ and its weight equals 0, then the node decides that the PU is absent. Finally, if $E_{i}$ is located between $λ_{h, i}$ and $λ_{l, i}$ and cannot determine whether the primary user is present, then upper threshold $λ_{h, i}$ is set as a comparison value. That is, $E_{i}$ is normalized by $λ_{h, i}$ and its weight is equal to the normalized result. The weight calculation is expressed as follows

{\begin{matrix} w_{i} = 1, E_{i} \geq λ_{h, i} \\ w_{i} = 0, E_{i} \leq λ_{l, i} \\ w_{i} = \frac{E_{i}}{λ_{h, i}}, λ_{l, i} < E_{i} < λ_{h, i} \end{matrix}

(24)

where $i = 0, 1, \dots, N - 1$ is index of SUs, after $N$ nodes performed local spectrum sensing, the $N$ weights form a set $θ = {w_{0}, w_{1}, \dots, w_{N - 1}}$ . Figure 3 shows the assigned fusion weights and cooperative spectrum sensing algorithm.

Figure 3.

Assigned weights and the cooperative spectrum sensing algorithm.

According to Figure 3, global sensing performance will change when the two nodal thresholds are altered. According to equation (24), both the weights of the SUs and global sensing performance will change when the two thresholds are altered. As such, it is highly important to select an optimal $ε$ to establish two proper thresholds, thus improving sensing performance. A grid search is conducted to obtain the best parameters for $ε$ in part 4. These are memorized through reinforcement learning strategies to obtain better sensing performance and higher sensing efficiency.

Cooperative spectrum sensing based on quantization and coding

After an SU obtains weight and encodes weight $w_{i}$ , where $i = 0, 1, \dots, N - 1$ , the coding rules are as follows: First, if $w_{i} = 1$ , is encoded as 1, and expressed as $C_{i, 1} = 1,$ then this will be sent to the fusion center, which denotes that an SU transmitted 1 bit of data and consumed 1 unit of energy (e.g. $e_{j} = 1$ ). Next, if $w_{i} = 0,$ is encoded as 0 and expressed as $C_{i, 2} = 0,$ then this also denotes that an SU sent 1 bit of data to the fusion center and consumed 1 unit of energy. Finally, if $0 < w_{i} < 1,$ $C_{i, 3} = d_{3} d_{2} d_{1}$ , where $d_{j}$ are integers (1 or 0) and $j = 1, 2, 3$ , when $2 \times w_{i} \geq 1$ , $d_{3} = 1$ , otherwise $d_{3} = 0$ , when $2 \times (2 \times w_{i} - d_{3}) \geq 1$ , $d_{2} = 1$ ; otherwise $d_{2} = 0$ , and when $2 \times (2 \times (2 \times w_{i} - d_{3}) - d_{2}) \geq 1$ , $d_{1} = 1$ ; otherwise $d_{1} = 0$ , which denotes that an SU sent 3 bits of data to the fusion center and consumed 3 units of energy.

The fusion center will decode for $C_{i}$ after receiving sensing results from an SU. Here, the decoding rules can be described as one of three types. First, if $C_{i} = 1$ , then the decoding data are expressed as $D_{i} = 1$ . Second, if $C_{i} = 0$ , then the decoding data are expressed as $D_{i} = 0$ . Third, if $C_{i} = d_{3} d_{2} d_{1}$ , then the decoding data are expressed as

D_{i} = \frac{d_{3} \times 2^{2} + d_{2} \times 2^{1} + d_{1} \times 2^{0}}{8}

The decoding rules can thus be expressed as follows

{\begin{matrix} C_{i} = 1, D_{i} = 1, \\ C_{i} = 0, D_{i} = 0, \\ C_{i} = d_{3} d_{2} d_{1}, D_{i} = \frac{d_{3} \times 2^{2} + d_{2} \times 2^{1} + d_{1} \times 2^{0}}{8}, \end{matrix}

(25)

where $0.125 \leq D_{i, 3} \leq 0.875$ , and its resolution ratio is 0.125; this can reflect that a node made a contribution for the cooperative spectrum sensing to match its weight.

The fusion center will use majority-rule fusion after completely decoding all data received from all SUs. The fusion expressed is as follows

ℜ = \sum_{i} D_{i}, i = 0, 1, \dots, N - 1

(26)

where ℜ is the fusion result, $i$ is the index of the nodes, and $D_{i}$ is the decoding data for the node. The fusion center compares ℜ and $N / 2$ to decide whether there is a signal from the PU. The expression of this decision is as follows

ℜ = \sum_{i = 0}^{N - 1} D_{i} {\begin{matrix} \geq \frac{N}{2}, H_{1} \\ < \frac{N}{2}, H_{0} \end{matrix}

(27)

The code algorithm summarizes the coding-based cooperative spectrum sensing.

Coding-based cooperative spectrum sensing algorithm
1: Initialization
calculate $E_{i}$ , $i = 0, 1, \dots, N - 1$ , estimate $\| σ_{v}^{2} \|$ , set N, M, $λ_{0, i}$ , calculate $λ_{l, i}$ and $λ_{h, i}$ according to (23);
2: Calculate the weights of all SUs
if $E_{i} \geq λ_{h, i}$ ; $w_{i} = 1$ ;
else if $E_{i} \leq λ_{l, i}$ ; $w_{i} = 0$ ;
else $w_{i} = \frac{E_{i}}{λ_{h, i}}$ ;
3: Encode the weights for all SUs
if $w_{i} = 1$ ; $C_{i} = 1$ ;
else if $w_{i} = 0$ ; $C_{i} = 0$ ;
else
if $2 \times w_{i} \geq 1$ ; $d_{3} = 1$ ;
else $d_{3} = 0$ ;
if $2 \times (2 \times w_{i} - d_{3}) \geq 1$ ; $d_{2} = 1$ ;
else $d_{2} = 0$ ;
if $2 \times (2 \times (2 \times w_{i} - d_{3}) - d_{2}) \geq 1$ ; $d_{1} = 1$ ;
else $d_{1} = 0$ ;
$C_{i} = d_{3} d_{2} d_{1}$ ;
4: Fusion center decoding for received data from N SUs
if $C_{i} = 1$ ; $D_{i} = 1$ ;
else if $C_{i} = 0$ ; $D_{i} = 0$ ;
else
$D_{i} = \frac{d_{3} \times 2^{2} + d_{2} \times 2^{1} + d_{1} \times 2^{0}}{8}$ ;
5: Fusion center fuses decoding data on the basis of (26).
6: Fusion center makes decision about PU according to (27).
7: Sensing end

PU: primary user.

For the note code algorithm, the transmission of all SUs combines 1 bit sent and 3 bits sent. As such, the algorithm can improve sensing performance while reducing communication overhead.

Reinforcement learning based on the grid search algorithm

Grid search algorithm

A grid search can be used to obtain optimal $ε$ according to the feedback of global $P_{d}$ . We conducted a grid search to train parameters in order to improve search efficiency. The trained parameters included SNR (−25 to 0 dB with step 1 dB) and $ε$ (0 to 0.5 with step 0.05). These were saved as a prior knowledge to a knowledge base. Spectrum sensing will then directly invoke optimal $ε$ under an SNR according to the prior knowledge. Figure 4 shows the grid search algorithm used for parameter training.

Figure 4.

Flowchart showing the reinforcement learning scheme based on the grid search algorithm.

In Figure 4, k and j stand for SNR and control parameter $ε$ , respectively; $(k_{n}, j_{n})$ is a prior knowledge group, while $j_{n}$ is optimal $ε$ under $k_{n}$ , $n = 1, 2, \dots, 26$ . If an SNR is newly appearing, then this algorithm will immediately train new parameters; these will be saved as prior knowledge in the knowledge base.

The grid search algorithm process is described as follows

1. When an SNR $κ_{i}$ appears, the fusion center will conduct a real-time search to find the optimal $ε_{i}$ and obtain the highest $P_{d}$ . It will then proceed to step 3, where $κ_{i}$ is the ith newly appearing SNR and $ε_{i}$ is the corresponding optimal control parameter. These results will be output when $P_{d}$ and $ε_{i}$ are returned. Furthermore, $ε_{i}$ and $κ_{i}$ will become a prior knowledge couple that the fusion center will then learn, for example

φ = f (ε_{i}, κ_{i})

(28)

where $i$ is a positive integer, $φ$ is a storage library, and $f (\cdot)$ is a learn function.

2. When SNR $κ_{i}$ is not newly appearing, the fusion center will utilize learned knowledge to directly select optimal $ε_{i}$ ; for example

ε_{i} = f^{- 1} (κ_{i}, φ)

(29)

where $f^{- 1} (\cdot)$ is a function reading knowledge from the storage library.

3. Under SNR $κ_{i}$ , the range of parameter $ε_{i}$ is divided into 10 equal grids by 11 grid points, $ε_{i, j}$ is the jth grid point $j \in {0, 1, 2, \dots, 10}$ . $Δ ε = 0.05$ is the searching step. The searching process is from $ε_{i, 0}$ to $ε_{i, 10}$ with step $Δ ε$ .

4. $P_{d, j}$ is $P_{d}$ of the jth grid point, when a $P_{d, j} = 1, j \in {0, 1, 2, . . ., 10}$ , stopping search, the $P_{d, j}$ and corresponding $ε_{i, j}$ are returned to 1. The search otherwise continues.

5. When the real-time search has been finished, highest probability $P_{d, \max}$ and corresponding optimal parameter $ε_{i, j}$ are returned to step 1, as follows

{\begin{matrix} P_{d, max} = max {P_{d, 0}, P_{d, 1}, \dots, P_{d, 10}} \\ ε_{i, j} = {f_{1}}^{- 1} (P_{d, max}) \end{matrix}

(30)

where ${f_{1}}^{- 1} (\cdot)$ is a function seeking $ε_{i, j}$ through $P_{d, max}$ .

6. End

Reinforcement learning based on the grid search algorithm

The learning process is as follows:

1. After the fusion center finishes executing the grid search, the obtained grid coordinates are represented as follows

A = [\begin{matrix} \begin{matrix} x_{1} \\ x_{2} \\ ⋮ \\ x_{40} \end{matrix} & \begin{matrix} y_{1} \\ y_{2} \\ ⋮ \\ y_{40} \end{matrix} \end{matrix}]

(31)

The first column in matrix $A$ represents the value of control parameter $ε$ , while the second column represents the value of signal-to-noise ratio $α$ . Each row represents the optimal control parameters found by executing the grid search algorithm in a specific radio environment.

2. The fusion center sends matrix $A$ and the matrix description to cognitive users.

3. The cognitive user can memorize the data from matrix $A$ ; the local detection threshold is set according to the data and is the best to be preserved. When the radio environment is consistent with the memory, then the optimal threshold in this environment can be directly invoked to perform the next spectrum sensing process.

4. In the case of a new radio environment, the fusion center must alter the range of $α$ values and re-execute the grid search algorithm (i.e. execute steps 1-3 again).

Experiments and evaluation

This study designed three groups of Monte Carlo simulation experiments to evaluate the performance of the cooperative spectrum sensing method, as follows: (1) a comparison of detection probabilities ( $P_{d}$ ) between traditional fusion methods and the new fusion method presented in this article, (2) under the same conditions, a comparison of probability of error ( $P_{e}$ ) between the used grid search algorithm and those unused in this article, (3) under the same conditions, a comparison of sensing speed between reinforcement learning and non-reinforcement learning, (4) verification of fast fusion algorithm, compare the processing time of using fast fusion algorithm with that of not using fast fusion algorithm. The Monte Carlo simulations were conducted under conditions involving path loss and additive Gaussian white noise.

The simulation experiments set the PU signal to a BPSK, bandwidth to 100 kHz, and sensing duration to 100 ms.³⁰ The PU was placed in the center of a 1000 × 1,000 m square and surrounded by 16 evenly distributed sensing nodes. The simulation scenario is shown in Figure 1. The probability of setting the channel of PU occupancy was $β = 50 %$ , while the transmitting power of the PU signal was 100 mW.³¹ Each node sampled 20 points; the noise power range was set to between 0 and 2 dB, while the path-loss exponent was 2.7, the standard deviation of the shadow was 5 dB, and the mean of the multipath Rayleigh fading was 1.^32–34

Figure 5 illustrates a comparison of detection probabilities ( $P_{d}$ ) between the traditional fusion methods and the new fusion method proposed in this article. The traditional methods used for comparison included AND, OR and Majority fusion methods. As seen in Figure 5, the detection probability ( $P_{d}$ ) obtained by the new fusion method was higher than that obtained by traditional methods. This is more obvious in cases involving low SNR because each sensing node uses double the threshold energy detection and calculates weights according to the signal energy received by itself. Such nodes can then make appropriate contributions to the spectrum sensing process based on their own weights. As such, the new fusion method more accurately reflects the actual roles of each node when compared to traditional methods.

Figure 5.

Comparison the traditional fusion methods and the new fusion method.

Figure 6 illustrates the comparison of probability of error ( $P_{e}$ ) between the grid search algorithm used in this study and other algorithms (i.e. fixed-single and fixed-double threshold algorithms). As seen in Figure 6, the grid search algorithm exhibited the lowest error probability during spectrum sensing. This is because the best detection threshold can be obtained through the grid search algorithm in any radio environment. The other two algorithms have fixed thresholds and can therefore not adapt to noise fluctuations. While the probability of error ( $P_{e}$ ) increases when SNR decreases in all cases, the grid search algorithm results in the smallest increase.

Figure 6.

A comparison of probability of error between the grid search algorithm and others.

Figure 7 shows a sensing speed comparison between reinforcement and non-reinforcement learning. As seen, reinforcement learning takes less sensing time than non-reinforcement learning under the same SNR conditions. This is because reinforcement learning can directly invoke detection thresholds in the same environment from the repository. If reinforcement learning is not used, then every spectrum sensing procedure requires a grid search algorithm to find the optimal threshold; this requires more sensing time. Sensing time decreases when SNR increases because the radio environment is simpler; with an increased signal-to-noise ratio, less information is stored and judgments are easier to make.

Figure 7.

A sensing speed comparison between reinforcement and non-reinforcement learning.

In order to verification of fast fusion algorithm, compare the processing time of using fast fusion algorithm with that of not using fast fusion algorithm. The experiments are all under the same number of nodes. In order to highlight the advantages of the fast algorithm proposed in this article, observe the data processing time under different node numbers. Table 1 shows the processing time at different nodes. The processing environment is MATLAB 7.0, and the computer configuration is Intel (R) Core (TM) i5-8500 CPU at 3.00 GHz, RAM is 8 GB, and 64-bit operation system.

Table 1.

compare the processing time of using fast fusion algorithm with that of not using fast fusion algorithm.

Number of nodes	Processing time using fast fusion algorithm (ms)	Processing time do not using fast fusion algorithm (ms)
5	0.32	0.38
10	0.56	0.64
15	0.72	0.79
20	0.84	0.92
25	0.95	1.04
30	1.02	1.16

It can be seen from Table 1 that the fast algorithm used by the fusion center can effectively reduce the data processing time, and the average processing time can be reduced by 18%. When the number of nodes is more, the advantage of fast algorithm in dealing with big data is more obvious. When the number of nodes is more than 30, the time of fast algorithm in dealing with data is less than that of not using fast algorithm in dealing with 25 nodes, which can be the advantage of fast algorithm in this article.

Conclusion

This article studies a new perceptual data fusion algorithm, which can process the perceptual data of each node quickly without delay. In the cognitive radio network, different nodes have different perception data due to different geographical location, and the contribution of each node’s perception data to cooperative perception is also different. The fusion center uses reinforcement learning mechanism to select cooperation nodes by identifying the sensing performance of node, which can reduce the processing data to a certain extent, and enable the fusion center to process quickly the data sent by each node will not cause decision delay. This greatly improves the throughput of cognitive users while protecting the primary users. The experimental results show that the big data fast fusion algorithm in this article can effectively reduce the data processing time, The average processing time of using fast algorithm is 18% less than that of not using fast algorithm. When the number of nodes is more than 30, the time of fast algorithm in dealing with data is less than that of not using fast algorithm in dealing with 25 nodes, which can be the advantage of fast algorithm in this article. Furthermore, the algorithm in this article can reduce the processing time of node data and improve the sensing performance at the same time and increase the throughput of cognitive users, which is of great significance. However, at present, only the fast algorithm of big data is implemented in the fusion center, but not the energy-saving algorithm in the node itself, which is the follow-up research goal.

Footnotes

Handling Editor: Zheng Chang

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This article was supported by the Natural Science Foundation of Hunan Province, China (grant nos 2019JJ40097 and 2019JJ40096), the Key Research and Development Projects of Science and Technology Department of Hunan Province (grant no. 2017NK2390), the Research Foundation of Education Bureau of Hunan Province, China (grant no. 17B107), the Research Foundation of Science and Technology Bureau of Yongzhou City, China (nos 2019YZKJ08 and 2019YZKJ10), and the construct program of applied characteristic discipline at the Hunan University of Science and Engineering. The authors would like to thank Editage () for English language editing.

ORCID iD

Tangsen Huang

References

Jing

Huang

. Blind recognition of binary BCH codes for cognitive radios. Math Prob Eng 2016; 2016: 1–6.

Xiong

Yao

, et al. Random, persistent and adaptive spectrum sensing strategies for multiband spectrum sensing in cognitive radio networks with secondary user hardware limitation. IEEE Access 2017; 5: 14854–14866.

Xiong

Yao

Ren

, et al. Multiband spectrum sensing in cognitive radio networks with secondary user hardware limitation: random and adaptive spectrum sensing strategies. IEEE Tran Wirel Commun 2018; 17: 3018–3029.

Jun

Min

, et al. Adaptive weight collaborative complementary learning for robust visual tracking. KSII Tran Inte Info Syst 2019; 13: 305–326.

, et al. Energy efficiency design for secure MISO cognitive radio network based on a nonlinear EH model. Math Prob Eng 2018; 2018: 1–7.

Liu

Zhao

, et al. Spectrum sensing based on maximum generalized correntropy under symmetric alpha stable noise. IEEE Tran Vehic Technol 2019; 68: 10262–10266.

Chen

Zhou

Xie

, et al. Joint spectrum sensing and resource allocation scheme in cognitive radio networks with spectrum sensing data falsification attack. IEEE Tran Vehic Technol 2016; 65: 9181–9191.

Park

Pawelczak

Cabric

. Performance of joint spectrum sensing and MAC algorithms for multichannel opportunistic spectrum access Ad Hoc networks. IEEE Tran Mob Comput 2011; 10: 1011–1027.

Khoshkholgh

Navaie

Yanikomeroglu

. Optimal design of the spectrum sensing parameters in the overlay spectrum sharing. IEEE Tran Mob Comput 2014; 13: 2071–2085.

10.

Zhang

Gao

, et al. Autonomous compressive-sensing-augmented spectrum sensing. IEEE Tran Vehic Technol 2018; 67: 6970–6980.

11.

Yang

Tsay

S-C

Wei

, et al. Remote sensing of cirrus optical and microphysical properties from ground-based infrared radiometric Measurements-part I: a new retrieval method based on microwindow spectral signature. IEEE Geosci Remo Sens Lett 2005; 2: 128–131.

12.

Zhang

Wen

, et al. Sensing nodes selective fusion scheme of spectrum sensing in spectrum-heterogeneous cognitive wireless sensor networks. IEEE Sens J 2018; 18: 436–445.

13.

Sun

Song

, et al. Spectrum sensing and the utilization of spectrum opportunity tradeoff in cognitive radio network. IEEE Commun Lett 2016; 20: 2442–2445.

14.

Nallanathan

, et al. Deep sensing for next-generation dynamic spectrum sharing: more than detecting the occupancy state of primary spectrum. IEEE Tran Commun 2015; 63: 2442–2457.

15.

Srikant

. Improving channel utilization via cooperative spectrum sensing with opportunistic feedback in cognitive radio networks. IEEE Commun Lett 2015; 19: 1065–1068.

16.

Sharifi

Musevi Niya

. Defense against SSDF attack in cognitive radio networks: attack-aware collaborative spectrum sensing approach. IEEE Commun Lett 2016; 20: 93–96.

17.

Feng

Chen

Cao

. “A joint PHY-MAC spectrum sensing algorithm exploiting sequential detection. IEEE Sig Proces Lett 2010; 17: 703–706.

18.

Sun

Chen

, et al. Permuted&filtered spectrum compressive sensing. IEEE Sig Proces Lett 2013; 20: 685–688.

19.

Xiong

, et al. Predecision for wideband spectrum sensing with sub-Nyquist sampling. IEEE Tran Vehic Technol 2017; 66: 6908–6920.

20.

Huang

, et al. Intelligent cooperative spectrum sensing via hierarchical dirichlet process in cognitive radio networks. IEEE J Selec Areas Commun 2015; 33: 771–787.

21.

Ismail

Mohamad

. Review on energy efficient opportunistic routing protocol for underwater wireless sensor networks. KSII Tran Inte Info Syst 2018; 12: 3064–3094.

22.

Chen

Zheng

Hou

, et al. Energy efficient design for OFDM-based underlay cognitive radio networks. Math Prob Eng 2014; 2014: 1–8.

23.

Chen

Zhang

. A joint scheduling and beamforming scheme for RoF-aided MC-SSN. IEEE Access 2019; 7: 29245–29252.

24.

Zhao

Lin

Zhou

, et al. Cascaded Mach–Zehnder interferometers with Vernier effect for gas pressure sensing. IEEE Photon Technol Lett 2019; 31: 591–594.

25.

Alibeigi

Taherpour

. Optimisation of secrecy rate in cooperative device to device communications underlaying cellular networks. IET Commun 2019; 13: 512–519.

26.

Luo

Zhu

. Full-duplex cognitive radio using guided independent component analysis and cumulant criterion. IEEE Access 2019; 7: 27065–27074.

27.

Yazicigil

Haque

Kinget

, et al. Taking compressive sensing to the hardware level: breaking fundamental radio-frequency hardware performance tradeoffs. IEEE Sig Proces Magaz 2019; 36: 81–100.

28.

Han

. Full-duplex-based control channel establishment for cognitive internet of things. IEEE Commun Magaz 2019; 57: 70–75.

29.

Huang

. A weighted cooperative spectrum sensing scheme based on dynamic double energy thresholds In cognitive radio networks. In: Proceedings of IEEE Global High Tech Congress on Electronics(GHTCE)Shenzhen, China, 17–19 November 2013, pp.201-204. New York: IEEE.

30.

Liu

, et al. Incentive mechanism based cooperative spectrum sharing for OFDM cognitive IoT network. IEEE Tran Net Sci Eng. Epub ahead of print 16 May 2019. DOI: 10.1109/TNSE.2019.2917071.

31.

Salah

Omer

Mohammed

. Spectral efficiency enhancement based on sparsely indexed modulation for green radio communication. IEEE Access 2019; 7: 31913–31925.

32.

Chen

Shi

Xiong

. Generalized real-valued weighted covariance-based detection methods for cognitive radio networks with correlated multiple antennas. IEEE Access 2019; 7: 34373–34382.

33.

Bhowmick

Roy

Kundu

. Performance of spectrum sensing scheme using double threshold energy detection in the presence of sensor noise. Int J Energy Info Commun 2012; 3: 75–84.

34.

Lee

Noda

Mizuno

, et al. Distributed temperature sensing based on slope-assisted Brillouin optical correlation-domain reflectometry with over 10 km measurement range. Electron Lett 2019; 55: 276–278.