Abstract
This paper investigates the traffic pattern prediction based on seasonal deviation and spectrum reallocation with multiple channel width in cognitive cellular networks. Compared to the existing approaches based on time series or classical statistic method, the binary exponential deviation offset prediction proposed in this paper focuses on the increment or decrement on every sampling point during an exponential offset period. Then the deviations will be revised at different levels in the next prediction process. The proposed approach is validated with some real end-user data from a WiFi network and simulation experiments. Based on such a precise prediction, we allocate the channels with different bandwidth to end-users according to diverse quality-of-service (QoS), which increases both the system's profits and actual spectrum utilization. The multidimensional bounded knapsack problem is introduced to divide channels, to which the proposed balance between value density and request probability strategy gets the approximate solution. The simulation experiment results show its good performance in not only utility but also spectrum utilization of the base-stations, especially when the resources are deficient.
1. Introduction
Cellular networks have succeeded in the last twenty years, but they are put to the test in rapidly increasing data traffic. The popularity of smart phones, which are not so much phones as hand-held computers, integrates several multimedia information services. In an open market report of Ericsson, it is described that the data traffic of cellular networks has doubled from the third quarter 2011 to the third quarter 2012, and mobile data traffic driven mainly by video is expected to grow with a CAGR (compound average growth rate) of around 50 percent in the time frame 2012–2018, which entails growth of around 12 times by the end of 2018 [1]. A study report of Juniper Research describes that about
Since the objective of cognitive radio (CR) technology is to reallocate spectrum resources reasonably and efficiently, the combination of these two technologies brings the promising prospect to cellular networks [3]. The idle spectrum will be circulated, and the cellular networks will obtain more resources than before.
The spectrum sharing in cognitive cellular (CGC) networks is a two-phase procedure. In the first phase, the CGC network gets spectrum resources from the primary users (PUs) through sensing [4] or borrowing [5] or through auctions [6–8]. In the second phase, the base-station (BS) of every CGC network cell allocates the spectrum to end-users.
In the allocation process, the BS should understand the traffic pattern, such as arrival rate, of end-users in order to plan spectrum allocation in its cell properly. In the researches of predecessors, PUs have higher priority to use spectrum as its owners and secondary users (SUs) will not be affected if they know when and how long the PUs will use it. So, the prediction for traffic pattern is one of the important contents in CR research field. A time series analysis model is adopted in [9] in which the user arrivals are regarded as time series. The authors of [10] estimate PUs' actions according to experimental statistics. In [11], the authors propose a prediction algorithm which exploits the periodicity of traffic process. These above studies focus on PU's traffic pattern estimation and got some outstanding achievements. However, compared to PUs which are often organizations or institutions, the end-users as individuals are more susceptible and fluctuant because of weather, festival, or other events. In such a situation, the traffic will present an obvious deviation. Considering the deviations adequately will make the traffic prediction for end-users more accurate.
After estimating the traffic actions of end-users, the BS allocates cognitive spectrum to them. At present, there are few studies on spectrum allocation in CGC networks particularly, but some studies in CR networks are developed. A distributed resource allocation method is proposed in [12], whose purpose is to balance the queue length and promote the fairness. The limitation is that the architecture of CGC network is centralized, and the spectrum allocation from a BS to end-users is planned by the BS rather than the end-users voluntarily. The resource allocation standard in [13] is to maximize the coverage of CR network through a power control method. Since in a CGC network the cell's range is certain, expanding the coverage does not absorb much attention. Moreover, the previous achievements are mainly concentrating on the technical property of CR networks. The commerciality of CGC networks, different from other common CR networks, has determined that it needs the characteristics of market-based operation. Since the economical benefits encourage a CGC network to value the end-user's requirements and devote itself to improve QoS, it is as important as the technical aspect. The better user experience is always one of the most significant goals of cellular networks. Such an absence is a shortage of current study on spectrum allocation in CGC networks.
The resources allocation in a CR network is summarized to a multidimensional knapsack problem and a greedy algorithm is introduced in [14]. Such a process is instructive for us to consider the spectrum allocation in CGC networks. However, the allocation in a CGC network is reduced to knapsack problem which is not only bounded but also multiple, and the mathematical model should be defined again. Furthermore, the density of value greedy algorithm only concerns the reward and cost of BS that ignores the volatility of end-user demands. So, an improvement is necessary too.
Our contributions are as follows.
A binary exponential deviation offset (BEDO) method is proposed to predict the arrival rate of end-users. The increment or decrement from predicted value to sampled data is regarded as a deviation. Certain external factors cause the deviation and will continue to affect the next sample points. A binary exponential factor is introduced to measure the influence duration. In every prediction process, the past deviations will be offset at different levels. The BS's utility in the second phase of two-phase spectrum sharing in a CGC network is discussed first, which has been ignored in previous studies. The appropriate economic stimulation will boost the CGC network to serve end-users more attentively. The relevance between profit and spectrum also promotes spectrum saving in resources reallocation. For spectrum reallocation, a balance between value density and request probability (BVDRP) algorithm is proposed. The existing heuristic algorithms cannot completely apply to the multidimensional bounded knapsack problem. In BVDRP, the pursuit for benefit, cost of system, and irregularity of end-user requests are all concerned.
The rest of the paper is organized as follows. The system model is introduced in Section 2. Section 3 does some theoretical analysis to the model, where we give the method to calculate its parameters. In Section 4, the system is evaluated with real network data and simulation experiments. Finally, Section 5 summarizes our conclusions.
2. System Model
2.1. Spectrum Access Method
A cellular network distributes over land areas called cells. Each is served by at least one fixed-location transceiver, known as a BS. A BS is the center of its local communication cell. For clear and simple description, in this paper a BS's spectrum programme is discussed in its local cell.
It is shown in Figure 1 that the right of use of idle spectrum segments is translated from PUs to a BS. At present CGC study stage, the spectrum sharing is mainly in communication networks using different technologies such as GSM, CDMA, UMTS, or Wi-MAX [15]. Along with the communication technologies and equipment progressing, it is believable that cognitive spectrum resource types will be extended.

Spectrum from PUs to BS.
In a CGC network cell, if an end-user wants to communicate with another, he has to get an available channel from the BS as shown in Figure 2. The BS allocates different users different channels. A complete graph

Cognitive cellular network application example.
2.2. Model Framework
The proposed traffic prediction and spectrum allocation approach, illustrated in Figure 3, is composed of three modules.

Traffic prediction and spectrum allocation approach.
The function of traffic prediction model is to predict end-user traffic as accurately as possible. The inputs are the end-user arrival rates
The allocation strategy model builds up the spectrum allocation strategy in accordance with the operation goals of system. The inputs include the weighting factors α and β, price series
The last part, spectrum allocation model, builds on the outputs of the above two modules. The channel quantity series
3. Proposed Algorithm
The spectrum resource that the BS acquires from PUs is denoted by B. The system auctions or rents B from PUs or in secondary markets, so the spectrum is not continuous but a series of spectrum blocks
The distinct n kinds of channel width the system provides to end-users are denoted by
Suppose the amount of demands for channel width
In such an economical procedure, the technical problems to be solved are the following two: how to predict the demand quantity
3.1. To Predict Demanded Quantities
To predict demanded quantities is a traffic pattern prediction procedure. According to a CGC network's characteristics, the system only occupies the spectrum resources for a specific time, after which they have to be returned to PUs. At that time, the system will rent some new blocks of spectrum. Such a process means that the spectrum resources are updated regularly, in which the period is called a spectrum period. The key to divide spectrum blocks suitably is to predict end-user arrivals and calculate
Denote the spectrum period by
In CR network studies, the arrival of spectrum requests is usually described with a Poisson process [18, 19], which can be defined as
During the estimation of end-user actions, the parameter λ is considered unaltered in a time period
Since
Hence the estimation of
The predicted arrival rate
Let us consider the deviation
Denoting the deviation from the statistic baseline to actual arrival rate,
The predicted arrival rate
From an intuitive understanding, the deviation from the statistic baseline to actual arrival rate is caused by external factors. Such an influence will not disappear in a short period likely. When the arrival rate
Another note is that if the sampling days p are certain, the statistic data update is necessary. After every end-user natural period (24 hours), the oldest arrival rate sampling series
Pay attention that the channels amount
3.2. To Divide Channels
To divide m discontinuous spectrum blocks into n different channels, it is not only bounded but also multiple. So we call it a multiple bounded knapsack problem (MBKP).
The decision problem form of such a knapsack problem is NP-complete [20]; thus it is expected that no algorithm can be both correct and fast (polynomial-time) in all cases. Our interest is to get rapid-speed and low-complexity solutions, so the greedy method is concerned. It tries to get the best choice basing on the current situation in spite of the overall conditions. Compared with the optimal solution at the expense of a lot of time and space, the easy and satisfactory one is enough. A heuristic strategy could not get the optimal solution necessarily but achieve the desired objective quickly [14]. It is suitable for our system, where the channel division is updated termly and the convenient acquisition is more important.
The key of such a greedy algorithm is to appoint the greedy strategy. The density of value greedy strategy sorts the items in decreasing order of value per unit of weight,
However, it has to be considered that the required bandwidth's probabilities lie in different spectrum periods. Though some channels' value densities are bigger, they appear much less in end-user's requests. Such idle channels not only bring few incomes for the system, but also waste scarce spectrum resources seriously.
On account of such considerations, an improved density of value greedy strategy for channel division is proposed. It makes a balance between value density and request probability (BVDRP).
The probability that a kind of bandwidth appears in requests is valued in interval (0,1). In a CGC network, if the selling price of channel is many times than the buying price, the value density will be much larger than request probability and the comparison is uneven. So, the first step to formulate BVDRP strategy is to polish value density in order to be comparable to request probability, which can be written as follows:
Then, the weighting factors, α and β, are introduced into the improved greedy algorithm as follows:
The determination coefficient of channel division decides the allocation order. The bandwidth
The values α and β show the preference the system has between value density and bandwidth request probability. Most often, they are both in interval (0,1). Two extreme situations are as follows. When
In practical applications, the values of α and β are selected according to the system load, choice between income and resource utilization, and other demands.
The mathematical model is to select proper channel bandwidth from
The algorithm to describe BVDRP strategy, which takes into account both value density and request probability, is shown in Algorithm 1. The space complexity of sort operation demonstrated in steps 5 and 8 is
Weighting factors α and β; Bandwidth of every spectrum block Bandwidth of every kind of provided channel Selling price Channel amount (1) Compute processed density of value (2) Compute bandwidth (3) Compute determination coefficients for provided channels (4) Initialize (5) Assign (6) Divide a channel (7) Update (8) Update (9) Update (10) Goto Step 6 if (11)
4. Experiment and Evaluation
Firstly, the proposed BEDO prediction method is compared with three other ones which are as follows.
(i) Classical Statistical Method. This method takes the mean value of arrival rates at the same time point in sampling periods as the prediction result. It is the most easily understandable method and simple to calculate.
(ii)
(iii) WCNC Method. The prediction method in [11] is presented in WCNC 2008. It takes arrival increment dividing time change as slope to estimate the prospective arrival rate and shows an admirable prediction ability.
Although a WiFi network is very different from a CGC one, the two kinds of networks provide network access service. For end-users, when they are ready to visit a website, they will not notice any difference. So, the end-user behaviors are similar in these networks. Some real end-user usage data from CRAWDAD [21] are adopted to verify the BEDO prediction method proposed in this paper. The beginning 240 hours are taken as training time, and then the arrival rates in the next 120 hours are predicted and compared to real arrival data. The comparison results are shown in Figure 4.

Comparison of arrival rate prediction with different methods (real data).
Considering that the arrival rate is easy to be affected by external factors, we take some further simulation experiments to examine the prediction effects with the arrival rate changing severely. The performances of distinct methods are compared in Figure 5.

Comparison of arrival rate prediction with different methods (simulation).
In the above prediction comparisons, when the arrival rate is stationary, just like in Figure 4, the performances of four prediction methods are similar. They all reflect the periodic trend and the variances are acceptable. Although some certain variances of
Cumulative variances of distinct predictions.
So, BEDO method is validated to predict the end-users arrival rate accurately. This is the prerequisite of following spectrum allocation.
Subsequently, some simulation experiments are adopted to validate the channel division approach this paper put forward. The parameters are shown in Table 2. Suppose end-user requests coming with a Poisson process whose parameter
Channel division simulation parameters.
In simulations, the spectrum blocks are generated randomly, the amount of which is from 2 MHz to 50 MHz increasingly. The channels are divided using five different allocation strategies, and the allocation efficiencies are calculated with allocated bandwidth divided by total spectrum bandwidth.
BVDRP is compared with the other four strategies including density of value greedy, value greedy, channel width from small to large, and channel width from large to small. The reasons that they are selected are as follows.
(i) Density of Value Greedy. This strategy is introduced in [14] as a desirable heuristic strategy. Compared with the other previous ones, it is a practical method regarding not only reward but also cost.
(ii) Value Greedy. It highlights the purpose to seek profits merely, which is an economical target on spectrum allocation in CGC networks superficially.
(iii) Channel Width from Small to Large. This strategy divides channels as many as possible and accords with the objective of [4] to enlarge the network capacity.
(iv) Channel Width from Large to Small. It is the opposite to the third strategy, so it is conducted for reference.
Three aspects affected by the above five channel divisions are compared, including allocation efficiencies, utilities of system, and spectrum utilizations.
As shown in Figure 6, the five division methods are not obviously different in allocation efficiencies. When the spectrum resources reach 40 MHz totally, the five curves begin to overlap. It illustrates that 40 MHz resources have satisfied all the system's demands, over which the allocation results are consistent in spite of division difference. The allocation curves decrease gradually over 40 MHz, because the superfluous resources are wasted above the spectrum demands satisfaction.

Channel division effect.
According to these simulation results, it is known that the different division strategies do not reflect differences if the spectrum resources acquired from primary users are sufficient. It is understood that the BSs in CGC networks could not achieve a plenty of spectrums due to resource's unbalance. So, we select the spectrum scenes of 10 MHz, 20 MHz, and 30 MHz to compare system utilities and spectrum utilizations. Such scenes stand for spectrum resources from exceedingly poor to relatively enough.
Figure 7 shows that, with different distributable resources, the system utilities increase linearly following time growth. The incomes of BVDRP, density of value, and bandwidth from small to large strategies are approximate, of which value greedy and bandwidth from large to small are lower.

System utility with different allocation methods.
Value greedy strategy pursues value maximization too much and ignores that more value means more bandwidth. On the face of seeking incomes, this division approach brings less utility to the system when the resources are limited. Bandwidth from large to small strategy wants to divide the channels needing larger bandwidth in advance, until the allocation is difficult. Then the remainder spectrum is divided into smaller channels. Though the higher price is charged for larger channel, the spectrum spent is more and the number of available channels is less. Finally, the total utility of system is less.
Among the three channel division strategies producing more incomes, density of value greedy wants to exchange less bandwidth for more profits. Bandwidth from small to large tries to divide more available channels when spectrum resources are certain and exactly in the experiments the smaller channels have the higher density of value. So, they obtain more utilities for the system.
BVDRP strategy this paper proposed takes into account both value density and request probability. The available channels are vacant rarely that brings the good return to the system. When the lack of spectrum resources is serious (
Figure 8 shows the spectrum utilizations using five different channel division strategies.

Spectrum utilization with different allocation methods.
When
When
On the whole, BVDRP shows up some certain advantages in both system utility and spectrum utilization, especially when cognitive spectrum is scarce.
5. Conclusion
In this paper, we propose a traffic pattern prediction and channel division approach for the wireless service provider (the base-station) in a CGC network. The end-user arrival rate and bandwidth probability are estimated with a binary exponential deviation offset (BEDO) method. Then, we adopt the balance between value density and request probability strategy (BVDRP) to solve the multiple bounded knapsack problem and divide discontinuous spectrum blocks into channels. At last, the methods are evaluated with real data and simulation experiments. The results prove that in a CGC network our process will not only bring more profits to the service provider but also make better use of spectrum resources. And the advantage is more obvious, while the resource is less.
Footnotes
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
The authors have to thank the anonymous reviewers for the helpful suggestions. They also need to thank contributors of CRAWDAD for their open data. This work is supported by the Natural Science Foundation of China under Grant no. 61170188 and the Specialized Research Fund for the Doctoral Program of Higher Education of China under Grant no. 20121102130004.
