Abstract
The paper studies the cooperative spectrum sharing among multiple secondary users (SUs) in a clustering cognitive ad hoc network. The problem is formulated as a repeated game with the aim of maximizing the total transmission rate of SUs. Firstly, a clustering formation procedure is proposed to reduce the overhead and delay of game process in cognitive radio network (CRN). Then the repeated game-inspired model for SUs is introduced. With the model, the convergence condition of the proposed spectrum-sharing algorithm is conducted, and the convergence performance is investigated by considering the effects of three key factors: transmission power, discount factor, and convergence coefficient. Furthermore, the fairness of spectrum sharing is analyzed, and numerical results show a significant performance improvement of the proposed strategy when compared to other similar spectrum-sharing algorithms.
1. Introduction
The scarcity of spectrum has become a major bottleneck of the development of next generation wireless communication system. Cognitive radio (CR), which allows unlicensed or secondary users (SUs) to share the spectrum with licensed or primary users (PUs), shows great promise to enhance the spectrum utilization efficiency [1, 2]. The CR technology enables the SUs to opportunistically access the available spectrum bands through four main functionalities: spectrum sensing, spectrum managing, spectrum mobility, and spectrum sharing [3]. Among these, spectrum sharing is one of the most important functions in cognitive radio, which allows SUs to share the available spectrum bands among the coexisting PUs [4]. It is essential for improving spectrum utilization.
There exist some research efforts on the problem of spectrum sharing in CR. Among these studies, a centralized spectrum management scheme was proposed in [4]. It greatly improves the system performance over the (iterative water-filling) IWF scheme by utilizing a centralized spectrum management center (SMC). However, due to the heterogeneous and dynamic nature of cognitive radio, centralized approach is not practical. Instead, in some studies, the distributed approach which does not need any central controller is suggested [5]; in [5], asynchronous distributed pricing scheme is proposed, based on the signal exchange via coordination between users to compensate the ascendant interference level. Distributed approach provides the better adaptation capability to CR in the dynamically changing heterogeneous environment, but the coordination among SUs results in significant amount of coordination delay. In [6], a novel distance-dependent MAC protocol for CRN is proposed, which attempts to maximize the CRN throughput. Regretfully, these protocols do not consider the fairness of the spectrum sharing.
Game theory, which analyzes the conflict and cooperation among decision makers (users), has widely used in designing efficient spectrum sharing. A dynamic game model is presented in [7], in which the SUs can iteratively adapt their strategies in terms of requested spectrum size. The stability condition of the dynamic behavior for the spectrum-sharing scheme is investigated. In [8, 9], the authors investigate whether spectrum efficiency and fairness can be obtained by modeling the spectrum sharing as a repeated game. In [10], the authors model the channel assignment and power control problems as a noncooperative game, in which all wireless users jointly pick an optimal channel and power level to minimize a joint cost function. A no-regret learning algorithm using the correlated equilibrium concept to coordinate the secondary spectrum access is considered in [11]. In [12], a self-enforcing truth-telling game mechanism is used to suppress cheating and collusion behavior of selfish users for spectrum sharing. It is shown that the SUs can get the highest rate utility only by announcing their true private information under the assumption that all the SUs have the same maximal transmission power. Based on the Nash bargaining in cooperative game, an improved utility function is proposed in [13] to maximize the profit product of all the SUs.
Consider that the CRN is characterized by a lack of centralized control and the restriction that global information is not available, which requires that the game algorithms for spectrum sharing should be completely distributed relying on local information. It motivate us to employ local interaction games [14], which have been recently introduced in CRN research known as graphical games in [15]. In [16], two cases of local interaction game are proposed to cope with the lack of centralized control and local influences. The first is local altruistic game, in which each user considers the payoffs of itself as well as its neighbors rather than considering itself only. The second is local congestion game, in which each user minimizes the number of competing neighbors. It is shown that, with the local games, global optimization is achieved with local information. Although some progresses have been achieved in the above approaches, the problem of spectrum sharing more efficient and fair is not yet solved.
In this paper, a repeated game theoretic model is proposed, with the aim to maximize the total rate revenue of SUs in cognitive ad hoc network. Clustering is executed firstly in the model to avoid frequent collisions and to reduce the coordination delay between SUs. With the model, the convergence condition for the total revenue maximum is studied. The transmission power, discount factor, and convergence coefficient, which impact the convergence behavior, are explored. Besides, the fairness of spectrum sharing is investigated.
The rest of the paper is organized as follows. In Section 2, the system model is introduced. In Section 3, the total rate revenue of the spectrum-sharing algorithm, the convergence behavior and fairness of the algorithm are analyzed. In Section 4, the performances of the rate revenue, convergence, and fairness are simulated and evaluated. Finally, conclusions are drawn in Section 5.
2. System Model and Descriptions
Spectrum-sharing model based on repeated game theory is presented in Figure 1. In the scenario, N secondary users compete to access the available spectrum to transmit data. Note that the wireless channels are assumed to be quasistatic for each time slot, that is, channels remain unchanged within the time slot duration, but they vary from one slot to another one.

Repeated game-inspired spectrum-sharing model.
The model consists of two parts: cluster formation and spectrum sharing. With no clustering in a cognitive network, the collision happened as long as
For the spectrum sharing, a static repeated game among SUs can be used to obtain the Pareto optimality in a clustering fashion by the “grim trigger” strategy.
Definition 1.
The Nash equilibrium is a set of strategies, one for each player, such that no player has incentive to unilaterally change her action. Players are in equilibrium if a change in strategies by any one of them would lead that player to earn less than if she remained with her current strategy. For games in which players randomize (mixed strategies), the expected, or average payoff (also termed as revenue, utility or outcome) must be at least as large as that obtained by any other strategy.
Definition 2.
The Pareto optimality is a measure of efficiency. An outcome of a game is Pareto optimal, if there is no other outcome that makes every player at least as well off and at least one player strictly better off. That is, a Pareto optimal outcome cannot be improved upon without hurting at least one player. Often, a Nash Equilibrium is not Pareto optimal implying that the players' payoffs can all be increased.
In order to stimulate cooperation among selfish players (SUs) and achieve the Pareto optimality, the “grim trigger” strategy is adopted in our spectrum-sharing model. The “grim trigger” is a trigger strategy employed in a repeated game [18]. Initially, a player using grim trigger will cooperate, as soon as the opponent defects (thus satisfying the trigger condition), and the player using grim trigger will defect for the remainder of the iterated game. Since a single defect by the opponent triggers defection forever, grim trigger is the most strictly unforgiving of strategies in an iterated game.
2.1. Clustering Formation
In this section, a clustering procedure that exploits combined weight metrics is proposed. The idea of using combined weight metrics, including the ideal degree, transmission power, and battery power, has been considered in the literature [19]. In this paper, in addition to the above weight metrics, the clustering stability and node type have also been considered as weight metrics to elect the cluster-heads (CHs). The clustering stability is used to retain the stability of the network topology, and node type helps to reflect the realistic characteristics of multiple types of nodes in cognitive ad hoc networks.
The network formed by the SUs and transmission links can be represented by an undirected graph
The following metrics are considered in our clustering procedure for a cognitive ad hoc network.
(i) Ideal Degree. Each CH can ideally support
The degree difference from the ideal degree helps in efficient MAC functions and load balancing because it is always desirable for a CH to handle up to a certain number of nodes in its cluster.
The neighbors of each SU-v (i.e., SUs within its transmission range) is defined as the degree of node v,
(ii) Transmission Power. It is known that more power is required to communicate to a larger distance. As the nodes move away from the CH, the communication may become difficult due mainly to signal attenuation with increasing distance.
The usual attenuation in the signal strength is inversely proportional to some exponent of the distance, which is usually approximated to 4 in cellular networks, where the distance between mobiles and base stations is of the order of 2-3 miles. In ad hoc networks, the distances involved are rather small (approximately hundreds of meters). In this range, the attenuation can be assumed to be linear [20].
For every node, the sum of the distances,
(iii) Battery Power. The battery power can be efficiently used within certain transmission range; that is, it takes less power for a node to communicate with other nodes if they are within close distance to each other. A CH consumes more battery power than an ordinary node, since it has extra responsibilities to carry out for its members.
We consider a heterogeneous network with multiple initial energy levels. The cumulative time,
(iv) Clustering Stability. In order to avoid frequent CH changes, it is desirable to elect a CH that does not move very quickly. The focus of the most existing literatures has mostly been on absolute mobility of the nodes without taking into consideration the relative mobility. Thus, the stability of the network is perturbed. Our clustering procedure achieves the network stability by considering the relative mobility of nodes.
The average distance for every node v from its neighbor u till current time T is calculate as
Let
The average of the link stability for every node v, with all its neighbors [21], is given by
(v) Node Types. In many realistic ad hoc networks, multiple types of nodes do coexist [22]. For example, in a battlefield network, portable wireless devices are carried by soldiers, and more powerful and reliable communication devices are carried by vehicles, tanks, aircrafts, and satellites; these devices/nodes have different communication characteristics in terms of transmission power, data rate, processing capability, reliability, security level, and so forth.
In heterogeneous ad hoc networks, it would be more realistic to elect CHs for considering the different types of nodes. For simplicity, two types of nodes are considered in the network. One type of node has larger transmission range (power) and data rate and better processing capability and is more reliable and robust than the other types. Accordingly, the mapping values of the two types of nodes
The combined weight
The contribution of the individual metrics can be tuned by choosing the appropriate combination of the weighing factors [21]. The node with the smallest
The first component in (9),
The proposed clustering strategy is demonstrated with the help of Figures 2(a)–2(d). All numeric values obtained from clustering process are given in Table 1. Figure 2(a) shows the initial configuration of the nodes (SUs) in a cognitive radio network, where an edge (link) between two nodes in the figure signifies that the nodes are neighbors of each other, and the length of a link represents the distance of two nodes. In the figure, the degree difference,
Execution of the clustering strategy.

(a) Initial configuration of nodes and neighbors identified. (b) Velocity of the nodes. (c) CHs identified. (d) Clusters identified and connectivity achieved.
2.2. Repeated Game-Based Spectrum Sharing
The SUs sharing the spectrum of licensed users (PUs) may lead to the following conflicting problems: (i) limitation on the transmission power in each channel for minimum interference to coexisting PUs and (ii) certain signal-to-noise ratio (SNR) required for data transmission of SUs without substantial performance degradation. Cooperation among SUs has been proved to be beneficial in solving such conflicting interests [23]. Such cooperation can be achieved with the application of the concepts of game theory. But, cooperative game faces scalability problem to be implemented in CR networks. In a large CR network, the overhead and delay of the game process will be unbearable, if all the SUs play a single game; on the other hand, it is not reasonable to let SUs that are set apart far way (thus has little direct mutual impacts) play the same game. So it is necessary and reasonable to group the cognitive radios network into multiple clusters firstly.
In order to model and analyze long-term interactions among players, the repeated game model is used where the game is played for multiple stages. A repeated game is a special form of an extensive-form game in which each stage is a repetition of the same strategic-form game. Particularly, the spectrum-sharing problem can be modeled as the outcome of a repeated game, in which the players are the SUs their strategies (actions) are the choice of spectrum resources.
In a repeated game, a normal-form game is mathematically defined as
In analyzing the outcome, as the decision of one SU is influenced by the other SUs' decisions, we are interested to determine, if there exists a convergence point, that is, Nash equilibrium (NE), for the spectrum-sharing algorithm, from which no SU would deviate anymore. A strategy profile for the SUs,
When there is more than one NE in the repeated game process, it is natural to ask whether there exists an optimal one, that is, Pareto optimality. In order to stimulate cooperation among selfish SUs and achieve the optimality, the “grim trigger” strategy is adopted. For the case of no deviation from cooperation in a repeated game, the utility function at every stage for SU-v is unchangeable. The overall utility for SU-v in a repeated game is represented as the discounted sum of immediate utilities from each stage; that is,
Without loss of generality, consider a repeated game with two SUs (one is CH and the other is cluster-member) competing for the limited spectrum resources. If an SU senses the PU at the licensed spectrum, it moves to another spectrum hole or stays in the same band without interfering with the PU by adapting its communication parameters such as transmission power or modulation scheme. For this paper, the total transmission power is constrained for SUs to make them stay in the same band during the spectrum sharing.
Figure 3 illustrates the utility region of a repeated game with two SUs for the Gaussian interference channel.

The feasible utility region of a repeated two-player game.
3. Performance Evaluation
The cognitive ad hoc network considered consists of multiple SUs is illustrated in Figure 1. In the figure, TDMA scheme is suggested for every cluster; that is, the TDMA frame is divided into M time slots, and every cluster is assigned one unique spectrum-sharing slot in a frame. The mth cluster always transmits in slots assigned to it.
For a cluster containing only one SU, the SU might occupy the assigned slots for transmission. However, if two or more SUs exist in a cluster, the repeated game should be executed to compete for the assigned slot. Take SU-1 and SU-2 in Figure 1 as an example; they consist of a set of two transmitting-receiving (T-R) pairs with channel gain of 1 for each T-R pair. Assume that the sharing channels, width is normalized to 1, which is divided into two independent channels with width of 1/2 and with the external noise power of
3.1. Rate Utilities and Total Rate Revenue
When the two users transmit over the same channel, the interference is looked as the Gaussian noise, and the Gaussian interference game was defined in [24, 25].
The interference measured at the receiver u associated with transmitter v is shown to be [26]
It is apparent that utilities of SUs are closely related to interferences. Moreover, the performance of the spectrum-sharing algorithm depends significantly on the choice of the utility function which characterizes the preference of a user for a particular channel. The choice of a utility function is not unique. It must be selected to have physical meaning for the particular application and also to have appealing mathematical properties that guarantee equilibrium convergence for the game process. Our objective is to maximize the total rate revenue of SUs by cooperatively sharing the spectrum. Let
By substituting (17) into (14),
Then the total rate revenue is
3.2. Convergence Analysis
As mentioned, revenue stability is closely related to the number of stages. The definition is following: when the revenue differences
3.3. Fairness Analysis and Improvements
To maintain reliable communication, a certain transmitting rate threshold required for SUs is necessary. If one user transmits with high power while the other uses low transmission power (as point C or D in Figure 3), the normal communication is not well guaranteed.
How to improve the fairness of spectrum sharing is an urgent issue to be settled. It is assumed that
Let the fixed-step size of power adjustment,
4. Simulation Results
It is assumed that the noise power

(a) Total rate revenue compared with [13]. (b) Total rate revenue versus number of stages for different discount factors.
The total rate revenue versus different number of stages with varying discount factors for
Figure 5(a) shows the impact of transmission power on the convergence rate when the noise power

(a) Convergence performance versus different transmission powers. (b) Convergence performance versus discount factor. (c) Convergence performance versus convergence coefficient.
Figure 5(b) plots the convergence behavior with varying discount factor δ for noise power
Figure 5(c) depicts the number of stages for convergence
The rate utilities

Initial rate utilities for different transmission powers.
The adaptation of rate utility due to the transmission power adjustment is shown in Figure 7. As expected, when the rate utility

Rate utilities adaptation to power adjustment.
The total revenue comparison between the proposed algorithm with the method (LAG) in [16] is plotted in Figure 8. It can be observed that the total rate revenue increases with better fairness (i.e., larger ratio of the transmission powers, r). Especially, the total revenue is maximum when

Total rate revenue compared with LAG.
5. Conclusions
In this paper, the spectrum sharing is modeled as a repeated game under which cluster-member SU communicates with only cluster-head SU, and frequent collisions between SUs are avoided than that under no-cluster policy. The aim of this work is to maximize the total rate revenue of SUs under the repeated game model, while meeting the fairness requirements for transmitting data. The first step toward this work is to analyze convergence condition under which the total rate revenue of SUs is maximized. The analysis shows that the transmission powers, discount factor, and convergence coefficient affect the total rate revenue. The analysis continues by considering the fairness of spectrum sharing, and the fairness is improved by adjusting the transmission powers. Simulation results demonstrate that the proposed spectrum-sharing algorithm can achieve better performance than the preexisting ones in terms of the total rate revenue and fairness of spectrum sharing.
For future research, the QoS (Quality-of-Service) requirement for SUs will be considered for spectrum sharing.
Footnotes
Acknowledgments
This work was partially supported by the National Natural Science Foundation of China (61261014), the State Key Laboratory of Rail Traffic Control and Safety (RCS2011K006), the Beijing Jiaotong University, and the Fundamental Research Funds for the Gansu Universities (212087-2).
