# Selective sensing and access strategy to maximize throughput in cognitive radio sensor network.

1. IntroductionWireless sensor networks (WSNs) play a critical role in many research areas, such as machine-to-machine network (M2M), emergency communication, and smart home [1-4]. There are two characteristics about the WSNs: limited energy and scarce spectrum. Generally, the nodes of WSNs are powered by battery; energy efficiency is one of the design factors. In this paper, we focus on the spectrum scarcity problem of WSNs. According to the dramatic increasing number of wireless devices, the problem of available spectrum scarcity becomes more serious. Cognitive radio (CR), which has been introduced as a way to improve the efficiency of spectrum utilization, becomes a research focus in recent years [1-27]. Combined with the advantages of CR and WSNs, cognitive radio sensor network (CRSN) has been studied [1-4, 6-8,15]. In studies [1, 2], CR has been used in smart grid communication networks. The spectrum access strategy has been proposed, in order to find much available spectrum resource for data collection and transmission. In order to improve the energy efficient of vehicular ad hoc network, CRSN has been studied in [3]. In the CRSN, the wireless nodes are defined as secondary (unlicensed) users (SUs), while the other available spectrum owners are defined as primary (licensed) users (PUs). The important challenge is that the SUs find available channels and adjust their transmission parameters (transmission power, transmission time, carrier frequency, etc.) to access these channels, while avoiding harmful interference to PUs.

Normally, the available spectrum opportunities are found in time [4, 7, 8,10-12,15] space 14] and frequency domains [18]. In [10-12], SUs perform spectrum sensing to find access opportunities in time domain. The SUs access the licensed channel when it was not used by PUs. In [4], channel sensing and switching were used for SU to access a set of PU channels. In [7, 8], two spectrum sensing methods were studied in CRSN. In [15], both single-user and collaborative spectrum sensing schemes were proposed in cognitive sensor networks, while quantised sensing observation is used. In [13], the SUs accessed the licensed channel directly at any time; the transmission power was limited to avoid unacceptable interference to PU system. The spectrum access opportunity in space domain was used. In order to find available spectrum in different channels, parallel sensing [19], cooperative sensing [17], and wideband spectrum sensing (WSS) were studied. In [16, 17, 20], cooperative sensing has been studied. The SUs sense other channels if the current channel has been occupied or detect a set of channels simultaneously [21-25], to find available spectrum in a wideband spectrum.

Different from the excessive spectrum sensing and access researches, which focus on utilizing the time or space domain only, a spectrum sensing and access strategy of CRSN is proposed in this paper, in which an SU system uses the available spectrum both in time and space domains in multiple channels. WSS is used for SU to identify the presence of PU signals. Different from the normal spectrum sensing scheme, which an SU senses the channels one by one, in WSS scheme, multiple channels are detected simultaneously by an SU; the sensing time durations of channels are the same. After spectrum sensing, the SU accesses all of the channels with mixed access strategy (MAS) [26, 27]. Under MAS, the SU accesses the channels with different powers based on the sensing results. When the channel is sensed as idle, the SU accesses it with a higher transmission power; the available spectrum in time domain is used. Otherwise, transmission power is lower enough to avoid unacceptable interference to PUs. The available spectrum in space domain is used. Thus, comparing with other spectrum access strategies, the SU in MAS obtains greater throughput.

However, all of the channels are selected for WSS which is not a suitable choice. There are three reasons for the necessity of the sensing channels' selection. First, sensing all of the channels is a huge challenge; great energy is needed [9], which is not a good choice for WSNs. Second, if PU signals in some channels are weak, much more sensing time is needed to guarantee the protection of PUs. For the reason of WSS utilization, the sensing overhead of the system is prolonged. Finally, when the idle probabilities of some channels are low, the SU obtains a larger average throughput when it accesses these channels directly, compared with the average throughout which is obtained when the SU accesses the channels after sensing. Thus, in this paper, the SU selects some channels for spectrum sensing. For the channel which has been selected, the SU accesses it and adjusts its transmission power based on the sensing results. Otherwise, the SU accesses the channels directly via underlay access strategy. Therefore, in our proposed system model, a tradeoff exists between achieving larger throughput and selecting appropriate sensing channels.

According to the dynamic spectrum environment, the SU cannot obtain accurate states of PU channels, due to the imperfect spectrum sensing and not all of the channels are selected for sensing. In this paper, we propose a selection making algorithm by using the partially observable Markov decision process (POMDP) theory. Under the selection making algorithm, the SU determines which channels are selected for spectrum sensing, how long the sensing time, and the transmission powers of the accessed channels. The objective of the selection making algorithm is to maximize the throughput of SU system, while avoiding unacceptable interference to PUs. An optimal policy and a myopic policy are derived to solve the formulated POMDP problem. Moreover, we present an optimization algorithm to solve the synchronism problem among the selected channels. Extensive numerical examples are proposed to demonstrate the merit of the proposed algorithms.

The contributions of this paper can be described as follows.

(i) A new selective sensing and access strategy of CRSN based on POMDP theory is proposed, in which at beginning of each slot, an SU selects some channels for wideband spectrum sensing and accesses all of the channels via mixed access strategy.

(ii) An optimal policy and a myopic policy are proposed to solve the proposed POMDP problem.

(iii) We consider an optimization problem, in which the decision probability thresholds of the selected channels are jointly optimized, in order to ensure the synchronism among the selected channels.

The rest of this paper is organized as follows. In Section 2, we review the related work. System model is proposed and analyzed in Section 3. In Section 4, we discuss a selection making algorithm via the POMDP framework; an optimal policy and a myopic policy are proposed to solve the proposed POMDP problem. The advantage of the proposed algorithms is illustrated by numerical results in Section 5, and conclusions are drawn in Section 6.

2. Related Work

Wideband spectrum sensing has been discussed in [21-25]. In [21], the detection thresholds of energy detectors in channels were jointly optimized. In order to maximize the throughput of SU, the sensing time and detection thresholds were jointly optimized in [22]. In [23, 24], both the sensing time and transmission powers of channels are jointly optimized. However, in these works, the SU selects all of the channels for spectrum sensing, and accesses these channels only when the sensing result is idle. The selection of sensing channels has not been considered. Different from the previous works, in this paper, not all of the channels are selected for sensing. After spectrum sensing, the SU accesses the channels via mixed access strategy. No matter the sensing result is idle or occupied, the SU accesses the channel with different transmission powers, and greater throughput is obtained. Moreover, different from the previous works in which the WSS was considered in one slot, we consider the sensing channels' selection and spectrum access in multiple slots. The problem becomes complicated.

According to the time-varying character of the dynamic spectrum environment, POMDP is used to formulate the selective spectrum sensing and access problem. In [28, 29], two optimal opportunistic spectrum access MAC protocols were proposed. In [30], a well-known separation principle was proposed to transfer the solution of the POMDP problem from optimal policy to myopic policy. However, both are based on the condition that the SU can sense and access one channel in a slot. In our proposed scheme, the SU selects several channels for spectrum sensing and accesses all of the channels after sensing. The calculation of sensing time becomes complicated. In [31], an adaptive sensing scheduling scheme was proposed, based on POMDP theory. The study [32] is probably the most relevant paper, in which an optimal sensing channels' selection policy is proposed. However, it accesses the channel only when the sensing result is idle. In our work, the SU accesses all of the channels with different transmission powers. Although the previous works take the same mathematical method (POMDP), the problem in this paper becomes quite complicated, and some efficient methods are proposed to solve the problem.

3. System Model

In this section, we present the system model of this paper and the structure of wideband spectrum sensing. Then, a selective sensing and access strategy in multiple channels is proposed.

3.1. System Model. An SU system shares a licensed wideband spectrum assigned to PUs, which can be divided into N nonoverlapping narrowband channels. The channels operate in a time-slotted manner. The traffic of PU system is modeled as a two states ON-OFF process. Figure 1 shows the structure of SU system. We assume the SU obtains the duration of slot and can keep synchronization with PUs [10, 11]. Denote [[alpha].sub.n] as the probability that channel n transits from ON to OFF state and [[beta].sub.n] as the probability that channel n transits from OFF to ON state. We assume [[alpha].sub.n] and [[beta].sub.n] are obtained from the previous long time measurement. [P.sub.n]([H.sub.0]) = [[alpha].sub.n]/([[alpha].sub.n] + [[beta].sub.n]) and [P.sub.n]([H.sub.1]) = [[beta].sub.n]/([[alpha].sub.n] + [[beta].sub.n]), n [member of] {1, 2, ..., N}, where [P.sub.n]([H.sub.0]) and [P.sub.n]([H.sub.1]) are the average idle and occupied probabilities. Let a 1 x N vector S(m) denote the state vector of PU channels in slot m, S(m) = ([s.sub.1](m), [s.sub.2](m), ... , [s.sub.N](m)), where m is the index of slot, m [member of] {1, 2, ..., M}, [s.sub.n](m) = 1 indicates the operating channel n in slot m is occupied, and [s.sub.n](m) = 0 indicates the channel is idle. The state space of S(m) is

w(m) = {([w.sub.1], [w.sub.2], ..., [w.sub.N]) | [w.sub.n] [member of] {0, 1}}. (1)

Let [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] denote the state transition probability from state [w.sub.n] to state [w'.sub.n] of channel n. [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII], where [w'.sub.n] [member of] {0,1}. For channel n, the state transition probabilities are [P.sub.01] = [[beta].sub.n], [P.sub.00] = 1 - [P.sub.01], [P.sub.10] = [[alpha].sub.n], and [[beta].sub.n] = 1 - [P.sub.10].

At beginning of each slot, the SU selects some channels for WSS and accesses the channels with MAS. For the other channels, the SU waits the same time, in order to keep synchronism with others. The unselected channels are accessed directly. The SU cannot transmit in one channel while performing spectrum sensing in another one. Denote [[tau].sub.n](m) as the sensing time duration of channel n in slot m. Under MAS, the SU detects the channel states firstly. When the sensing result is idle, the SU accesses it with power [P.sub.1n]. When the sensing result is occupied, the transmission power is [P.sub.2n]. Normally, we assume [P.sub.1n] > [P.sub.2n]. The values of [P.sub.1n] and [P.sub.2n] are obtained by some power allocation optimization algorithms [27], which are not our focus in this paper. If the SU accesses the channel directly, the transmission power is [P.sub.2n]. After transmission, the SU receiver announces ACKs to the transmitter of SU. If the transmission is not successful, the SU receiver announces a NAK. The time duration for acknowledges is ignored in our proposed structure. The structure of SU in one slot is shown as Figure 2.

3.2. Wideband Spectrum Sensing. In WSS scheme, we assume an wideband spectrum is occupied by some PUs. The PUs are operating in different spectrum bands; idle probabilities of channels are not the same. WSS is used to identify the presence of PUs in some special channels. In WSS, the wideband is divided into N nonoverlapping channels. The SU receives data through a wideband RF antenna. Then, the received data is passed through a high speed A/D converter [21-25]. The structure of WSS is shown as Figure 3. We can find that the sensing time durations of channels are the same.

3.3. Selective Sensing and Access Strategy. At the beginning of each slot, the SU determines (1) which channels are selected for spectrum sensing, (2) how long the sensing time, (3) which channels to be accessed with power [P.sub.1n], and which channels with power P2n. The design objective is to maximize the throughput of SU system during a desired period of M slots, while the interferences to PUs are under the predetermined thresholds. Let [OMEGA] denote the set of the selected channels. For the channel n in slot m, n [member of] [OMEGA], the SU selects it for spectrum sensing and accesses it based on the sensing results. According to the difference between the sensing results and the real states of channel, four cases are considered. When the sensing result is idle and the channel is not occupied by PUs, the transmission is successful with power [P.sub.1n](m); the transmission rate is [R.sub.00n](m). However, if the real state of the channel is occupied and the SU has not detected it, the transmission of SU cannot be successful, for the reason of the interference from PUs. When the sensing result is occupied, the transmission power is [P.sub.2n](m). The power [P.sub.2n](m) is limited to avoid unacceptable interference to PUs. No matter the real state of channel is idle or occupied, the transmission of SU is successful. Therefore, the achievable rate of channel n in slot m is given by

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII], (2)

where [R.sub.00n] = [Blog.sub.2](1 + ([P.sub.1n](m)[h.sub.ssn]/[N.sub.0])), [R.sub.01n](m) = [Blog.sub.2](1 + ([P.sub.2n](m)[h.sub.ssn]/[N.sub.0])), and [R.sub.11n](m) = [Blog.sub.2](1 + ([P.sub.2n](m)[h.sub.ssn]/([P.sub.p][h.sub.psn] + [N.sub.0])). [h.sub.ssn] and [h.sub.psn] are the channel gains, B is the bandwidth of channel, and [P.sub.p] is the power of PUs. The channel which has not been selected, the SU estimates it as occupied and accesses it directly with transmission power [P.sub.2n](m). The transmission rate is [r'.sub.n](m) = [P.sub.n]([H.sub.0])[log.sub.2](1 + ([P.sub.2n](m)[h.sub.ssn]/[N.sub.0])) + [P.sub.n]([H.sub.1])[log.sub.2](1 + ([P.sub.2n](m)[h.sub.ssn]/([P.sub.p][h.sub.psn] + [N.sub.0]))). To maximize the throughput of SU system, the optimization problem is formulated as

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (3)

subject to

[P.sub.d,n] ([[tau].sub.n] (m)) [greater than or equal to] [P.sub.d,th], (4)

[[tau].sub.1](m) = [[tau].sub.2] (m) = ... = [[tau].sub.N](m) = [tau](m), (5)

n [member of] {1, 2, ..., N} m [member of] {1, 2, ..., M}. (6)

Constraint (4) is a sensing time constraint, in order to protect PUs. Constraint (5) guarantees the WSS is used for spectrum sensing. It is not easy to solve this problem directly. We present a selection making algorithm based on POMDP theory, to find the optimal and suboptimal solutions.

4. Selection Making Algorithm

In the proposed strategy, because not all of the channels are selected for sensing and the presence of sensing errors, the SU cannot obtain the accurate states of each channel. At the beginning of each slot, decisions are made based on the previous actions and observations. This setting matches well with the POMDP framework [28-35]. Therefore, an optimization selection making problem under the proposed strategy is formulated as a POMDP, which determines an optimal policy for sensing channels' selection, the size of sensing time, and the access decisions. Next, we describe the POMDP framework. An optimal and a myopic policy are proposed to solve the POMDP problem.

4.1. POMDP Framework

4.1.1. Actions. At the beginning of slot m, the actions of SU have three stages: determine which channels to sense, the size of sensing time, and the transmission power. Let A(m) denote the SU action in slot m

A (m) = [[A.sub.1](m), [tau](m), [A.sub.2](m)], (7)

where [A.sub.1](m) denotes which channels are selected for spectrum sensing, [A.sub.1](m) = [[a.sub.11](m), [a.sub.12](m), ... , [a.sub.1N](m)], [a.sub.1n](m) [member of] {0, 1}. When channel n is selected for sensing, [a.sub.1n](m) = 1. Otherwise, [a.sub.1n](m) = 0. [tau](m) denotes the sensing time duration. [tau](m) = [[[tau].sub.1](m), [[tau].sub.2](m), ..., [[tau].sub.N](m)], and [[tau].sub.1](m) = [[tau].sub.2](m) = ... = [[tau].sub.N](m). The WSS is used to detect the presence of PU channels. The sensing time durations of the selected channels are the same. If none of the channels are selected, the sensing time durations are zero. [A.sub.2](m) denotes which channels are selected to access with power [P.sub.1n](m) or [P.sub.2n](m). [A.sub.2](m) = [[a.sub.21](m), [a.sub.22](m), ..., [a.sub.2N](m)], [a.sub.2n](m) [member of] {0, 1}, [a.sub.2n](m) = 1 indicates the channel n is accessed with [P.sub.1n](m) and [a.sub.2n](m) = 0 indicates it is accessed with power [P.sub.2n](m). If the channel n has not been selected for sensing, the transmission power is [P.sub.2n](m), [a.sub.2n](m) = 0.

4.1.2. Observations. Let [theta](m) denote the channel observation vector in slot m. [theta](m) = [[[theta].sub.1](m), [[theta].sub.2](m), ..., [[theta].sub.N](m)]. In the proposed scheme, the observation of channel n in slot m has four possible values. [[theta].sub.n](m) [member of] {0, 1, 2, 3}.

(i) If channel n has been selected for sensing and the sensing result is idle, the SU accesses it with power [P.sub.1n](m). After transmission, the SU transmitter receives ACK1, which indicates the sensing result is correct. We denote this as observation 0, [[theta].sub.n](m) = 0.

(ii) However, if the SU transmitter receives NAK, it indicates the sensing result is wrong; the transmission of SU causes unacceptable interference to PU. We denote this as observation 1, [[theta].sub.n](m) = 1.

(iii) If the sensing result of channel n is occupied, the SU accesses it with power [P.sub.2n](m). The SU transmitter receives ACK2 after transmission. We denote this as observation 2, [[theta].sub.n](m) = 2.

(iv) If the SU accesses channel n directly with power [P.sub.2n](m) and receives ACK2 after transmission. We denote this as observation 3, [[theta].sub.n](m) = 3.

The difference between observations 2 and 3 is that the SU has not sensed the channel in observation 3. Although the announcing signals are the same, the transmitter of SU can distinguish different observations.

4.1.3. Belief Vector. In the POMDP formulation, belief vector is used to infer the channel states at the beginning of each slot. It is a conditional probability for the past history, including the past decisions and observations. At the end of each slot, the belief vector is updated based on different actions and the corresponding observations, in order to obtain accurate information of the dynamic environment. The belief vector of channel n in slot m is denoted as [b.sub.n](m), [b.sub.n](m) = [[b.sub.n,0](m), [b.sub.n,1](m)], [b.sub.n,0](m) + [b.sub.n,1](m) = 1. [b.sub.n,0](m) is a conditional idle probability. Consider the following:

[b.sub.n,0](m + 1) = Pr([s.sub.n] (m + 1) = 0 | [b.sub.n,0](m), [A.sub.n](m), [[theta].sub.n](m)), (8)

where [A.sub.n](m) = [[A.sub.1n](m), [[tau].sub.n](m), [A.sub.2n](m)] is the actions of SU in channel n. Let [b.sub.n](0) denote the initial belief vector; it is equal to the stationary probability vector. [b.sub.n,0](0) = [[alpha].sub.n]/([[alpha].sub.n] + [[beta].sub.n]), [b.sub.n,1](0) = [[beta].sub.n]/([[alpha].sub.n] + [[beta].sub.n]). At the end of slot m, the Su transmitter receives four different observations; the belief vectors [b.sub.n,0](m) and [b.sub.n,1](m) are updated in different cases. [[theta].sub.n](m) = 0 means the real state of the channel n in slot m is idle. The belief vector in slot m+1 is [b.sub.n,0](m + 1) = [P.sub.00]. [[theta].sub.n](m) = 1 means the real state is occupied. [b.sub.n,0](m + 1) = [P.sub.10]. When [[theta].sub.n](m) = 2, the SU senses channel n and the sensing result is occupied, but the SU cannot obtain the real state of that channel. From the Bayes rule, the belief vector updated formula is given as the following equation:

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. (9)

Finally, when [[theta].sub.n](m) = 3, the belief vector is updated as

[b.sub.n,0](m + 1) = [b.sub.n,0](m) [P.sub.00] + [b.sub.n,1](m) [P.sub.10]. (10)

4.1.4. Reward Function. Denote [R.sub.n](m) as the immediate reward of channel n in slot m; the immediate reward of the SU system is R(m) = [[summation].sup.N.sub.n=i] [R.sub.n](m). It is associated with actions and observations. Denote [[??].sub.n](m) as the expected reward of channel n. When the channel n has not been selected for spectrum sensing, we have

[[??].sub.n](m) = [R.sub.n](m) = [r'.sub.n](m). (11)

Otherwise, the expected reward of the SU system [??](m) is calculated as

[??](m) = [N.summation over (n=1)] [[??].sub.n](m) = [N.summation over (n=1)] [2.summation over (z=0)] Pr ([[theta].sub.n](m) = z | [b.sub.n](m - 1), [A.sub.n](m), [s.sub.n](m)) [R.sub.n](m), (12)

where z [member of] {0,1,2} denotes the observation value space. Pr([[theta].sub.n](m) = z | [b.sub.n](m - 1), [A.sub.n](m), [s.sub.n](m)) is the probability of observation; it is associated with the belief vector in last slot, the actions and the real state of the current slot. The expression of [??](m) is shown as follows:

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. (13)

Based on the above discussion and analysis, the procedure in the POMDP framework is shown as Figure 4.

4.2. Solution to POMDP. In the proposed scheme, the design objective is to develop an optimal selective sensing and access policy in each slot, in order to maximize the expected total reward obtained in the finite M slots. The complete problem formulation based on POMDP is given by

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (14)

subject to

[P.sub.d,n]([[tau].sub.n] (m)) [greater than or equal to] [P.sub.d,th] n [member of] {1, 2, ..., N} m [member of] {1,2, ...,M}. (15)

It is a constraint POMDP problem, which requires an intractable randomized policy to achieve optimality. However, the objective function can be separated from the constraint, if the SU trusts the current sensing result and accesses the channel based on the sensing result in the current slot [28, 30]. The sensing time is obtained from [P.sub.d,n]([[tau].sub.n](m)) = [P.sub.dth], n [member of] {1, 2, ..., N}. The problem becomes an unconstrained POMDP problem.

After obtaining the sensing time, the problem reduces to a simple one. Two questions are considered: which channels are selected for spectrum sensing and which transmission powers are selected for transmission. The narrowband channels are independent with each other. The actions of SU system can be divided into the combination of each channel's actions. Then, we can calculate the optimal actions of each channel independently.

4.2.1. Optimal Policy. In order to calculate the optimal policy of channel n effectively, a value function [J.sub.m]([b.sub.n](m - 1)) is proposed, which denotes the maximum expected remaining reward accumulated from slot m to the frame horizon slot M, when the current belief vector is [b.sub.n](m - 1). Using Bellman equation, we have

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII], (16)

where

[b.sub.n](m) = [GAMMA]([b.sub.n] (m - 1), [A.sub.n](m), [[theta].sub.n](m) = z). (17)

It represents the updated knowledge of channel state based on the actions and observations of SU in slot m. [[??].sub.n](m) is given by (13). The value function contains two parts: the immediate expected reward obtained in slot m and the maximum expected future reward. The optimal policy is obtained via a fast point-based solution method [35].

4.2.2. Myopic Policy. The solution of optimal policy leads to great computational complexity, especially when the number ofchannels is large. In order to address this problem, a myopic policy is proposed, in which the SU maximizes the immediate expected reward in the current slot m. The myopic policy solution is given by

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. (18)

Generally, the myopic policy balances the computational complexity and the optimality of solution. Dynamic programming can be used to find the solution.

4.2.3. Synchronism among Channels. In the proposed solutions, we calculate an optimal policy and a myopic policy of each channel independently, instead of calculating all of the channels at the same time. The access point is [P.sub.d,n]([[tau].sub.n](m)) = [P.sub.d,th], and the sensing time [[tau].sub.n](m) is obtained. They maybe different from each other. However, in this paper, the SU senses the selected channels with WSS; the sensing time durations of the selected channels should be the same. Therefore, a question exists: how can one ensure the synchronism among the selected channels?

Denote [P.sup.n.sub.d,th] as the detection probability threshold of the selected channel n. In order to maximize the current reward of the slot and ensure the synchronism of the selected channels, we adjust the [P.sup.n.sub.d,th] according to the difference between the selected channels. The sensing time of the selected channel n is obtained from

[P.sub.d,n]([[tau].sub.n](m)) = [P.sup.n.sub.d,th]. (19)

Then the [P.sup.n.sub.d,th] is formulated as a function of sensing time, [P.sup.n.sub.d,th] ([tau](m)). The optimization problem is formulated as

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (20)

subject to

[P.sup.n.sub.d,th] ([tau](m))[greater than or equal to] [P.sub.d,th], (21)

[[tau].sub.n](m) = [tau](m), n [member of] [OMEGA], m [member of] {1, 2, ..., M}. (22)

The detection probability thresholds of each channel are adjusted, based on the sensing time duration. The [P.sup.n.sub.d,th] ([tau](m)) should be set as larger than the predefined threshold [P.sub.d,th], in order to ensure the protection of PUs. Constraint (21) is converted to the interval of sensing time [tau](m), based on the convex character of the function [P.sup.n.sub.d,th] ([tau](m)). One-dimensional exhaustion search method is used to solve this problem. The sensing time in slot m is optimized to balance maximizing the immediate current expected reward and keeping synchronism among the selected channels.

5. Numerical Results

In this section, the proposed optimal selective sensing and access policy will be compared with myopic policy and random policy under different simulation conditions. In random policy, the SU selects channels for spectrum sensing, and the access actions are also selected randomly. The sensing time durations of the three policies are the same, which are obtained from the proposed optimization algorithm (20). The slot size of PUs is fixed, the same as the sensing period of SU system, T = 10 ms. The channel power gains of channels are ergodic stationary. For the sake of simplicity, we assume the adaptive modulations are not used [33, 34]. When the transmission power is [P.sub.1n], the transmission rate is [R.sub.00n](m) = 0.06 Mbps. When the power is [P.sub.2n], the transmission rates are [R.sub.01n](m) = [R.sub.11n](m) = 0.02 Mbps. The bandwidth of each narrowband channel is B = 1 MHz and the sampling frequency is [f.sub.s] = 2 MHz. The decision threshold is [[epsilon].sub.n] = 1.5, the predefined detection probability is [P.sub.d,th] = 0.9, and the number of slot is M = 30.

Figure 5 shows the performance of SU's aggregate throughput under different total channel number N. We consider two cases: N = 6 and N = 2; N denotes the number of subchannels. The idle probability [P.sub.n]([H.sub.0]) and the signalto-noise ratio (SNR) [[gamma].sub.n] are shown in Table 1. It is found that the performance of SU in the optimal policy is better than others. The reason is that the SNR [[gamma].sub.n] affects the sensing time of that channel. When the channel number is large, the channel with smaller SNR affects the whole sensing time duration efficiently. In optimal policy, the SU obtains greater throughput gains when it selects some channels for spectrum sensing, not all of the channels. Comparing with the myopic policy, the selection is more accurate. The SU in random policy may select the channels with lower SNR, which affects the sensing time and the throughput of the SU system. When the channel number is large, the SU in the optimal policy obtains greater throughput than others.

In Figure 6, we study the performance of SU's aggregate throughput under different idle probability of two adjust PU channels. The idle probability of channel n is [P.sub.n](H0) = 0.3 + (n - 1)(0.1 + 0.05 * ([lambda] - 1)). When [lambda] = 1, 2, 3, the idle probabilities of the channels are 0.3, 0.4, 0.5, 0.6, 0.3,0.45, 0.6, 0.75, and 0.3, 0.5, 0.7, 0.9, respectively. It is found that the SU in the optimal policy obtains greater throughput than others under different cases. This is because the idle probabilities of channels affect the updating of belief vector. Under the optimal policy, the SU selects the suitable sensing channels and access actions.

Figure 7 illustrates the performance of aggregate throughput under optimal sensing time and fixed sensing time. Both are under the myopic policy. In the fixed sensing time case, the SU senses the channels in each slot with [[tau].sub.n](m) = 2 ms, n [member of] {1, 2, 3, 4}, m [member of] {1, 2, ..., 30}. It is found that the SU with the optimal sensing time obtains greater throughput. The reason is that the sensing time in each slot is optimized; it balances the tradeoff between maximizing the throughput and sensing efficiency.

Figure 8 illustrates the performance of aggregate throughput under different access strategies. All of them are under the optimal policy. The number of channels is four, and the probability of channel n is [P.sub.n]([H.sub.0]) = 0.3 + 0.2 * (n- 1); the SNR is [r.sub.n] = 2.1 + 0.2(n - 1), n [member of] {1, 2, 3, 4}. It is found that the SU with the mixed access strategy can obtain larger throughput than others. In the underlay access strategy, the SU accesses the channels directly without sensing; the sensing time is saved. However, the transmission power is fixed as P2n. In the overlay access strategy, the SU accesses the channels just when the sensing result is idle. The SU in the mixed access strategy can access the channels, no matter the sensing result is idle or occupied. Thus, the SU can obtain greater throughput.

6. Conclusion

In this paper, we propose a selective spectrum sensing and access strategy in cognitive radio sensor networks. In order to maximize the aggregate throughput of SU system and reduce the spectrum sensing energy consumption, the SU selects some channels for spectrum sensing, accesses these channels based on the sensing results, and accesses the other channels directly. According to the dynamic spectrum environment, a selection making algorithm based on PODMP theory is proposed. An optimal policy and a myopic policy are proposed to solve the POMDP problem. Theoretical analysis and numerical results show that the proposed selection making algorithm can better balance maximizing the throughput of SU system and avoiding unacceptable interference to PUs.

http://dx.doi.org/10.1155/2014/749739

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

The work in this paper is partly supported by programs of Natural Science Foundation of China under Grant nos. 60903170, U0835003, and U1035001.

References

[1] R. Yu, Y. Zhang, S. Gjessing, C. Yuen, S. Xie, and M. Guizani, "Cognitive radio based hierarchical communications infrastructure for smart grid," IEEE Network, vol. 25, no. 5, pp. 6-14, 2011.

[2] R. Yu, C. Zhang, X. Zhang, L. Zhou, and K. Yang, "Hybrid spectrum access in cognitive radio based smart grid communications networks," IEEE Systems Journal, vol. 8, no. 2, pp. 577587, 2014.

[3] C. Yang, Y. Fu, Y. Zhang, S. Xie, and R. Yu, "Energy-efficient hybrid spectrum access scheme in cognitive vehicular Ad hoc networks," IEEE Communications Letters, vol. 17, no. 2, pp. 329-332, 2013.

[4] Z. Liang, S. Feng, D. Zhao, and X. Shen, "Delay performance analysis for supporting real-time traffic in a cognitive radio sensor network," IEEE Transactions on Wireless Communications, vol. 10, no. 1, pp. 325-335, 2011.

[5] S. Haykin, "Cognitive radio: brain-empowered wireless communications," IEEE Journal on Selected Areas in Communications, vol. 23, no. 2, pp. 201-220, 2005.

[6] O. B. Akan, O. B. Karli, and O. Ergul, "Cognitive radio sensor networks," IEEE Network, vol. 23, no. 4, pp. 34-40, 2009.

[7] A. S. Zahmati, X. Fernando, and A. Grami, "Application-specific spectrum sensing method for cognitive sensor networks," IET Wireless Sensor Systems, vol. 3, no. 3, pp. 193-204, 2013.

[8] L.-C. Tseng, F.-T. Chien, A. Marzouki, R. Y. Chang, W.-H. Chung, and C. Huang, "Self-organized cognitve sensor networks: distributed channel assignment for pervasive sensing," International Journal of Distributed Sensor Networks, vol. 2014, Article ID 183090, 10 pages, 2014.

[9] E. Romero, J. Blesa, A. Araujo, and O. Nieto-Taladriz, "A game theory based strategy for refucing energy consumption in cognitive WSN," International Journal of Distributed Sensor Networks, vol. 2014, Article ID 965495, 9 pages, 2014.

[10] Y.-C. Liang, Y. Zeng, E. Peh, and A. T. Hoang, "Sensing-throughput tradeoff for cognitive radio networks," IEEE Transactions on Wireless Communications, vol. 7, no. 4, pp. 1326-1337, 2008.

[11] R. Yu, Y. Zhang, L. Yi, S. Xie, L. Song, and M. Guizani, "Secondary users cooperation in cognitive radio networks: balancing sensing accuracy and efficiency," IEEE Wireless Communications, vol. 19, no. 2, pp. 30-37, 2012.

[12] W.-Y. Lee and I. F. Akyildiz, "Optimal spectrum sensing framework for cognitive radio networks," IEEE Transactions on Wireless Communications, vol. 7, no. 10, pp. 3845-3857, 2008.

[13] G. Bansal, Md. J. Hossain, and V K. Bhargava, "Optimal and suboptimal power allocation schemes for OFDM-based cognitive radio systems," IEEE Transactions on Wireless Communications, vol. 7, no. 11, pp. 4710-4718, 2008.

[14] X. Kang, Y. C. Liang, H. K. Garg, and R. Zhang, "Optimal power allocation for fading channels in cognitive radio networks: ergodic capacity and outage capacity," IEEE Transactions on Wireless Communications, vol. 8, no. 2, pp. 940-950, 2009.

[15] H. Chen, C. K. Tse, and F. Zhao, "Optimal quantisation bit budget for a spectrum sensing scheme in bandwidth-constrained cognitive sensor networks," IET Wireless Sensor Systems, vol. 1, no. 3, pp. 144-150, 2011.

[16] R. Yu, Y. Zhang, M. Huang, and S. Xie, "Cross-layer optimized call admission control in cognitive radio networks," Mobile Networks and Applications, vol. 15, no. 5, pp. 610-626, 2010.

[17] Y. Liu, S. Xie, Y. Zhang, R. Yu, and V. C. M. Leung, "Energy efficient spectrum discovery for cognitive radio green networks," Mobile Networks and Applications, vol. 17, no. 1, pp. 6474, 2012.

[18] M. Wellens, J. Riihijarvi, and P. Mahonen, "Empirical time and frequency domain models of spectrum use," Physical Communication, vol. 2, no. 1, pp. 10-32, 2009.

[19] S. Xie, Y. Liu, Y. Zhang, and R. Yu, "A parallel cooperative spectrum sensing in cognitive radio networks," IEEE Transactions on Vehicular Technology, vol. 59, no. 8, pp. 4079-4092, 2010.

[20] Y. Liu, S. L. Xie, R. Yu, and Y. Zhang, "An efficient MAC protocol with selective grouping and cooperative sensing in cognitive radio networks," IEEE Transactions on Vehicular Technology, vol. 62, no. 8, pp. 3928-3941, 2013.

[21] Z. Quan, S. Cui, A. H. Sayed, and H. V. Poor, "Optimal multiband joint detection for spectrum sensing in cognitive radio networks," IEEE Transactions on Signal Processing, vol. 57, no. 3, pp. 1128-1140, 2009.

[22] Y. Pei, Y. Liang, K. C. Teh, and K. H. Li, "How much time is needed for wideband spectrum sensing?" IEEE Transactions on Wireless Communications, vol. 8, no. 11, pp. 5466-5471, 2009.

[23] P. Paysarvi-Hoseini and N. C. Beaulieu, "Optimal wideband spectrum sensing framework for cognitive radio systems," IEEE Transactions on Signal Processing, vol. 59, no. 3, pp. 1170-1182, 2011.

[24] P. Paysarvi-Hoseini and N. C. Beaulieu, "On the benefits of multichannel/wideband spectrum sensing with non-uniform channel sensing durations for cognitive radio networks," IEEE Transactions on Communications, vol. 60, no. 9, pp. 2434-2443, 2012.

[25] H. Sun, A. Nallanathan, C. Wang, and Y. Chen, "Wideband spectrum sensing for cognitive radio networks: a survey," IEEE Wireless Communications, vol. 20, no. 2, pp. 74-81, 2013.

[26] X. Kang, Y.-C. Liang, H. K. Garg, and L. Zhang, "Sensing-based spectrum sharing in cognitive radio networks," IEEE Transactions on Vehicular Technology, vol. 58, no. 8, pp. 4649-4654, 2009.

[27] M. G. Khoshkholgh, K. Navaie, and H. Yanikomeroglu, "Access strategies for spectrum sharing in fading environment: overlay, underlay, and mixed," IEEE Transactions on Mobile Computing, vol. 9, no. 12, pp. 1780-1793, 2010.

[28] Q. Zhao, L. Tong, A. Swami, and Y. Chen, "Decentralized cognitive MAC for opportunistic spectrum access in ad hoc networks: a POMDP framework," IEEE Journal on Selected Areas in Communications, vol. 25, no. 3, pp. 589-599, 2007

[29] A. T. Hoang, Y.-C. Liang, D. T. C. Wong, Y. Zeng, and R. Zhang, "Opportunistic spectrum access for energy-constrained cognitive radios," IEEE Transactions on Wireless Communications, vol. 8, no. 3, pp. 1206-1211, 2009.

[30] Y. Chen, Q. Zhao, and A. Swami, "Joint design and separation principle for opportunistic spectrum access in the presence of sensing errors," IEEE Transactions on Information Theory, vol. 54, no. 5, pp. 2053-2071, 2008.

[31] W. S. Jeon and D. G. Jeong, "Adaptive sensing scheduling for cognitive radio systems," Computer Networks, vol. 56, no. 14, pp. 3318-3332, 2012.

[32] S. M. Gong, P. Wang, W. Liu, and W. Yuan, "Maximize secondary user throughput via optimal sensing in multi-channel cognitive radio networks," in Proceedings of the 53rd IEEE Global Communications Conference (GLOBECOM '10), pp. 1-5, Miami, Fla, USA, December 2010.

[33] H. Jiang, L. Lai, R. Fan, and H. V. Poor, "Optimal selection of channel sensing order in cognitive radio," IEEE Transactions on Wireless Communications, vol. 8, no. 1, pp. 297-307, 2009.

[34] R. Fan and H. Jiang, "Channel sensing-order settingin cognitive radio networks: a two-user case," IEEE Transactions on Vehicular Technology, vol. 58, no. 9, pp. 4997-5008, 2009.

[35] T. Smith and R. Simmons, "Point-based POMDP algorithms: improved analysis and implementation," http://uai.sis.pitt.edu/ papers/05/p542-smith.pdf.

Weijun Li, (1) Chao Yang, (1) Zhenquan Wu, (1) Beihai Tan, (1) and Yuli Fu (2)

(1) School of Automation, Guangdong University of Technology, Guangzhou 510006, China

(2) School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510641, China

Correspondence should be addressed to Chao Yang; chyang513@gmail.com

Received 20 April 2014; Accepted 8 July 2014; Published 17 July 2014

Academic Editor: Rong Yu

TABLE 1 Denotation N = 6 N = 2 Definition [P.sub.n] 0.3 + 0.1(n-1) 0.3 + 0.4(n-1) Idle probability ([H.sub.0]) [[gamma].sub.n] 2.1 + 0.1(n-1) 2.1 + 0.2(n-1) SNR (dB)

Printer friendly Cite/link Email Feedback | |

Title Annotation: | Research Article |
---|---|

Author: | Li, Weijun; Yang, Chao; Wu, Zhenquan; Tan, Beihai; Fu, Yuli |

Publication: | International Journal of Distributed Sensor Networks |

Article Type: | Report |

Date: | Jan 1, 2014 |

Words: | 6917 |

Previous Article: | A function private attribute-based encryption. |

Next Article: | Design approach based on EtherCAT protocol for a networked motion control system. |

Topics: |