Abstract
We address the problem of gathering information in sensor webs consisting of sensors nodes, wherein a round of communication sensor nodes have messages to be sent to a distant central node (called the base station) over the shortest path. There is a wide range of data gathering applications like target and hazard detection, environmental monitoring, battlefield surveillance, etc. Consequently, efficient data collection solutions are needed to improve the performance of the network. In this article, we take into account the fact that interference can occur at the reception of a message at the receiver sensor. In order to save redundant retransmissions and energy, we assume a known distribution of sources (each node wants to transmit at most one packet) and one common destination. We provide a number of scheduling algorithms jointly minimizing both the completion time and the average packet delivery time. We define our network model using directional antennas and consider Ring, Tree, and Grid Network (and its generality) topologies. All our algorithms run in low-polynomial time.
Introduction
Recent advances in commercial IC (Integrated Circuits) fabrication technology have made it possible to integrate signal processing and sensing in one integrated circuit. These devices are popularly known as wireless integrated network sensors (WINS) and include micro-electromechanical systems (MEMS) technology components such as sensors, actuators, RF component, and CMOSS building blocks. WINS combines microsensor technology and low-power computing and wireless networking in a compact system. Sensor nodes are dispersed over the area of interest and are capable of RF (radio frequency) communication, and contain signal processing (DSPs) engines to manage the communication protocol and data processing before transmission. The individual nodes have a limited capability, but are capable of achieving a large task through coordinated effort in a network that typically consists of hundreds to thousands of nodes.
Networks of such devices can autonomously perform various sensing tasks such as environmental (seismic, meteorological) monitoring and military surveillance, enemy tracking, target detection, distribution of timing and position information, and multi-hop communication [1]. The sensor could be sensing the temperature, the pressure, oil leak, radiation, etc. Generally, these networks are referred to as wireless ad-hoc sensor networks or simply sensor networks. It can also be a collection of mobile sensor nodes that dynamically form a temporary network without the use of any existing network infrastructure or centralized administration. In other words, the primary application of such networks has been in disaster relief operations, military use, conferencing, and environment sensing.
A typical application in web is gathering of sensed data at a distant central processing system named the base station (BS) (or the root node in the network graph). This root node is assumed to be with greater computational, storage, and transmission capabilities than the rest of the nodes in the network. The root node typically serves as an entry point to the sensor network, integrating the sensor network with wired network. In each round of this data gathering application, all the data from all nodes need to be collected and transmitted to the BS, where the end-user can access the data. In some sensor network applications, data collection may be needed only from a region and, therefore, a subset of nodes will be used. A simple approach to accomplishing this data gathering task is for each node to transmit its data directly to the BS. Since the BS is typically located far away, the energy cost to transmit to the BS from any node is quite high. Therefore, an improved approach is to use few and multi-hop transmissions as possible to the BS. In contrast, in many emerging and envisioned applications, sensor networks will be both distributed and wireless (in terms of communication and power) [2]. Distribution is necessary for improving the sensing quality—when the precise location of a signal is unknown, then distributed sensors will allow sensing to take the place closer to the event of interest than by any signal sensor. Distribution also improves robustness to environmental obstacles, which is especially crucial in situations where sensing requires line-of-sight. The sensor units, thus, need to rely on finite, local energy sources and wireless communications channels. Finally, shorter range communication is generally much cheaper than longer range communication because the radio-signal power can drop off with a quadratic power of distance [3]. As a result, it is much cheaper to transmit information using multi-hopping among sensor units.
One way to reduce the amount of data that must be transmitted (and reduce energy) in radio networks is scheduling forwarded information gathered by sensor nodes. The scheduling process is intended to prevent collisions that might arise from improper or inefficient use of the network resources by random messaging across the network without taking into account the network model. Then, we aim to solve the problem for a given certain topology of radio network and a network model, initial information (messages) located at some nodes, and a single designated destination. We consider the Ring and the Tree networks and give optimal scheduling solutions that achieve a minimum completion time as well as a minimum average delivery time. For the Grid network topology (and its extensions) we propose an approximation algorithm to our problem providing a 1.5 approximation ratio for maximum completion time. We present low-polynomial time solutions for our problem for the above-mentioned network topologies and we also provide some useful insights.
Our research can be practically implemented in those networks; for example, whenever a node has a packet to transmit, it sends a very short message (to save battery energy) called a “Schedule Request” to a central computer (BS) that serves as the only destination in the network. The requests can be sent over an upstream control channel (or multiple upstream control channels) using, for example, ALOHA or CDMA random-access schemes. The base station is assumed to have full information about the input and the network topology. It produces a schedule and periodically transmits the schedule requests (called a MAP message) to the nodes that requested to send. This is done over a separate downstream control channel. Note that the separation in channels is between the downstream and upstream control channels and the channel in which the data messages are transmitted.
In our previous work [4], we consider linear, two-branch, and star (or multi-branch) network topologies. For each topology we provided an optimal schedule for routing all the messages to the base station, jointly minimizing both the completion time and the average packet delivery time, while all our algorithms run in polynomial time. It should be noted that the presented (optimal) data gathering algorithms are centralized and require cooperation between nodes which is not necessarily compatible with the requirements of sensor networks. Therefore, for stronger requirements, these algorithms may no longer be practical. However, they continue to provide a lower bound on data gathering time of any given collection schedule.
We focused our analysis on systems equipped with directional antenna since from comparison results (with respect to completion time) between directional antenna systems to omni-directional antenna systems obtained by Florens and McEliece [44] it follows that the former outperforms the latter by 50% on Linear Network. The idea of using directional antenna in wireless communication is not new. It has been already extensively used in base station of cellular networks for frequency reuse, to reduce interference, and to increase the capacity of allowable users within a cell. However, the applications of directional antenna to wireless ad-hoc or sensor network to reduce the transmit power of each node to achieve power-efficiency in the routing problem is relatively new.
Our problem was partly addressed over the past few decades. A number of works (see [5–21]) discuss radio networks under a similar network model, but
with a different target function that leads to maximizing the number of transmissions in one hop
without referring to specific sources and destinations across the networks. This problem and its
variations are known to be NP-hard, and the suggested solutions are heuristic approximation
algorithms. Other works (see, e.g., [22–29]) dealt with our problem considering Grid and Tree
topology, but under other (weaker) network models. For example, authors in [22] used the same target function as we suggest, but the
discussion is based on several variations of “hot potato” routing. In this model each
node can successfully receive and transmit more than one message simultaneously. This is a
completely different model from the classical radio network model, which we chose to apply in our
analysis. Furthermore, papers that are concerned with “hot potato” routing offer upper
and lower bounds on performance in terms of order of magnitude, while in our work, we produce exact
results or tight bounds to our problems. Similarly, other papers (see [30–34])
assume the same target function to minimize the completion time and the model under which each node
can successfully receive and transmit more than one message simultaneously is explored in Ring,
Tree, Grid networks, and general graphs. Y. Choi et al. [35] present a protocol for routing data messages from any
sensor to the base station in a sensor network two-dimensional grid topology, by using and
maintaining a spanning tree with root serving as the base station completely ignoring the
interferences. Sridhran and Krishnamachari [36] presented some problem of converge-casting flows with rate control from nodes to the
root of the given routing tree of the network. Lau and Zhang [37] and Krumme et al. [38] also study the gossiping problem of communicating a unique
item from each node in a graph to every other node under two-dimensional grid network topology. They
have suggested that the gossiping problem can be studied under four different communication models,
which have different restrictions on the use of the links, as well as the ability of a node in
handling its incident links. The four models being considered are: the full-duplex, all-port model, the full-duplex, one-port model, the half-duplex, all-port model, and the half-duplex, one-port model, which can be identified by the labels F∗, F1, H∗,
and H1 respectively.
In their [37, 38] notations, we assume a network model denoted H1 or called “The half duplex one-port model,” since this model of communication makes the weakest assumptions about both hardware and software capabilities. Gronkvist [39] assumes a stochastic model for the general network topology problem and presents a number of results under this model. Some other papers (see [40–43]) transformed a network into an undirected graph G(V,E) with V as the set of nodes and E as the set of edges and modeled the transmission area and the interference area as balls in graph by introducing two parameters—d T , the transmission radius and d I the interference radius with d I ≥ d T . They deal with gathering information in specific radio networks: Line, Grid, with the same target function of minimum completion time, ignoring the requirement of minimizing average delivery time and using omni-antenna. They also show [43] that in the case of general network the problem is NP hard. Finally, Florens and McEliece [44] consider exactly our problem under a criterion of minimum completion time, ignoring the requirement of minimizing average delivery time. In fact, their scheduling strategy does not consider the idle time of the messages and produces unnecessary dependences among messages. This, consequently, causes unnecessary delays for messages. For example, it is unreasonable not to transmit a message toward the destination if it can be transmitted without any delay. They [44] also do not provide any time-complexity analysis of their algorithms.
This article is organized as follows. First, we explain in detail the network and channel model with a precise definition of our problem. Next, we address the Ring Network case. After that, we consider the Tree Network problem. The optimal scheduling strategy under both target functions for Ring network and Tree Network is explained in Sections 3.3 and 3.4, respectively. Finally, we consider a Grid Network and its extensions. We propose an approximation algorithm to our problem providing 1.5 approximation ratio for maximum completion time, where the approximation bound holds for any BS location in the Grid Network. We conclude the article with directions for further research.
General Network and Channel Model
A sensor or ad-hoc network is modeled as a directed graph G(V,E) with
N nodes (in the case of Grid network N × N
nodes), V is a set of nodes, each of them representing a communication device,
where each node v ∊ V is a sensor that can transmit and
receive data; E is a set of edges connecting nodes. There is an edge between node
v and w if and only if v can hear
w's transmissions when v points its directional
transmission antenna towards w. The network has a special node
v0, the Base Station (BS), which
serves as a destination for all messages. This node is assumed to be the root of the graph with
large computational, storage, and transmission capabilities. The root node typically serves an entry
point to the sensor network, integrating the sensor network with an external wired network. We assume that at time t = t0, each node
v ∊ V has at most one message to transmit to the
destination. This is referred to as a legal input. We assume that all the information about the input and topology of the network is available at
the BS and there are separate, collision free, control channels between the BS and the other
nodes. We also assume that every node in the network including the BS has the same transmission power
r and that a node cannot transmit and receive message simultaneously. In our model
we assume that the capacity of each node's buffer is one message. We also assume the use of directional antennas. The signal from node v to node
w propagates in a straight line in the direction of node w without
dispersing to other directions. We also assume that if a message arrives successfully to the
receiver, the receiver can send an acknowledgment to the sender using directional antenna on a
separate channel.
Based on the above, the conditions for a successful transmission are—∀v, w ∊ V a message from node v that is transmitted to node w, arrives successfully at node w if for all simultaneous transmissions from ∀u ∊ V, u≠v, w using directional antennas pointed in the direction of v the following relations hold: |v - w| ≤ r, |u - w| ≥ (1 + β)r, β > 0 (here v stands for the location of node v). We also assume in our model that time is slotted and the one-hop transmission consumes one time slot (TS). As we mentioned above the node can either transmit or receive in one time slot. This model of the channel is called in the literature S-TDMA channel model.
In summary, we model our network by a rooted graph, where the root represents the BS and an edge represents an existing wireless connection (a link) between two stations. A necessary condition for the connection existence between v and w is the fact that v and w are at a distance between them of at most r. We denote by d(v, w) the distance, measured in the number of hops, between node v and node w.
Problem Statement and Our Performance Measure
In this section we carefully define our problem. We are interested in solving the problem for various network topologies—Ring, Binary Tree, and Grid networks.
Problem Statement
Assuming some network topology with N nodes (or N × N nodes in the case of Grid network), M of which have messages to be send to BS (each node has to transmit at most one message), and assuming our network model with the fact that BS is receiving the requests for transmission from the nodes that have a message to send to the BS on separate, collision-free channels, the purpose is to find an algorithm that schedules and routes all the messages to the BS in a minimum time (primary criterion) and also minimizes the average message-delivery time (a secondary criterion, which is equivalent to minimization of the sum of the message idle times).
General Target Functions
We wish to find a scheduling and routing solution for every possible input set of messages to
destination. We denote by
T
end
min
the minimum time
for all messages to reach the destination, and by T
i
the time it takes for message m
i
to reach the
destination. The delay time or idle time Δ
i
of a
message m
i
is a total sum of delays that
m
i
incurs starting at
t
0
until arriving to the destination. Denote by
S the minimum sum of idle times for all messages. Thus,
First, we investigate a Ring Network topology, with each sensor playing a role of node, see Fig. 1.

The ring network.
The Ring consist of N nodes including the BS and each node denoted by
v
i
, i = 0, 1.N
− 1. The network has M,
M ≤ N − 1 messages to transmit to the BS. In this
topology, it is obvious that the transmissions towards the BS can go two ways, clockwise or
counterclockwise, i.e., node v
i
can transmit either to
node vi-1 or to node
vi+1. We also assume that the distance between
any two adjacent nodes is less than or equal to r. Following our problem definition
we would like to prove the existence of an optimal scheduling algorithm that can handle any type of
a legal input (at t
0
any sensor keeps at most one
message). The optimality of the algorithm is measured in terms of
T
end
min
and S. We develop
the proof in stages, by proving the existence of such an algorithm that schedules and routes all the
messages to the BS in a minimum completion time (primary criterion) and also minimizes the average
message-delivery time. We denote by
We compute, for each message m
i
the values
Thus, we obtain for each message m
i
the couples
Algorithm Ring: If
After the direction for each message is determined, we apply Two Branch Network Algorithm [4].
Theorem 1
The scheduling produced by the Ring Network Algorithm schedules and routes all the messages to the BS in a minimum completion time (primary criterion) and also minimizes the average message-delivery time.
Proof
According to our model, the capacity of each node's buffer is one message. It is obvious
that two different messages can not be sent to the BS in overlapping paths in different directions
since otherwise there must be a node with a buffer capacity 2. As a consequence of it we will find a
pivot node message on the Ring. The pivot node message is the message such that all
the messages that are located closer to the BS (including this message) will be transmitted to BS in
the same direction and the remaining messages, (if any) will be transmitted in the other direction.
In other words, in the general case there are at most two groups of the messages (as we will see
latter), one part will be transmitted in clockwise direction, while the other part will be
transmitted counterclockwise. Let us denote by
After we apply the Linear Network Algorithm independently to the same input in
both directions there are two cases: Case 1. For all the messages we have Case 2. For one message holds
In this case it is easy to see that applying Two Branch Network algorithm [4] gives
Theorem 2
Given a ring network, minimizing the sum of idle times for all messages does not lead to minimizing the total completion time.
Proof
Let us look at more specific example (case 1, Theorem 1, for all
The running time of this algorithm is dominated by the running time of the Two Branch Network algorithm which is O(N2).
The base station (BS) plays a role of the root of a binary tree graph and each sensor play a role
of nodes. In the Binary Tree Network (BTN) G(V,E,L), the root is connected to the
(possibly empty) left and also the right sub-trees that are BTNs. Every such connection is a
Line Network (see [4]),
where we call the endpoints of a connection main nodes. In other words, main nodes
in BTN are connected by line networks with sensors that serve as edges, see Fig. 2. The number of the line networks in the graph denoted by
L and we numerate them from the left to the right as depicted in Fig. 2. We denote by

The BTN network.
In this topology, we define a legal input x ∊ X as a
collection of sensor nodes (exclude the main nodes and the BS) that have at most one message to
transmit to the base station at t = t0. The goal
is to transmit M messages to the BS,
Let us denote by
Again, in order to do that, we develop the proof in stages.
Algorithm BTN. Every node in line networks behaves as a normal node in line network algorithm, i.e., if some node contains a message that can be sent to its (right) neighbor (with no message), then we send it. Every main node acts as following; if it has a message, then the message is transmitted to it's parent. If the main node does not have a message to send and only one node of two possible nodes from the children line network connections has a message to transmit, the main node receives it. Moreover, once the main node decides a direction from which it receives the messages it continues to receive them until there is no message from this direction. The main node does not serve the other line network connected to it from the children direction as long as the messages from the direction that has been started to be served by the main node continue to arrive. In the case when the main node has been idle and now there are two messages from two different directions to arrive at this node, it arbitrarily starts to serve one of the lines. The BS applies Two Branch Network algorithm [4].
Theorem 3
The BTN Algorithm optimizes both criteria.
Proof
Notice that the maximum transmission rate of any main node is 1/2. This is due to the fact that according to our model the main node cannot transmit and receive the message at the same time. The conclusion from the above observation is that in order to achieve maximum efficiency it is sufficient to supply the messages to all main nodes at rate 1/2.
We begin our explanation for one arbitrary sub-tree, say the left sub-tree. We would like to
“create” groups of messages of maximal length [4] that can be transmitted at the maximum rate (1/2). By using
the algorithm above, according to the Two Branch Network algorithm and the Linear Network algorithm
[4] at
t
0
, we have at every j-line a number,
say z, of independent groups of maximal length of first order. We denote them as
In addition, in the BTN, we denote by
In addition, it is clear that if any two groups of messages arrive to a main node simultaneously,
the time that it takes for the two groups to pass through the main node is independent on the group
that the main node starts with. Finally, we obtain two logical lines with
Remark 1
Notice that the main nodes may have messages at time t 0 . The algorithm remains the same.
Remark 2
The above scheme can be generalized to deal with k-ary trees. The only difference is that we construct independent groups of messages of maximal length for k subtrees and BS applies a strategy of k-Star Network algorithm.
The running time of the above scheme is dominated by the running time of the Two Branch Network algorithm, which is O(N2), with N standing for number of nodes.
In this section we will address the problem of a grid topology. The network has NxN nodes. In our model definition each node can do at most one operation at a given time slot, meaning that it can not transmit and receive a message at the same time; it can at most transmit one message or at most receive one message. There are two kinds of nodes as depicted in Fig. 3: sensor and relay nodes. The sensor nodes are located in the sensing zone. The sensor node has the same function as in the previously discussed networks; it senses the information which is transformed to a one message to be transmitted to the BS. Relay nodes, are located outside the sensing zone on the right and the down border of the Grid Network.

Grid network.
The relay node acts as a messages deliverer; it just receives the messages from the sensing zone
and transmits them to the BS (the relay node operates similarly to sensor node in sense that it
performs only one operation per one time slot). The coordinate of node
v
i, j,
0 ≤ i, j ≤
N − 1 is denoted by P
i,
j
(x
i
,
y
j
) and the distance between any two adjacent nodes is
the same and equals 1. For now, we will place the destination (BS) at node
P0,0(x0, y0)
(the relay nodes are coordinated at P0,
j(x0,
y
j
) and
Pi,0(x
i
,
y0), j = 1, .,
n,
i = 1,.,n). In our model a message can
advance to the destination only on the shortest path from the source to the destination. Therefore,
a message from point P
i,j
(x
i
, y
j
)
can move to the destination only in the right or the down direction as marked in Fig. 3. Therefore, if the length of the shortest
path from the source to the destination (BS) is d =
x
i
+
y
j
steps (hops) long, then in order to reach a
destination, we need to move x
i
steps in the
x direction and y
i
steps in the
y direction. Thus, the number
In what follows we propose an approximation algorithm to our problem providing 1.5 approximation
ration for maximum completion time. Before we introduce our heuristic and prove its performance and
bounds we will present some definitions, observations, and lemmas that will assist us in the
analysis of our algorithm. Let us define by Dmax
(Dmin) the diagonal that contains at least one message at the maximum
(minimum) distance from the BS and
dmax(dmin) be the corresponding distances,
respectively. Denote by
Observation 5
Given x ∊ X, so that any two adjacent nodes v i,j , v k,l satisfy |(x i + y j ) - (x k + y l )| ≥ 2, it is possible to send all the messages to BS with T end min = dmax and S = 0.
Observation 6
Given a Grid Network of size NxN with M messages to be sent at
t0 to the BS, it always happens that
It is very important to mention that for achieving this lower bound (as we will see later in the example of lemma 7), the input must enable a partition into two groups of messages with the messages belonging to one group having no affect on the movement of messages from other
We denote by S[a,b] a set of all messages with a distance from BS being between values a and b.
Lemma 7
Assuming a Grid network including M messages to be sent at
t0 to the BS, any scheduling algorithm fulfills the lower bound
Ω(T
end
) = M +
d
min
− 1, if it fulfils the necessary condition
that:
Proof
According to d
min
definition, it always happens that
Below, we show an example that the condition described in Lemma 7 is not sufficient. Suppose we have the following input—four messages coordinated at (3, 2), (4, 2), (5, 2), (6, 2) respectively in the Grid Network.
It is easy to see that this input fulfils the necessary condition of lemma 6, but according to the network model we will never succeed in achieving this bound, since in any scenario at least two close messages (at distance at most 1) will compete at the same time slot to be sent to BS. We call this situation a collision.
Let
The algorithm numerates diagonals
Our algorithm treats two possible types of diagonals from H. The first type of
diagonals includes diagonals that contain at least two messages. In this case, we treat this
diagonal
After we finish scheduling messages from
The second type of diagonals includes those from H with one message to send.
Suppose we are dealing with diagonal
After we finish scheduling messages from
Lemma 8
Grid algorithm explained above satisfies criteria (1) and (2) when he input fulfills condition of Observation 5.
Proof
Since the distance between any two adjacent messages is at least 2, the algorithm does not insert any additional delay. □
Lemma 9
The Grid Algorithm schedules the messages without collisions.
Proof
The algorithm schedules two types of diagonals. The first type concerns about the diagonals
containing more than one message. It is easy to see that the messages from any diagonal of such kind
are divided into two groups: those to be routed to the BS towards x-axis and those
to be routed to the BS towards y-axis. Thus, the collisions between the messages
from the two groups are impossible. (For a diagonal containing only one message we even do not have
groups.) In addition, no collision occurs at the BS station since the algorithm gives to the
messages successive arrival time (every group arrives at BS with 1/2 rate. The second type of
diagonals includes those containing exactly one message. The method based on values
It remains to be seen that no collisions are possible between the messages of different diagonals. This follows immediately due to the fact that we insert an additional delay time between transmissions at consecutive time slots from adjacent diagonals according to algorithm. □
Theorem 10
Given x ∊ X with n messages when
Proof
According to lemma 7,
Proposition 1
The second constraint condition is unbounded in the Algorithm Grid
Proof
We construct an example when we have to schedule the special input with all diagonals in the grid
containing one message to send. The messages are located in the following manner: each message from
an odd numbered diagonal D
i
is located at node
(i − 1, 1), 3 ≤ i ≤ 1 +
2(N − 1), and each message from an even numbered diagonal
D
j
is located at node (1, j −
1), 4 ≤ j ≤ 2 + 2(N − 1). It is clear
that the optimal schedule that is applied to this input leads to
The proof follows. □
The running time of the proposed algorithm is affected by efficient maintenance of maximal and
minimal coordinates in nodes with messages at each diagonal and computing values
Remark 3
The algorithm above can be generalized to work with graphs that represent connected (partial) paving of plane by grids of the same size, see for example Fig. 4. Moreover, the base station can be located anywhere in the graph. The basic idea is to divide (logically) the graph into 4 quarters and determine the quarter with a closest message to BS. Then we continue to send messages to BS until we have a time slot with no message. In that case, we look for the next closest message to BS that has not been sent yet. In order to avoid collision between the last sent message and the new one we send them in different directions (we can always do it).

Cube and its planar representation.
This article continues to investigate the problem that we began in [4]: how to minimize the time completion and the sum of delays needed for gathering information at the base station, from the sensor network nodes. We assume a half-duplex, one-port model equipped with directional antennas. Polynomial time algorithms for Ring Network, Tree Network, and Grid Network topologies have been designed and analyzed. We provide optimal solutions for Ring and Tree networks and for the Grid Network, we present a 1.5 approximate solution. A possible future research includes examination of more general topology models such as a general graph problem, as well as tight analysis of other target functions, e.g., including the cost of transmission depending on battery energy.
