Maintaining Optimal Communication Chains in Robotic Sensor Networks using Mobility Control

11
Mobile Netw Appl (2009) 14:281–291 DOI 10.1007/s11036-008-0102-0 Maintaining Optimal Communication Chains in Robotic Sensor Networks using Mobility Control Cory Dixon · Eric W. Frew Published online: 30 September 2008 © Springer Science + Business Media, LLC 2008 Abstract This paper presents a decentralized mobility control algorithm for the formation and maintenance of an optimal cascaded communication chain between a lead sensor-equipped robot and a control station, using a team of robotic vehicles acting as communication relays in an unknown and dynamic RF environment. The gradient-based controller presented uses measurements of the signal-to- noise ratio (SNR) field of neighbor communication links, as opposed to relative position between nodes, as input into a localized performance function. By using the SNR field as input into the control system, the controller is reactive to unexpected and unpredictable changes in the RF envi- ronment that is not possible with range-based controllers. Since the operating environment is not known a priori to deployment of a robotic sensor network, an adaptive model- free extremum seeking (ES) algorithm, that uses the motion of the relays to estimate the performance function gradient, is presented to control the motion of 2D nonholonomic vehicles acting as communication relays using the gradient- based controller. Even without specific knowledge of the SNR field, simulations show that the ES decentralized chaining controller using measurements of the SNR field, will drive a team of robotic vehicles to locations that achieve the global objective of maximizing capacity of a cascaded communication chain, even in the presence of an active jamming source. C. Dixon (B ) · E. W. Frew Aerospace Engineering Sciences Department, University of Colorado at Boulder, Boulder, CO, USA e-mail: [email protected] Keywords relays · robotics · communication chains · ad hoc networks · unmanned aircraft 1 Introduction This paper presents a decentralized mobility control algo- rithm for the formation and maintenance of an optimal cascaded communication chain between a lead sensor- equipped robot and a control station using a team of robotic vehicles acting as communication relays. The mobility con- troller presented in this paper is based on maximizing the capacity of a local 3-node network to maximize the end-to- end capacity of the entire communication chain, and there- fore provide an optimal high-bandwidth communication link between the lead node and the control station, by using measurements of the local signal-to-noise ratio (SNR) field. Because the mobility controller is based on the gradient of the local 3-node network capacity performance function, and not on geographic position or range, the controller can form and maintain an optimal, and connected, communica- tion chain in unknown dynamic RF environments that may also contain active jamming noise sources. Since it is not possible to predict the gradient of the SNR field in a physical environment, extremum seeking (ES) methods are applied to the decentralized gradient controller to provide a real- time estimate of the SNR gradient using motion of the relay. In wireless sensor networks consisting of mobile robotic nodes, a common mode of operation is to form a commu- nication relay chain between a lead (exploring) node and a control station [10, 17]. In this type of operation the lead node is typically deployed to explore a remote region of interest that is beyond direct RF communication range, or out of line-of-sight, to the control station. To provide non- line-of-sight service and extend the communication range

Transcript of Maintaining Optimal Communication Chains in Robotic Sensor Networks using Mobility Control

Mobile Netw Appl (2009) 14:281–291

DOI 10.1007/s11036-008-0102-0

Maintaining Optimal Communication Chains

in Robotic Sensor Networks using Mobility Control

Cory Dixon · Eric W. Frew

Published online: 30 September 2008

© Springer Science + Business Media, LLC 2008

Abstract This paper presents a decentralized mobility

control algorithm for the formation and maintenance of

an optimal cascaded communication chain between a lead

sensor-equipped robot and a control station, using a team

of robotic vehicles acting as communication relays in an

unknown and dynamic RF environment. The gradient-based

controller presented uses measurements of the signal-to-

noise ratio (SNR) field of neighbor communication links,

as opposed to relative position between nodes, as input into

a localized performance function. By using the SNR field

as input into the control system, the controller is reactive

to unexpected and unpredictable changes in the RF envi-

ronment that is not possible with range-based controllers.

Since the operating environment is not known a priori to

deployment of a robotic sensor network, an adaptive model-

free extremum seeking (ES) algorithm, that uses the motion

of the relays to estimate the performance function gradient,

is presented to control the motion of 2D nonholonomic

vehicles acting as communication relays using the gradient-

based controller. Even without specific knowledge of the

SNR field, simulations show that the ES decentralized

chaining controller using measurements of the SNR field,

will drive a team of robotic vehicles to locations that achieve

the global objective of maximizing capacity of a cascaded

communication chain, even in the presence of an active

jamming source.

C. Dixon (B) · E. W. Frew

Aerospace Engineering Sciences Department,

University of Colorado at Boulder, Boulder, CO, USA

e-mail: [email protected]

Keywords relays · robotics · communication chains ·ad hoc networks · unmanned aircraft

1 Introduction

This paper presents a decentralized mobility control algo-

rithm for the formation and maintenance of an optimal

cascaded communication chain between a lead sensor-

equipped robot and a control station using a team of robotic

vehicles acting as communication relays. The mobility con-

troller presented in this paper is based on maximizing the

capacity of a local 3-node network to maximize the end-to-

end capacity of the entire communication chain, and there-

fore provide an optimal high-bandwidth communication

link between the lead node and the control station, by using

measurements of the local signal-to-noise ratio (SNR) field.

Because the mobility controller is based on the gradient of

the local 3-node network capacity performance function,

and not on geographic position or range, the controller can

form and maintain an optimal, and connected, communica-

tion chain in unknown dynamic RF environments that may

also contain active jamming noise sources. Since it is not

possible to predict the gradient of the SNR field in a physical

environment, extremum seeking (ES) methods are applied

to the decentralized gradient controller to provide a real-

time estimate of the SNR gradient using motion of the relay.

In wireless sensor networks consisting of mobile robotic

nodes, a common mode of operation is to form a commu-

nication relay chain between a lead (exploring) node and a

control station [10, 17]. In this type of operation the lead

node is typically deployed to explore a remote region of

interest that is beyond direct RF communication range, or

out of line-of-sight, to the control station. To provide non-

line-of-sight service and extend the communication range

282 Mobile Netw Appl (2009) 14:281–291

of the lead node (and thus its operational range), additional

robotic nodes are deployed in a convoy fashion behind the

lead sensor-equipped vehicle to act as communication relays

in a cascaded communication chain, relaying high priority

data back to a control station. For this work, the relaying

nodes are tasked to act solely as communication relays

and their only objective is to find and maintain a maximal

capacity of the communication chain while the lead node

explores the environment.

Typically, the mobility control algorithms presented in

literature for these types of robotic team deployments as-

sume a “disc communication model,” where nodes can

communicate at some fixed data rate if they are within a

range of r of each other. Therefore, in these mobility control

algorithms the objective of an individual relay is to maintain

a minimum separation distance between the two network

nodes it is relaying data between. In essence the goal of

these controllers is to maintain network connectivity. Since

these controllers do not take into account the radio trans-

mission properties in the operational RF environment, they

are not reactive to changes in the environment. Furthermore,

these controllers do not lead to optimal communication

chains when localized noise sources are present in the

environment. In fact, depending upon the capacity model

used it can be shown for certain locations of a noise source

that a relaying node that positions itself based on a range

based controller can lead to worse performance than using

the direct communication. Therefore, changes in the RF

environment, such as the introduction of a jamming node,

will lead to a reduction of network quality-of-service that

the nodes cannot react to.

The work in this paper seeks to improve the communi-

cation capacity of a cascaded communication chain, over

that of a range-based controller, in a complex and time

varying RF environment containing localized noise sources

using a reactive and adaptive controller. Instead of using

range-based measurements as inputs for the decentralized

controller, the controller presented in this paper uses mea-

surements of the SNR with its two communication neigh-

bors. The measurement of the individual link SNRs, and

their gradients with respect to the position of the relay,

are used as inputs into a local performance function to

generate a gradient that the relay node follows to improve

the local 3-node capacity. By improving the local 3-node

chain capacity, it will be shown the relays maximize the

capacity of the entire chain and maintains connectivity of

the chain. For small low power sensor networks that have

limited transmit and computation powers, mobility may be

the only means to overcome significant changes in the RF

environment once they are deployed.

Due to the complexity of the RF environment in the phys-

ical world, and the difficulty in predicting RF propagation

properties with terrain, weather, and antenna affects, it is not

practical to assume that the nodes can predict the gradient of

the SNR field based solely on their geographic position. Just

like it is unpractical to assume a communication capacity

given only node separation distances. In addition, even if the

RF propagation environment has been previously measured,

the presence of a jamming node at an unknown location will

mean that an accurate prediction of the gradient is impossi-

ble. However, there are methods available for estimating the

positional gradient of a measurable field using the motion

of a vehicle within a sampled environment. Thus, after it

is shown that the decentralized gradient-based controller

will drive the team of relays to optimal locations when the

gradients are fully known, the problem will be reformed into

an ES framework [2] where the positional gradient of the

performance function is estimated from the motion of the

nodes.

Fundamental to the ES chaining algorithm is that to esti-

mate the gradient of the communication performance field,

cyclic motion of the vehicle is required. For generality in

application of this work to different vehicle types, a bicycle-

like kinematic vehicle model [12], exhibiting Dubins’ vehi-

cle constraints [8], is assumed and a Lyapunov Guidance

Vector Field (LGVF) controller [9] is used to provide the

cyclic motion of the vehicle by driving the vehicle to a

globally stable limit cycle (i.e. a circular orbit) about a

virtual center point. The ES controller uses the estimate of

the gradient from the cyclic motion of the vehicle to move

the virtual center point (or orbit point) to a location for

optimal relaying.

Finally, since this work is motivated by our Ad Hoc

Unmanned Aircraft (UA) and Ground Network (AUGNet)

[5] research into networks that combine ad hoc networks

with UA acting as airborne communication relays, this

paper presents simulations of the ES chaining controller

running on UA. In the simulations, there is a team of UA

acting as communication relays between a ground based

leader node and a control station. The ground based leader

node is representative of a mobile sensor, or a human

explorer such as a firefighter [22], that is allowed to freely

wander the environment but has relatively slow dynamics

compared to the aircraft. The performance of the aircraft and

the radios are modeled off of values taken from AUGNet

experiments.

1.1 Contributions

It is common in literature to use the mobility of the net-

worked nodes to maintain connectivity using range-based

metrics. This paper expands this idea further and shows that

the mobility of the nodes can be used to specifically control,

and thus improve, the quality-of-service of the network.

Beyond using mobility control to optimize a network QoS,

this paper shows that the mobility of networked vehicles can

Mobile Netw Appl (2009) 14:281–291 283

be used to overcome the effects of being actively jammed

from RF noise sources by measuring the SNR field.

The focus of this paper is on forming an optimal cascaded

communication chain using decentralized mobility control

of the relays, where optimal is defined as maximizing the

capacity of the chain from the leader node to the control

station, i.e. then end-to-end capacity of the communication

chain. Thus, this paper introduces a decentralized gradient-

based mobility controller that maximizes the local 3-node

network capacity, using only local SNR information, and

proves that the controller will result in achieving the global

objective.

Finally, since it is not practical to predict the gradient of

the SNR field in physical environments, or even possible

when localized noise sources are located in the environment

at unknown locations, this paper presents the gradient-

based chaining controller in an ES framework that has

been designed for use on planar nonholonomic vehicles. By

using the ES framework, the gradient of the performance

function used can be estimated on-line in real-time. Thus,

this paper presents a solution that can be deployed on robots

in physical environments and not just in simulation.

2 Background & related work

While there has been significant work in robotic team

control requiring network communications (e.g. [4, 16, 21]),

only a small body of work [3, 7, 11, 19] exists that ex-

plicitly incorporates communication objectives into larger

multi-objective control framework. Although the goal is to

optimize network parameters in these works, the perfor-

mance metrics are typically transformed into position based

constraints and cost functions using the disc communication

model.

For example, in [11] the authors make the claim that

the set of optimal positions of the relay nodes lies entirely

on the line between the source and destination nodes, and

that the relay nodes must be evenly spaced along this line.

However, as will be shown in this paper, in a physical

environment the assumptions required for position based

control are typically invalid since localized noise sources,

terrain affects, power differences, and antenna patterns will

cause the optimal location to move off of the center line

between the two end nodes and possibly away from the

geometric center point of the line. This work overcomes

this by defining the optimal communication chain in a more

generalized sense using the SNR instead of position. The

communication chain that is formed is more robust and is

reactive to changes in the RF environment.

ES algorithms have been presented in [23] to drive a

nonholonomic vehicle in a sampled environment, however

driving the vehicle directly using the ES framework limits

the application to limited vehicle types. Specifically the ES

algorithm in [23] can only be used on vehicles that can

go forward and backwards as the controller modulates the

forward velocity of the vehicle, while holding a constant

turn rate. The use of the LGVF controller in this work to

generate a circular motion, with forward vehicle velocity

and bounded turn rate capabilities, can be applied to a much

wider class of robotic vehicles; from simple point mass

models to UA. In addition, the circular motion of the vehicle

due to the LGVF is more natural for certain vehicles such

as UA.

The authors of [23] have proposed a second part to

their work (but is still unpublished) where the velocity of

the vehicle is held constant and the turning rate of the

vehicle is modulated by the ES algorithm. This resulting

motion of this controller is a forward motion of the vehicle

with a “wiggle”. The idea being that if one can mount a

sensor on a long boom to the vehicle, a small wiggle of the

vehicle will result in a large displacement of the sensor in

the sampled environment. However, on some vehicles it is

not practical to mount a long boom sticking out the nose

of the vehicle. In addition, there are some environments

where the displacement of the sensor on a boom will still

not provide a large enough displacement for the sensor to

measure the change in the sampled environment, either due

to the field structure or the sensor resolution. Thus, driving

the vehicle in a circular motion, about a virtual center point,

provides a more generic ES framework for mobility control

of vehicles.

3 Problem statement

Let a planar cascaded relay network of nodes ni, at posi-

tions pi ∈ R2, be designated N = {n1, n2, · · · , nN}, where

nodes n1 and nN are the source (S) and destination (D)

nodes, respectively. The source node is the leader node and

the destination node is the control station for the scenario

used in this work. Let R = {n2, n3, · · · , nN−1} ⊂ N be the

set of relay nodes in N , with R = N − 2 being the number

of relays in the network.

A cascaded network indicates that the topology of the

network links are taken to be fixed and ordered from source

to destination, e.g. a communication chain as shown in

Fig. 1. For this paper, the routing order of the nodes is

determined by a higher level algorithm that chooses when

to deploy a relay node to assist the leader. The focus of

this paper is given a robotic team of R relays and network

topology decided by a higher-level algorithm, form an opti-

mal communication chain. This paper does not specifically

address the question of when to deploy an additional relay

to improve the capacity of the communication chain, e.g.

we do not seek to answer the question of how many relays

284 Mobile Netw Appl (2009) 14:281–291

1

2

3

4

5

C21

C32

C43

C54

2 Mbps 1 Mbps

Figure 1 A planar cascaded network showing a directed communica-

tion chain with a fixed communication ordering of the nodes. For this

chain, the throughput from node 1 to node 6 is only 1 Mbps

provides the optimal end-to-end capacity given the current

RF environment.

Define the capacity of the network N to be C(N ). Then

the problem considered in this paper is to solve the global

optimization problem

p∗i = arg max

ni∈R,pi∈R2

C(N ) (1)

by moving the set of relay nodes, subject to vehicle

dynamics

pi = f(P, ui

)(2)

using a decentralized controller based only on local infor-

mation of the SNR field. The next section will present the

radio propagation and capacity models that are assumed for

the problem and simulations presented in this paper. These

are presented to help motivate the use of the SNR as input

into the control system.

3.1 RF propagation and capacity models

The SNR of an RF communication link is defined as

Sij(pi, pj

) = Pij(pi, pj

)

N(pi

) (3)

where Pij(pi, pj) is the power received by node i at position

pi ∈ R2

from the transmission of node j, located at pj ∈R

2. N(pi) is the environmental noise seen by node i at

location pi, and includes thermal and interference noise. For

simplification of notation let Sij = Sij(pi, pj).

The Shannon-Hartley Theorem states that the channel

capacity c, which is the theoretical maximum rate of clean

(or arbitrarily low bit error rate) data that can be sent with a

given average SNR is [20]

cij(pi, pj) = B log2

(1 + Sij(pi, pj)

), (4)

where B is the bandwidth of the channel, and cij(pi, pj) is

the channel capacity for node j at position pj transmitting

to node i at position pi. The Shannon-Hartley Theorem

provides a useful relation in that the maximum achievable

rate is related to the SNR of the channel. By increasing

the SNR of a wireless channel, the ability of the channel

to send more data is increased, providing a higher capacity

capability.

For this paper it is assumed that the RF environment

can contain localized noise sources such that N(pi) �= N(pj),

giving Sij �= Sji ⇒ cij �= c ji. That is to say, even though node

i can receive a transmission from node j, node j may not

be able to decode a transmission from node i. This is a

fundamental assumption for any geographic, range-based

controller, and for the definition of an optimal chain given

in [11].

3.2 Optimal communication chain

For a cascaded network communication chain, the achiev-

able chain capacity can be directly related to the individ-

ual link capacities along the chain. The relation of the

local link capacity to the full chain capacity is defined

by the type of network being considered. In this paper

a cascaded relay network [13] is considered where the

relays can transmit and receive at the same time. This is

possible if the relay has two antennas and frequencies, one

for receiving and one transmitting. While this is typically

not the case for cheap wireless network nodes that use a

single omnidirectional antenna, this network model is valid

for modelling repeater based networks used by emergency

personnel [22].

Because of the assumption of simultaneous transmission

and reception of signals by a relay, the throughput capacity

of a cascaded network chain is limited by the link with

smallest throughput capacity. Figure 1 provides a graphical

example of the problem where the link between nodes 4 and

5 is limited to 1 megabit per second (Mbps), either due to

distance or environmental noise, and the rest of the chain

has a 2 Mbps link capacity. It is clear from the figure that

even if node 1, the source node, tries to transmit at 2 Mbps

to node 5, the destination node, that the link between nodes

4 and 5 will limit the resulting throughput through the chain

to be 1 Mbps.

In a cascaded communication chain with simultaneous

transmission and reception the capacity of the communica-

tion chain is given as

C(N ) = mini∈[2,N]j=i−1

{cij(pi, pj)

}(5)

for the directed communication flow from the source to the

destination.

Optimal chain capacity is found by maximizing the min-

imum individual link capacity by moving the relays nodes

in the environment so that

C∗ = maxpi∈R2

mini∈[2,N]j=i−1

{cij(pi, pj)

}, (6)

where C∗is the globally optimal communication capacity

for the cascaded chain.

Mobile Netw Appl (2009) 14:281–291 285

Since link capacity is a monotonically increasing func-

tion of SNR,1

maximizing the minimum link capacity can

be achieved by maximizing the minimum link SNR along

the chain. Thus, the performance of the chain defined by

Eq. 6 can be restated in terms of the SNR instead of link

capacity.

S∗ = maxpi∈R2

mini∈[2,N]j=i−1

{Sij(pi, pj)

}(7)

It is desirable to use this objective function over that in

Eq. 6 because unlike link capacity, the SNR of a link is

directly measurable by a node. Furthermore, if a goodput

link capacity model is assumed, such as the piece-wise

linear model given in [18] then there may be regions in

the environment where the radios have reached throughput

saturation. That is, even with an increase in SNR there is

no increase in the link capacity. Thus, in this case if the

controller used the link capacity it would stop as soon as

throughput saturation was reached even though there may

be a direction to move that increases the link’s SNR even

further. So by maximizing the minimum link SNR, the relay

nodes will find the most robust position for all link capacity

models, not just Shannon’s channel capacity.

4 Decentralized gradient-based chaining controller

It is well known (e.g. see [7, 15]) that gradient-based con-

trollers are well suited for decentralized control in robotic

teams where the vehicles are required to maximize, and

adapt to, a changing objective function. Typically, smooth

artificial potential functions are developed to model inter-

robot constraints and achieve team objectives based on

the positions of neighboring vehicles. For this work, the

artificial potential function (referred to as the local objective

function) is specifically designed to achieve the global team

objective of maximizing end-to-end chain capacity using

the SNR of neighbor communication links as the input.

Define the local performance function Ji : R × R → R+

to be

Ji = min{

Si,i−1, Si+1,i}

(8)

which is only a function of the SNR of the links with node

i’s neighbors, and follows the form of the global objective

stated in Eq. 7. The affect of this performance function is to

move the relay towards the direction of improving the link

with the weakest SNR.

The use of the min() function means that Ji is non-

smooth when Si,i−1 = Si+1,i, and therefore the gradient

of Ji is not defined. Therefore, when Si,i−1 = Si+1,i the

1 ∂c∂S > 0.

gradient of Ji needs to be understood in terms of the

least-norm element of the generalized gradient [6]. The

reader is referred to [6] for detailed discussion on using

the least-norm element of the generalized gradient and using

non-smooth feedback in dynamic systems. For this paper

the least-norm element of the gradient of Ji will be denoted

∇ i Ji and is defined as

∇ i Ji =

⎧⎪⎪⎨

⎪⎪⎩

∇i Si,i−1, if Si,i−1 < Si+1,i

∇i Si+1,i, if Si,i−1 > Si+1,i

Ln{co{∇i Si,i−1, ∇i Si+1,i, }}, if Si,i−1 = Si+1,i

(9)

where ∇i is the gradient with respect to node i’s location,

co{} is the convex closure, and Ln{} is the least-norm

element of a set. A key subtlety of the generalized gradient

and this objective function is that even if the SNR of the

two links are equal (and therefore their capacities), it does

not mean that the throughput is maximized. There might be

a direction for the relay node to move that will improve the

SNR of both links even more, as determined by the convex

closure of the two neighbor link gradients.

Let xi ∈ R2

be the state vector for node i subject to

xi = ui (10)

where ui : R2 × R

2 → R2

is the control input to node i.First order dynamics are presented in this section for ease of

discussion. Extension to higher order dynamics is possible

with a correct choice of controller or performance function.

Finally, define the local decentralized controller for node ias

ui = −k∇ i Ji. (11)

where k ∈ R+

is the controller gain.

Due to the fact that localized noise sources may be

present in the environment, it is possible for Ji to have

multiple critical points, i.e. local and global extremum

points. Because of this, a gradient-based controller cannot

guarantee convergence to the global optimum when local-

ized noise sources are present in the environment. However,

this is not considered a downfall for this application. The

reason is that even though it may be impractical to find

the globally optimum relay location in very complex RF

environments, the gradient based controller will always seek

improve the communication capacity of the chain from its

initial conditions. Thus, if the system is initialized using

a position-based solution the chaining controller can only

improve the chains capacity. If the environment does not

contain localized noise sources, then the local performance

function given will only have a single stable critical point.

It will now be shown that this decentralized gradient-

based controller achieves the objective of the global op-

286 Mobile Netw Appl (2009) 14:281–291

timization problem by showing the set of critical points

of Ji correspond to the locations that maximize Eq. 7. Let

X = [x1, x2, · · · , xn]T, and U = [u1, u2, · · · , un]T

such that

X = U. (12)

Since ∇i Si,i−1, ∇i Si+1,i �= 0, from Eq. 9 the critical points

of Ji are located at positions where Si,i−1 = Si+1,i. That is

∇ i Ji = 0 ⇒ Si,i−1 = Si+1,i (13)

and thus

X = 0 ⇒ S2,1 = S3,2 = · · · = SN,N−1. (14)

When all of the SNR of the individual links are equal, then

the generalized gradient of the global objective function

is equivalent to the localized gradient. That is, when

S2,1 = S3,2 = · · · = SN,N−1, and only when, then

∇S =

⎢⎢⎢⎢⎣

∇1J1

∇2J2

...

∇ n Jn

⎥⎥⎥⎥⎦

(15)

Since ∇ i Ji = 0 at this condition,

X = 0 ⇒ ∇S = 0 (16)

and therefore the set points of the decentralized controller

achieves locations of optimal relaying.

A simulation of a network consisting of two point-mass

robotic relays is presented in Fig. 2 to show the response of

the system following the chaining controller. In the simula-

tion, the two relay nodes are initialized close to the location

of the destination node, modeling the initial deployment

of the relay nodes from a base station the desires higher

bandwidth communications from the source node. Without

the presence of the localized noise source, the controller

deploys the relays to the nominal range based positions,

indicated by the circles, as these are the optimal locations

when there is no localized noise sources. When then noise

source is introduced at t = 250 s, the robots sense the

change in the SNR field and bend around the source to

maximize the chain throughput. The final location of the

relay nodes is indicated by the diamonds. The time plots

on the right of the figure show that the chaining controller

is able to improve the capacity of the communication chain

significantly above that of the range-based solution, indi-

cated by the dashed line.

5 Electronic chaining ES controller

It was shown in the previous section that if the local gradi-

ents are known by the nodes, then a decentralized controller

(based on the localized gradients) can be used to drive

individual nodes to their globally optimal relaying locations.

However, in a physical environment with unknown local-

ized noise sources, either due to faulty nodes or active jam-

ming, the structure of the SNR field is impossible to predict,

and therefor the gradient can not be directly determined. A

way to estimate the gradient of the performance objective in

real-time, and by each mobile node, is required so that the

system may be driven to optimal operating positions (set

points).

ES [2] controllers are adaptive, model-free controllers

designed to drive the set point of a dynamic system to

an optimal, but unpredictable location defined by a perfor-

500 1000 1500 2000 2500 3000 3500 4000 4500 5000

500

1000

1500

2000

2500

3000

3500

Source Dest

R 1R 2

Noise

X – Position [m]

Y –

Po

siti

on

[m

]

0

10

20

30

40

Lin

k S

NR

[d

Bm

]

0 100 200 300 400 500

Link 1Link 2Link 3

0

50

100

150

200

Ch

ain

Cap

acit

y [M

bp

s]

0 100 200 300 400 500

Time [s]

GradientRange–BasedDirect

Figure 2 Simulation of a planar cascaded relay network using two

point-mass robotic relay nodes. At time t = 250 s a jamming noise

source is introduced showing the ability of the gradient controller to

react to changes in the RF environment and improve the capacity

above that of a range-based solution

Mobile Netw Appl (2009) 14:281–291 287

mance function that is only known to have an extremum

point. That is, given a sufficiently smooth cost function

J: R × Rm → R, ES controllers seek to solve in real time

the optimization problem

θ∗(t) = arg maxθ∈Rm

J(t, θ) (17)

where J is an unknown, possibly time varying, cost function

of the input parameter θ such that Dθ J (t, θ∗) = 0 and

D2

θ J(t, θ∗) < 0.2

The standard ES algorithm works by generating a mea-

sure of the local gradient of the mapping J(θ) by injecting a

perturbation signal, α cos(ωt), directly into the plant. The

output of the plant will also be sinusoidal, with a DC

(or constant) offset that the HPF removes. This signal is

then demodulated by β sin(ωt − γ ) and low-pass filtered to

obtain the gradient estimate. The gradient estimate is then

used to update the estimate of the optimal location, θ . See

[1, 2] for formal discussions, including stability proofs and

design guidelines, on single and multivariable ES.

In two dimensions, the input into the performance func-

tion has the appearance of a circular perturbation about

a moving (i.e. time varying) orbit center point. It is this

specific structure that the ES algorithm presented in this

paper takes advantage of in that some vehicles, like UA,

also exhibit a cyclic (circular) motion about an orbit center

point when they are station keeping since they must always

maintain a forward speed.

A block diagram of the decentralized ES chaining algo-

rithm is shown in Fig. 3 and consists of a LGVF Controller

steering a 2D kinematic vehicle operating within an ad hoc

network. The basic ES framework within the controller is

used to estimate the gradient of the communication perfor-

mance field that is used to drive the motion of the orbit

center point for the LGVF controller using virtual point

mass dynamics with a bounded center point velocity.

The most significant difference in the design of this ES

algorithm is that it is a self-exciting system. That is, there

is a natural limit cycle that persists in the system (the

orbital motion of the vehicle) and this limit cycle provides

the required dither signal into a measurable performance

function. Because the limit cycle that exists due to the

plant dynamics generates the sinusoidal dither signal, the

performance and stability of the controller are dependent

upon the performance capabilities of the vehicle. Thus to

maintain stability of the ES chaining algorithm, appropriate

values for the ES filters, the ES feedback gain kES, and the

maximum center point velocity must be designed for each

different vehicle type with different performance abilities.

2 Diθ (·) denotes the ith directional derivative of J w.r.t. θ .

Figure 3 Decentralized ES algorithm for a 2D kinematic vehicle

using a LGVF controller to provide the orbital motion of the vehicle

about a virtual center point driven by the gradient estimate of the

performance of a communication chain

5.1 Kinematic vehicle model

It is assumed that the robotic nodes in the network are

equipped with a low-level control system that presents a 2-

D kinematic model for use by the higher-level ES algorithm.

Let pj ∈ R2, denoted as pj = [xj, yj]T

, be the position of vehi-

cle jwith inertial speed [xj, yj]T ∈ R2

that evolves according

to the standard (Cartesian) bicycle-like kinematic model

xj = vjcos ψj

yj = vjsin ψj (18)

ψj = vj cj

where [xj, yj]T ∈ R2

is the two-dimensional inertial position

of node j, ψj ∈ [0, 2π) is the track angle (compass heading),

vj is the commanded speed (held constant), and cj is the

bounded path curvature. The bicycle kinematic model is

chosen over a unicycle model because this model covers a

wider class of 2D nonholonomic vehicles, moving in only

a forward direction and that cannot turn on the spot, such

as bicycles, cars, and autonomous underwater and aerial

vehicles [12].

It should be noted that the major difference of the

bicycle-like model from the unicycle model is that the head-

ing rate is a function of the vehicle speed and the curvature

constraints of the vehicle. For bicycles, the curvature is

related directly to the steering angle of the front wheel.

For an aircraft in a steady-state coordinated turn, the path

curvature is

c(v) = g tan φ

v2, (19)

where φ is the aircraft bank angle and g is the gravitational

constant.

Due to vehicle performance constraints, the path curva-

ture for a vehicle is bounded by upper and lower limits. For

an aircraft at a speed v,

ωmax(v) = g tan φmax

v(20)

288 Mobile Netw Appl (2009) 14:281–291

where φmax is the maximum bank angle of the vehicle at

speed v. Thus the steering input into vehicle j is bounded

such that

∣∣uj

∣∣ ≤ ωmax and gives a minimum orbital radius of

rmin(v) = v

ωmax(v). (21)

For bicycles and car-like vehicles, the path curvature bound

is directly related to the physical limitations in the motion

of the steering wheels.

This minimum radius, as will be seen later, is the effec-

tively the lower bound on the final error (or distance) of

the vehicle from the optimal communication location, which

will be the location of orbit center point for the loiter circle.

While the orbit center point can be driven to the location of

optimal communication, the robotic relay will always be at

best no closer than rmin.

Because of the wide range of dynamics and physi-

cal constraints of different types of robotic nodes, it is

not practical to drive the vehicle speed or heading di-

rectly by the ES dither signal as done in [23]. Instead a

Lyapunov guidance vector field (LGVF) controller is used

to drive the vehicle to an orbital (limit cycle) motion about

a center point. The center point is then driven with virtual

point mass dynamics by the ES framework in the chaining

algorithm.

5.2 Lyapunov guidance vector field controller

To provide the sinusoidal perturbation signal required by

the ES framework. A LGVF controller [9] is used to drive

the vehicle to a circular limit cycle about a virtual center

point, pcp ∈ R2. Since the vehicle is orbiting pcp, the ES

framework does not drive the vehicle directly to the optimal

communication location, but instead pushes pcp to the op-

timal communication location that the vehicle orbits about

using the LGVF controller.

The LGVF controller is split into two components, a

guidance vector field generator and a heading tracker (HT)

controller. The HT drives the robotic relay to the desired

loiter circle at a radial distance of rd from the orbit center

point pcp = [xcp, ycp]Tas given by the generated vector field

f (pr) =[

xd

yd

]

= β

[−(

r2 − r2

d) −2rrd

−2rrd −(r2 − r2

d)

][x − xcp

y − ycp

]

(22)

+[

xcp

ycp

]

where r2 = pr · pr = (x − xcp

)2 + (y − ycp

)2

is the squared

radial distance of the UA from the loiter center point, pcp, β

is a non-negative scalar that guarantees convergence to the

desired loiter circle when the center point is moving [9], and

vcp = [xcp, ycp]Tis the center point velocity.

The guidance vector field gives the desired velocity,

which is used to generate a turn rate command to the vehicle

through the HT. Let eψ = ψ − ψd where ψd is the desired

compass heading given as

ψd = arctan

(yd

xd

). (23)

The heading angle error is driven to zero by the turn rate

command

ω = ψd − λ · (ψ − ψd) (24)

where

ψd = v

rd. (25)

This controller is globally stable limit cycle about pcp and is

stable for any value of vcp. However, since it is assumed that

the vehicles have bounded turn rate capabilities, rd should

be chosen such that rd > rmin.

5.3 Center point dynamics

For the ES framework to be stable, and to generate the

appropriate gradient estimate, the system needs to exhibit

three different time scales [14]:

1. Fast – tracking of the center point

2. Medium – the orbital motion

3. Slow – the LPF filter in the ES

Since the amplitude and excitation frequency of the pertur-

bation signal are set by v and rd, the fast and slow dynamics

must also be functions of the vehicle performance. Due to

the speed constraints placed on the dynamics of the center

point, the convergence rate of the center point to the optimal

location is bounded by the maximum speed of the center

point.

For the error dynamics of the center point to be fast,

and to maintain the cyclic orbit about the center point, the

motion of the center point must be slow as compared to

the speed of the vehicle, i.e. vcp << vj . In the ES chaining

algorithm the center point velocity is bounded by Vcp so that

vcp ≤ Vcp and Vcp << vj.

It should be pointed out that center point velocity satu-

ration is required in the loop because even though we can

choose k small enough that the speed of the orbit center

point remains slow, as compared to the UA for a given

environment and performance function, the output of the

ES framework depends upon the magnitude and shape of

the performance function, which is not necessarily known

a priori. Thus, there could be unexpected environments in

Mobile Netw Appl (2009) 14:281–291 289

which if the center point speed was not bounded, it could

reach the maximum flight speed of the aircraft. At this point,

the motion of the UA about the center point is no longer

cyclic and is not generating the periodic perturbation signal

of the performance function required for the ES framework

to generate the gradient estimate.

6 Electronic chaining simulations

Since this work is motivated by our AUGNet [5] research

into networks that combine ad hoc networks with UA acting

as airborne communication relays, this section presents sim-

ulations of the decentralized ES chaining controller running

on UA. The performance of the aircraft and the radios are

modeled off of values taken from AUGNet experiments. For

the simulations, the aircraft are limited to a maximum 30

degree bank angle, flying at 25 m/s, and their ordering is

preset and maintained depending upon the starting location

of the UA. The maximum center point velocity is set to

5 m/s.

Though it is not know by the ES controller, the radios in

the simulation are assumed to follow the standard exponen-

tial decay model

Pij = Pji = Krd−αij (26)

where Kr is the link gain, dij is the separation distance of

the receiver from the transmitter, α is the exponential decay

rate, and Pij is the received power. The radio values are

set to Kr = 3822 and α = 3.5. For the simulation with a

noise source, the noise source is taken to be a faulty radio

transmitting with Kr = 382. Note, the choice of using the

exponential decay model was for ease of programming in

the simulation. However, by assuming that the noise source

is a faulty radio that is acting as a jamming node, even in the

simulation the SNR models are no longer concentric circles

about the radio nodes. Instead, depending upon the power

and location of the noise source, the SNR contour lines are

extremely skewed and non-symmetric.

Figure 4 shows results from a simulation with a single

UA, two end nodes and no localized no source. In Fig. 4a,

the position of the orbit center point is shown as a dashed

line within the cyclic orbital flight path of the UA. From this

figure one can see that when the UA was far away, it headed

directly in the direction of improving the minimum SNR

(which is the SNR from the far right node) at the maximum

speed of the center point. Figure 4b shows just the X-Y

position of the orbit center point is shown to highlight the

bounded convergence rate of the ES algorithm. For t ∈(50, 500) s the positional errors (especially on the Y-axis)

show the bounded convergence rate due to the bounded

center point speed.

Figure 5 shows a simulation run with three UA and two

static end nodes. At the beginning of the simulation, the

UA relays are aligned along the chain as would be a result

of running a position based controller. At time t = 0 s a

noise source located at [2500,1000] m is introduced. Since

position based controllers would not sense the change in

the RF environment, the nodes would maintain their current

position. However, using the electronic chaining algorithm

the figure shows that the UA react appropriately to the

jamming signal source and form a bowed communication

chain. Figure 5b shows that at t = 0 s the minimum SNR at

the orbit center points along the communication chain was

less then 19 dBm and that the electronic chaining controller

was able to improve the minimum value to above 24 dBm

by moving the location of the vehicles orbit center point.

Again, it is worth pointing out that in the simulation the

gradient of the SNR field is estimated on-line using the

500 1000 1500 2000 2500 3000 3500 4000 4500

0

500

1000

1500

2000

2500

3000

35

40

45

50

55

60

35

40

45

50

60Node 1

UAV 1

UA Locations over Time

X–Position [m]

Y–P

osi

tio

n [

m]

0 500 1000 15001000

1500

2000

2500

3000UA Orbit Point Locations over Time

XP

osi

tio

n [

m]

0 500 1000 15002000

2200

2400

2600

2800

3000

Time [s]

Y

Po

siti

on

[m

]

Node 2

a b

Figure 4 Simulation of a single UA airborne relay between two static nodes without any localized noise source. The motion of the center point

is the dashed line with the orbital motion of the UA required for the ES framework

290 Mobile Netw Appl (2009) 14:281–291

500 1000 1500 2000 2500 3000 3500 4000 4500 5000

500

1000

1500

2000

2500

3000

3500

Noise Source

10

1520

2530

35

40

10

15

20

25

30

3540

4Node 1 Node 2UAV 1 UAV 2 UAV 3

UA Locations over Time

X–Position [m]

Y–P

osi

tio

n [

m]

0 200 400 600 800 1000 1200 1400 1600 1800 200018

19

20

21

22

23

24

25Minimum SNR at Orbit Center

Time [s]

Min

SN

R [

dB

m]

a b

Figure 5 Simulation of three UA relay nodes reacting to a localized noise source. Motion of UA within the environment is also shown with the

noise source location and the SNR contours of the two end nodes

motion of the vehicle and the change in the measured SNR

values.

7 Limitations & future work

The decentralized controller based on the gradient of the

SNR field is shown to maximize the capacity of a cascaded

relay network with simultaneous transmission and reception

of RF signals. However, this is not a common network used

on cheap and inexpensive robots. While it is an appropriate

model for use on a single airborne relay using a repeater

network, as in the case of wildland firefighters, to be more

general a different performance function is required for

packetized networks such as 802.11b. In these networks,

each hop along the chain adds to the total delay of the

system and maximizing the link with the minimum SNR

does not necessarily maximize the end-to-end chain capac-

ity. Current work by the authors is in finding a local per-

formance function that has a spatially distributed gradient

mapping to be used for different types of packet based

networks.

A limitation of the ES frameworks is that cyclic motion

of the vehicle must be maintained to sustain an estimate of

the positional gradients required. For the application of this

work on using UA as communication relays, this is not a

problem since aircraft must maintain forward flight speed

and the most natural motion is to maintain an orbital flight

pattern for acting as a communication relay. In fact, this nat-

ural cyclic motion of a UA flying orbits to station keep is a

unique benefit to aircraft and was the original motivation for

using ES methods. However, in the application of this work

to cheap robots with limited battery power this continual

motion to estimate the gradient will simply require too much

power to sustain. A more general solution will need to take

into account the benefit of updating the gradient estimate to

the cost of the motion.

8 Conclusion

In this paper a definition of an optimal communication

chain of relay nodes in an ad hoc network was presented

based on the SNR instead of relative position. By using the

SNR instead of position, a communication chain of robotic

relays can respond to changes and unexpected features in

the RF environment that is not possible with position based

chaining solutions. Since the operating environment is gen-

erally not known a priori to deployment of a network, an

adaptive model-free ES chaining algorithm was presented

to control the motion of 2D nonholonomic vehicles acting

as communication relays. Even without specific knowledge

of the SNR field, the ES algorithm is able to drive the

team of vehicles to an optimal locations with only local

measures of the SNR. The mobility of the vehicle was

modeled as a bicycle-like kinematic model and is chosen

because the model covers a wider class of 2D nonholonomic

vehicles, including UA. An orbital motion of the vehicle

due to a LGVF controller was applied to ES in a unique

way in that the orbital motion of the vehicle about an orbit

center point generated the dither and demodulation signals

required by the ES algorithm. A specific application using

UA was simulated to highlight the focus of this work in

using UA as airborne communication relays. Even though it

is realized that this work may have limitations, the gradient-

based controller presented in the ES framework provides a

Mobile Netw Appl (2009) 14:281–291 291

means for a robotic team to find and maintain an optimal

cascaded communication chain.

References

1. Ariyur K, Krstic M (2002) Multivariable extremum seeking feed-

back: analysis and design. In: Fifteenth international symposium

on mathematical theory of networks and systems, Notre Dame,

12–16 August 2002

2. Ariyur KB, Krstic M (2003) Real-time optimization by extremum-

seeking control. Wiley, New York

3. Basu P, Redi J (2004) Movement control algorithms for realization

of fault-tolerant ad hoc robot networks. IEEE Netw 18(4):36

4. Beard RW, Stepanyan V (2003) Synchronization of information in

distributed multiple vehicle coordinated control. In: IEEE confer-

ence on decision and control, IEEE, Maui, December 2003

5. Brown TX, Argrow B, Dixon C, Doshi S, Thekkekunnel

R-G, Henkel D (2004) Ad hoc uav ground network (augnet). In:

AIAA 3rd “Unmanned unlimited” technical conference, AIAA,

Chicago, 20–23 September 2004

6. Clarke FH (1975) Generalized gradients and applications. Trans

Am Math Soc 205:247–262, April

7. Cortes J, Martinez S, Karatas T, Bullo F (2004) Coverage

control for mobile sensing networks. In: IEEE transactions

on robotics and automation. IEEE, Piscataway, May 2002,

pp 1327–1332

8. Dubins LE (1957) On curves of minimal length with a constraint

on average curvature, and with prescribed initial and terminal

positions and tangents. Am J Math 79(3):497–516, July

9. Frew EW, Lawrence D (2005) Cooperative stand-off track-

ing of moving targets by a team of autonomous air-

craft. In: AIAA guidance, navigation, and control conference,

San Francisco, August 2005

10. Gerkey BP, Mailler R, Morisset B (2006) Commbots: distributed

control of mobile communication relays. In: Proc. of the AAAI

workshop on auction mechanisms for robot coordination (Auc-

tionBots). Boston, July 2006, p 5157

11. Goldenberg D, Lin J, Morse AS, Rosen BE, Yang YR (2004)

Towards mobility as a network control primitive. In: Mobihoc ’04.

ACM, Tokyo, 24–26 May 2004

12. Indiveri G (1999) Kinematic time-invariant control of a 2d non-

holonomic vehicle. In: 38th Conference on decision and control

(CDC’99), Phoenix, December 1999

13. Kramer G, Gastpar M, Gupta P (2005) Cooperative strategies

and capacity theorems for relay networks. IEEE Trans Inf Theory

51(9):3037–3063, September

14. Krstic M, Wang HH (2000) Stability of extremum seeking feed-

back for general nonlinear dynamic systems. Automatica 36:595–

601

15. Martinez S, Cortes J, Bullo F (2007) Motion coordination with

distributed information. IEEE Control Syst Mag 27(4):75–88, Au-

gust

16. Olfati-Saber R, Murray RM (2003) Flocking with obstacle avoid-

ance: cooperation with limited information in mobile networks. In:

Conference on decision and control (CDC), Maui, December 2003

17. Pezeshkian N, Nguyen HG, Burmeister A (2007) Unmanned

ground vehicle radio relay deployment system for non-line-of-

sight operations. In: IASTED robotics and applications (RA 2007),

Würzburg, 29–31 August 2007

18. Rappaport TS, Na C, Chen J (2004) Application through-

put measurements and predictions using site-specific infor-

mation. In: IEEE 802.11 plenary session, IEEE doc. 802.

110-04-1473-00-000t, 17 November. IEEE, Piscataway

19. Sweeney J, Brunette T, Grupen YYR (2002) Coordinated teams of

reactive mobile platforms. In: International conference on robotics

and automation, IEEE, Washington, DC, 11–15 May 2002

20. Taub B, Schilling DL (1986) Principles of communication sys-

tems. McGraw-Hill, New York

21. Yang L, Passino KM, Polycarpou M (2003) Stability analysis

of m-dimensional asynchronous swarms with a fixed com-

munication topology. IEEE Trans Automat Contr 48(1; ana-

lyzed. Such stability analysis is of fundamental importance):

76–95, December

22. Zajkowski T, Dunagan S, Eilers J (2006) Small uas com-

munications mission. In: Eleventh biennial USDA forest ser-

vice remote sensing applications conference, Salt Lake City,

24–28 April 2006

23. Zhang C, Arnold D, Ghods N, Siranosian A, Krstic M

(2006) Source seeking with nonholonomic unicycle without

position measurement—part i: tuning of forward velocity. In:

45th IEEE conference on decision and control, San Diego,

13–15 December 2006