Next: Current Limitations and Future
Up: PRouST Switch Components
Previous: Aggregator
  Contents
  Index
Subsections
More on Aggregation
There are two paradigms for topology aggregation currently supported
in PNNI: the Symmetric-node approach and the Star
approach. In a Symmetric-node model the peer group topology is
collapsed to a single node, and one value (usually the worst-case
value), called the radius, is advertised for each parameter.
While this scheme greatly reduces the information propagated, it may
result in very poor routing decisions, particularly if the number of
border nodes and the variance in their connectivity is large. The
Star model augments the Symmetric-node approach as follows: topology
is aggregated into a representation with one central virtual node
called the nucleus and
border nodes. This central node is
explicitly connected to all of the border nodes via spokes. The
spokes may be given weights different from the radius, in which case
they are referred to as exceptions. In the final level of
sophistication, PNNI aggregation allows the aggregated representation
to specify logical links directly between borders; such links are
called bypasses.
A desirable aggregation scheme must
- Transform a peer group structure into a compact form.
This is necessary to capitalize on the intended purpose of
aggregation, namely to reduce background traffic required to maintain
routing information for large networks.
- Retain information about those aspects of the topology that
are most critical to making good routing decisions. The aggregation
process results in each switch seeing a suitably coarsened (and
hopefully adequate) view of the entire network topology: A switch sees
the true topology only for its own peer group, but has progressively
more aggregated views of farther regions of the network. Since this
aggregated topology information will be used to select the routes for
connection requests, we must make sure that our aggregation scheme
does not have too detrimental effect on route optimality.
- Be efficiently computable. Aggregation will need to be
re-performed every time peer group topology changes significantly.
Having a lightweight aggregation algorithm ensures that we can have a
reasonable criterion defining a ``significant change'', without
putting an unacceptable burden on the network switches that have to
compute the aggregated representation.
To date, no specific Symmetric-node or Star aggregation schemes have
been defined in the PNNI standard. In this paper, we present two
approaches for topology aggregation with respect to a single metric
parameter. Specifically, we describe two techniques for computing good
Star representations of a peer group.
We represent a peer group as a multi-weighted undirected graph
with nodes
, edges
, where
is the fixed set of
border nodes, and
are the required PNNI topology state parameters. The edge vector
consists of the
values of the parameters for the link joining
and
.
Let
be the value of the best path between the
non-border node
and border node
with respect to
. Specifically,
Define
, the boolean distance
between a pair of border nodes
and
;
Note: The boolean distance between two nodes is a set. It is easy to
see that the boolean distance is a metric, defined in terms of
equality, containment and union of sets.
Define
, the set of candidate nodes as
.
It is clear from this definition, that
consists of
all non-border nodes in the peer group that lie on a simple path
between some pair of border nodes.
We define the status of every candidate node
with
respect to the parameter
, and denote this quantity as
. We also define the status of the peer group
with respect to parameter
, and denote this quantity by
.
The precise definitions of
and
depend on the
particular parameter
, and are given below:
The central status nodes of the peer group relative
to the pair
are defined to be the set of nodes
such that
. A central status node may be
thought of as a node on which one can center the smallest radius
sphere (with respect to parameter
), and still capture all of the
border vertices.
Whenever we need to aggregate, we first generate a weighted complete
graph on border nodes from the original peer group. We present two
methods for generating the weights
(between borders
and
) on the edges of this complete graph:
- (I)
-
is the measure of the shortest path between
and
.
Complexity. We compute the best paths from every border node,
to every other node in the network, by running Dijkstra's algorithm
times, starting once from each of the border nodes. This requires
steps. The space needed is
,
which is what is required to hold the adjacency list graph
representation of the peer group.
- (II)
-
is the measure of the best path
between
and
through a central status node.
Complexity. First, we run Dijkstra's algorithm from all border
nodes, in
steps, as in (I). This gives us
the
. Then for each non-border node
, we use
the computed distances to the border nodes to compute
,
requiring
additional steps. We select a central status node
appropriately, by keeping track of the best value of the
as we compute them. Finally, since we have already
computed the distances from the border nodes to the chosen central
status node, we can compute the pairwise shortest distances between
all border nodes in an additional
steps. The space
requirement is the same as for method (I) above.
If
is the weight on the edge joining
and
in the
complete graph then this is the value that we wish to advertise in the
parent group as the cost of traversing the peer group between border
nodes
and
.
Figure 7.3:
Basic steps for topology aggregation
 |
We would like to represent the original peer group as a star graph
with weights
advertised on the edges such that
for all
.
In general, with
border nodes, such system of equations can be
represented as
where, A is
matrix, D
is
column vector of the costs to be
advertised, and X is
column vector of unknown weights to
be determined for the star representation of the Logical Group Node.
This system
is an over determined system of equations which is
usually an inconsistent system. That is, given an
system
with
, we cannot in general find a vector
for
which
.
In such cases, one resorts to finding a vector,
for
which
is closest to
in the least square
solution to the system
. Indeed,
in
is such
that
is minimum over all
. Thus to solve
, we solve
The above represents an
system of linear equations,
referred to as the normal equations. The
obtained are the
values advertised as exception spoke edges of the star graph
that represents the peer group in the next higher level. The size of
this aggregated representation, and hence the communication incurred
per peer group, is
.
This approach is well-suited for peer groups which experience
unpredictable traffic patterns, since it computes the star
representation that minimizes, in a least squares sense, the
average distortion over all transits of the peer group.
We compute
only once (off-line) beforehand in
steps7.2.
Each time we need to re-aggregate: we multiply the precomputed
matrix by the current value of the vector
. This
multiplication can be done in
time. The space required to
hold the precomputed value of
and to perform the
necessary multiplication is
.
Thus the time requirement to perform an aggregation using Approach 1
(including preprocessing) is
, and the
space requirement is
.
A transit is an undirected path through the peer group
whose endpoints are distinct border nodes. We call an unordered pair
of border nodes
critical if we cannot tolerate any
discrepancy between the information advertised in the aggregated
representation, and the true value
of the parameter for the
transit between
and
. Note: In the entire discussion
that follows, we deal with unordered pairs, implying that the element
denoted
is equivalent to the element denoted
.
Select a set of preserved transits
is critical
. Our approach requires
; the precise
conditions on this set
will be explained shortly.
Define the set of non-critical transits
distinct border nodes,
. Note that
.
Now for each element
define the linear equation
And for each element
define the linear equation
The
and
together specify a system of
linear equations in
unknowns, where
is
the number of border nodes. The
and
are the
unknowns which have to be determined. We represent this system in
matrix form
,
where
and
are the
coefficients corresponding to the linear equations
and
respectively.
Since each
introduces a new free variable
, it
is clear that the coefficient matrix
of the above system is
non-singular if and only if the system consisting of just the
has a solution.
We can now more fully describe how to select the set
: We sample
the actual probabilities of the transits, (or get some estimate of the
traffic patterns between every pair of borders nodes
and
).
Then we choose the
most demanding of these traffic routes, with
the additional constraint that the resulting system of
be non-singular.
We compute the unique solution
, and give the
following interpretation to the values of the components of the solution
vector: The value of
will be the quantity advertised on the
exception spoke between the nucleus and border node
in the
star graph. The value of
is the additive distortion between
the truth and what is advertised about the (non-critical) transit
between border nodes
and
.
For any fixed border node
we consider the multiset
consisting of the
transit errors,
We define
=
, where
is some aggregation
function7.3, and consider
to be a measure of the
average error inherent in the star representation for paths through
border
of the peer group.
The final representation of the peer group is the set of
(on the exception spokes of the star), and the set of errors
. These
values serve as a ``disclaimer'', specifying the
average error inherent in the star representation for transits through
border
. The representation constructed by this approach is of
size at most
.
Unfortunately, the current version of the PNNI standard does not have
any provision for advertising the set of errors
. We recommend
that the PNNI aggregation model be augmented to permit the assignment
of topology state parameters to the nucleus. If such a scheme is
adopted, the set of errors
could be associated with the nucleus
and included as part of the aggregated representation. Remote
switches trying to decide whether or not to route through a particular
peer group could then take into account not only its aggregated
representation, but also the
, which are the peer group's
self-assessment of the fidelity of its advertised representation. In
the absence of such an extension to PNNI, we can still use Approach 2,
and only advertise the set of
on the exception spokes of the
star (without the
).
This approach is well-suited for peer groups which experience regular,
characterizable traffic patterns, since it computes the star
representation that completely preserves the truth about the most
commonly occurring transits.
We compute
only once (off-line) beforehand in
steps, just as in Approach 1.
Each time we need to re-aggregate: we multiply the precomputed
matrix by the current value of the vector
. This
multiplication can be done in
time, and the total space
required to store
and perform this multiplication is
. Finally, we compute the
using the aggregation
function
as above, (and if
runs in time linear in the size of
its argument set
), this is completed in at most
time.
Thus the time requirement to perform an aggregation using Approach 2
(including preprocessing) is
, and the
space requirement is
.
If this representation is still found to deviate unacceptably from the
true topology, it can be augmented in a systematic way to reduce the
distortion, but at the cost of a larger representation. Let us assume
that we desire to finish with a representation that is within
multiplicative distortion of the true topology. To achieve
this we can add bypass links between borders
and
if
where
is the actual cost of the
transit and
is the cost currently advertised. The weight of such
the additional bypass link would be
.
We want to advertise
as the cost of traversing the peer group
between border nodes
and
. The least square solution in
Approach 1 provides the best
,
and results in
being advertised as the
distance between border
and
. In some instances we may find
. If we desire this advertisement to
be conservative, we can add a bypass link with value exactly
between border nodes
and
.
We measure performance of an aggregation scheme
as follows:
Define
where
is the true cost of traversing the peer group through
the borders
and
and
is what the aggregated
representation advertises. For perfect aggregation one would like
and
to be equal for all i and j
. To
quantify the extent to which this is achieved, we define for each pair
,
of border nodes,
where
are real numbers between
and
, chosen to
represent the frequency of transits between borders
and
.
A value of
that is closer to
signifies more frequent
transits between the borders
and
, making it more important
to minimize the distortion in the compact representation for the cost
of that transit.
Specifically, we view
as the degree to which the advertised
cost matches the true cost of the transit between borders
and
. It is easily verified that for any such transit,
equals
if and only if the advertised cost
exactly
matches its true value
. Furthermore, any deviation of the
advertised value from the true value is reflected in the the
corresponding
to an extent that is linearly proportional to
the importance assigned to the transit (as specified by the scalar
).
We now define the performance of an aggregation scheme K
with respect to a peer group with N border nodes as:
Then
, and higher values of
indicate better performance. Also, it is clear that
if the aggregation scheme is perfect, that is,
,
then
.
Both approaches presented are viable, according to the criteria
presented in section 2:
The proposed schemes transform a peer group structure into a compact
form; the final representation requires only
space, where
is the number of border nodes of the peer group.
Within the confines of an
space star, the proposed schemes
optimally retain the information about peer group structure that is
critical for making good routing decisions. Approach 1 seeks to
minimize the average distortion over all transits in a least squares
sense; approach 2 builds a star that completely preserves the
most
critical transits. The schemes are dual: the first is best suited for
peergroups where traffic patterns are unpredictable; the second is
suited for peergroups where traffic patterns can be characterized.
The representations are efficiently computable since the most
expensive operations (like inverting large matrices) are performed
off-line and only need to be done once. In fact, if we we assume, not
unreasonably, that the number of border nodes is significantly smaller
than the number of nodes in the peer group, say
, then the time to aggregate using either of our schemes
becomes7.4
. Note further that when a connection request arrives at
the peer group, a route computation must be carried out to select a
path for the connection. This route computation itself, using
Dijkstra's algorithm, would take
time. Thus,
if the peer group leader performs re-aggregation every time
new
connection requests have entered the peer group, then the
computational work required to do the re-aggregation can be amortized
against the unavoidable work required to compute the routes for the
connections themselves. Our schemes can therefore be executed
scalably in a temporal sense: larger peer groups will need to
re-aggregate their topology less frequently if switches are not to be
excessively burdened by the computational demands of re-aggregation,
but the length of this minimum time interval between re-aggregations
grows only linearly in
, the number of border nodes.
Next: Current Limitations and Future
Up: PRouST Switch Components
Previous: Aggregator
  Contents
  Index
proust-dev@cmf.nrl.navy.mil