[Iccrg] Proposal for ICCRG operations
Xiaoliang (David) Wei
weixl at cs.caltech.edu
Sat Feb 4 16:36:16 GMT 2006
Thanks, all. About the benchmark, I have been thinking about a few questions
that might be interesting in the process of designing a benchmark:
1. Who is expected to use the benchmark?
My understanding is that there may be different groups of users that are
interested in different properties of a congestion control algorithm.
For example, distributed computation applications would prefer low
latency for the worst flow (this is max-min fairness). Data transfer users
may emphasize a higher average throughput. Media servers would like
stable delay... As Linux allows users to select their preferred
congestion control algorithm, network administrators can select
different algorithms in their private networks, based on the major
applications they run. In this case, I think it makes sense to provide a few
different metrics for different users.
If the benchmark is to be used to identify a TCP candidate for
standardization, we probably need to consider even more: incremental
deployability (interaction with Reno), the impact of the algorithm on the
network (link loss rate, queue fluctuation), etc. In this case, we may
also want to measure realistic workloads in the Internet and feed them back
into the benchmark design.
2. Who will perform the benchmark tests, and how?
As the e2e mailing list discussion of some strange behaviors in NCSU's
performance evaluation (caused by a buggy implementation) shows, the
same congestion control algorithm may perform very differently because of
a bug in the implementation, or even a hardware defect in the testbed.
I think it would be better if we could have a standard hardware testbed
that can repeatedly evaluate an implementation of a congestion control
algorithm. Then protocol designers could iterate the process of
design->implementation->testing and make sure that their algorithms are
correctly evaluated. On this issue, I think WAN-in-Lab may help (at least
partially) as a standard testing platform, if we can build an automatic
testing system based on it.
just my 2 cents.
Xiaoliang (David) Wei
Graduate Student in CS at Caltech
----- Original Message -----
From: "Lachlan Andrew" <lachlan.andrew at gmail.com>
To: "Michael Welzl" <michael.welzl at uibk.ac.at>
Cc: "Aaron Falk" <falk at isi.edu>; "Steven Low" <slow at caltech.edu>; "Lachlan
Andrew" <lachlan at cs.caltech.edu>; "iccrg" <iccrg at cs.ucl.ac.uk>; "Pei Cao"
<cao at theory.stanford.edu>; "David Wei" <weixl at cs.caltech.edu>;
<tmrg-interest at icsi.berkeley.edu>
Sent: Friday, February 03, 2006 3:57 PM
Subject: Re: [Iccrg] Proposal for ICCRG operations
On 03 Feb 2006 09:32:27 +0100, Michael Welzl <michael.welzl at uibk.ac.at> wrote:
> So, the index should contain:
> * normalized throughput
> * loss
> * fairness (let's use the Jain Fairness Index)
> It could then look something like this:
> WI = (T*t + L*l + F*f) / 3
> where WI is "Welzl Index" (tm) ;-)
> T is normalized throughput
> L is normalized loss
> F is the Jain fairness index (which is already a value
> in the 0..1 range)
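As a concrete illustration of the quoted index, here is a minimal sketch of how it might be computed. None of this comes from an existing library; the function names, the equal default weights t, l, f, and the assumption that T and L are already normalized to the 0..1 range are all mine:

```python
def jain_index(rates):
    """Jain fairness index: (sum x)^2 / (n * sum x^2), in (0, 1]."""
    s = sum(rates)
    return s * s / (len(rates) * sum(x * x for x in rates))

def welzl_index(T, L, F, t=1.0, l=1.0, f=1.0):
    """The proposed WI = (T*t + L*l + F*f) / 3.

    T is normalized throughput, L is normalized loss, and F is the
    Jain fairness index; the weights t, l, f are left at 1 here.
    """
    return (T * t + L * l + F * f) / 3

rates = [0.5, 1.0]
F = jain_index(rates)  # 0.9 for rates 0.5 and 1.0
```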
A good way to summarise throughput and fairness is just to take the
*harmonic* mean of the flows' throughputs instead of the arithmetic
mean (or equivalently, the aggregate throughput). That corresponds to
measuring the mean time to transfer a given number of bytes, which is
the quantity actually observed by an application.
This metric avoids the arbitrariness both of using a sum or product,
with the problem of selecting weightings, and also of Jain's index.
(If one flow out of many is starved, Jain's index can still report
high "fairness", but the harmonic mean of rates goes to zero
highlighting the problem.) Similarly, if one flow (or protocol) is
unable to achieve a high rate, Jain's index says it is "unfair" for
another to outperform it. The harmonic mean correctly indicates that
rates 0.5 and 1 are better than rates 0.5 and 0.5 (but less good
than 0.75 and 0.75).
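The starved-flow example above can be checked numerically. In this sketch (the function names are mine, not from any standard benchmark), one starved flow among a hundred leaves Jain's index near 1 while the harmonic mean collapses to zero:

```python
def jain_index(rates):
    """Jain fairness index: (sum x)^2 / (n * sum x^2)."""
    s = sum(rates)
    return s * s / (len(rates) * sum(x * x for x in rates))

def harmonic_mean(rates):
    """Reciprocal of the mean of 1/rate.

    For a fixed file size, mean transfer time is proportional to
    mean(1/rate), so this is the metric an application actually sees.
    """
    if any(r == 0 for r in rates):
        return 0.0  # a single starved flow drives the metric to zero
    return len(rates) / sum(1.0 / r for r in rates)

# One starved flow among many: Jain's index stays high,
# the harmonic mean goes to zero.
many = [1.0] * 99 + [0.0]
jain_index(many)      # 0.99
harmonic_mean(many)   # 0.0

# The rankings from the text: (0.5, 1) beats (0.5, 0.5),
# but loses to (0.75, 0.75).
harmonic_mean([0.5, 1.0])    # 2/3
harmonic_mean([0.5, 0.5])    # 0.5
harmonic_mean([0.75, 0.75])  # 0.75
```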
(Trying to maximise the harmonic mean of the rates is just a special
case of "utility maximisation", using alpha=2 in Mo and Walrand's
alpha-fairness framework. It could be argued that this utility
function is itself arbitrary, but it seems less so than summing
throughput and "fairness".)
Combining throughput and fairness of throughput can be done
systematically, since they both just reflect the average rates. A
bigger challenge is incorporating loss/delay and the other metrics
Wesley Eddy mentioned (stability, convergence time etc). For file
transfers, it could be argued that these only matter insofar as they
affect the total transfer times. This applies to both long- and
short-lived flows (but not to real-time flows). However, there is
still a problem of how to weight the times: the transfer time of each
short flow should get less weight than that of a long flow, but weighting
them in proportion to the amount of data may put too little weight on them.
Lachlan Andrew Dept of Computer Science, Caltech
Phone: +1 (626) 395-8820 Fax: +1 (626) 568-3603