On 5/19/07, Bill Davidsen <[email protected]> wrote:
Ray Lee wrote:
> Is the S.D. columns (immediately after the average) standard
> deviation? If so, you may want to rename those 'stdev', as it's a
> little confusing to have S.D. stand for that and Staircase Deadline.
> Further, which standard deviation is it? (The standard deviation of
> the values (stdev), or the standard deviation of the mean (sdom)?)
>
What's intended is the stddev from the average, and perl bit me on that
one. If you spell a variable wrong the same way more than once it
doesn't flag it as a possible spelling error.
Note on the math, even when coded as intended, the divide of the squares
of the errors is by N-1 not N. I found it both ways in online doc, but I
learned it decades ago as "N-1" so I used that.
<nod> sqrt(N-1) is almost always the correct form, and gives the
"sample standard deviation". sqrt(N) (or the "population standard
deviation") should only be used when you can prove that you're not
doing any sub-sampling of the population at all, which is pretty much
never the case.
In practice, for any decent values of N, the difference between the
two is usually insignificant compared to everything else going on.
Okay, here's a bonus, http://www.tmr.com/~davidsen/sched_smooth_02.html
not only has the right values, the labels are changed, and I included
more data points from the fc6 recent kernel and the 2.6.21.1 kernel with
the mainline scheduler.
You rock.
The nice thing about this test and the IPC test I posted recently is
that they are reasonable stable on the same hardware, so even if someone
argues about what they show, they show the same thing each time and can
therefore be used to compare changes.
Yup, these look good to me.
As I told a manger at the old Prodigy after coding up some log analysis
with pretty graphs, "getting the data was the easy part, the hard part
is figuring out what it means." If this data is useful in suggesting
changes, then it has value. Otherwise it was a fun way to spend some time.
Right. At minimum, the average +- standard deviations of the glxgears
case tells a pretty consistent story, especially if one looks at the
deviation as a percentage of the average. (Mainline is far more
erratic than either SD or CFS, and CFS seems to be able to maintain
high throughput and low jitter.)
I'm not quite sure what to read into the fairness test, other than
mainline is obviously different for the other two in terms of
throughput. Deviations are across the board, but perhaps that's more a
reflection of how much (little) data was collected for this one?
Dunno.
Anyway, as you point out, the usefulness of the tests chosen could
still be up for debate, but I think you've shown that the results are
pretty stable, and at least can serve as a good measure between
versions of schedulers.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
[Index of Archives]
[Kernel Newbies]
[Netfilter]
[Bugtraq]
[Photo]
[Stuff]
[Gimp]
[Yosemite News]
[MIPS Linux]
[ARM Linux]
[Linux Security]
[Linux RAID]
[Video 4 Linux]
[Linux for the blind]
[Linux Resources]