Results 1 to 10 of 10

Thread: One Month Of Monitoring The Linux Kernel Performance

  1. #1
    Join Date
    Jan 2007
    Posts
    10,207

    Default One Month Of Monitoring The Linux Kernel Performance

    Phoronix: One Month Of Monitoring The Linux Kernel Performance

    For those that may have forgot, at the start of December we launched the Phoronix Kernel Test Farm to begin benchmarking the Linux kernel on a daily basis using the automated tools that we provide via the Phoronix Test Suite and Phoromatic. Towards the middle of December we then unveiled the Phoromatic Tracker, which exposes these test results in real-time to the public. Well, it's now been a month of monitoring the kernel's performance and the entire Linux 2.6.33 kernel development cycle thus far, with many interesting findings.

    http://www.phoronix.com/vr.php?view=14480

  2. #2
    Join Date
    Oct 2008
    Posts
    134

    Default

    I always miss some more statistical information about these benchmarks. Whats the RMDS (root mean square deviation) like with 10 runs per day for example?

  3. #3
    Join Date
    Aug 2007
    Location
    Europe
    Posts
    400

    Default

    Quote Originally Posted by crispy View Post
    I always miss some more statistical information about these benchmarks. Whats the RMDS (root mean square deviation) like with 10 runs per day for example?
    crispy, I miss some more statistical information about the RMDS. What is the power of the RMDS (root mean square deviation) with 10 runs per day?

    Is the RMDS the correct tool for this? Is the data normally distributed? Do you want to find outliers? What do you want to do? Maybe some more robust statistics are the way to go here - http://en.wikipedia.org/wiki/Robust_statistics.

  4. #4
    Join Date
    Oct 2008
    Posts
    134

    Default

    Sorry, some time since ive done these error calculations, what I was thinking of was standard deviation.

  5. #5
    Join Date
    Aug 2007
    Location
    Europe
    Posts
    400

    Default

    Quote Originally Posted by crispy View Post
    Sorry, some time since ive done these error calculations, what I was thinking of was standard deviation.
    Don't worry, you are not alone.

    Most people unfortunately do use standard deviation, ordinary least regression, etc.

    However, these methods are very very sensitive to outliers. Their high power are only applicable under very restricted circumstances, that rarely are valid. They can be good to detect outliers, but, many wish to determine a trend or a reliable central value for few observations, and then they are really bad.

    Look at http://www.mathworks.com/access/help.../robustfit.gif to get a picture.

    The R application (http://www.r-project.org/) and its robust package (http://cran.r-project.org/web/views/Robust.html) is a good start to get a feel for how it behaves for different data sets.

    Please, have a peek.

    Trust me, you will never want to go back to standard deviation based statistics!


  6. #6
    Join Date
    Oct 2008
    Posts
    134

    Default

    So you agree we should get just a little of some kind of statistical information about the data right?

  7. #7
    Join Date
    Aug 2007
    Location
    Europe
    Posts
    400

    Default

    Quote Originally Posted by crispy View Post
    So you agree we should get just a little of some kind of statistical information about the data right?
    There are some available already. Look at http://www.phoromatic.com/kernel-tracker.php? There you can set thresholds for relative changes.

  8. #8
    Join Date
    Oct 2008
    Posts
    134

    Default

    Im not interested in regressions and the threshold for changes is not what im looking for either.

    But it would be very nice with a larger dataset to calculate standard deviation for each datapoint and plot it something like this: http://www.access-board.gov/research...ge002_0004.gif

    Then it will give us an idea if the changes in performance for each kernel version is a result of the code or just standard variations of the dataset...

    I mean, in any type of science you do this simple data analysis...

  9. #9
    Join Date
    Nov 2007
    Location
    Die trolls, die!
    Posts
    525

    Default

    @crispy

    I'd really like to see this, too. I am interested in the range of the values.

  10. #10

    Default

    Quote Originally Posted by crispy View Post
    Im not interested in regressions and the threshold for changes is not what im looking for either.

    But it would be very nice with a larger dataset to calculate standard deviation for each datapoint and plot it something like this: http://www.access-board.gov/research...ge002_0004.gif

    Then it will give us an idea if the changes in performance for each kernel version is a result of the code or just standard variations of the dataset...

    I mean, in any type of science you do this simple data analysis...

    The Phoronix Test Suite already does this if you run a result file with "phoronix-test-suite analyze-all-runs", it just is not implemented in the web interface on Phoromatic at this time. Though you can assume that it is always less than 3.5% deviation otherwise the run-count dynamically increases (see an earlier Phoronix posting about statistical significance and Phoronix Test Suite). Other stats can be added in if you 1. provide patches or 2. explain it all quite well what you would like and how.

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •