
Originally Posted by sabriah
Oh, I notice you use the geometric mean. What is the rationale for that?
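* The geometric mean is typically used for quantities that combine multiplicatively, such as exponential growth rates or ratios normalized against a baseline. It is less sensitive to large outliers than the arithmetic mean.
* The harmonic mean is typically used for rates, such as MB/sec. It is not very sensitive to large outliers, but a single very small value pulls it down sharply.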
* The median is very good at showing the typical result and is the least sensitive to outliers and junk data.
* The arithmetic mean is the most sensitive of the four to large outliers and noisy data.
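To make the comparison concrete, here is a rough sketch using Python's standard `statistics` module. The timings are made up for illustration (four runs at 10 and one slow run at 100), and `summarize` is just a hypothetical helper name, not anything from the benchmark being discussed.

```python
import statistics

def summarize(samples):
    """Return the four averages discussed above for a list of positive numbers."""
    return {
        "arithmetic": statistics.mean(samples),
        "geometric": statistics.geometric_mean(samples),  # needs Python 3.8+
        "harmonic": statistics.harmonic_mean(samples),
        "median": statistics.median(samples),
    }

# Hypothetical timings: four normal runs and one slow outlier.
timings = [10, 10, 10, 10, 100]

for name, value in summarize(timings).items():
    print(f"{name:>10}: {value:.2f}")
```

With this data the arithmetic mean comes out at 28, the geometric mean at about 15.85, the harmonic mean at about 12.20, and the median stays at 10, so the slow run shows up most clearly in the arithmetic mean.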
As you want to find the outliers as indicators of a regression, and not ignore them, I guess the traditional arithmetic mean may after all be the most relevant for you. Any takers?