How are measures of central tendency affected by the shape of the data?
Answer
Common measures of central tendency are the mean, median, and mode. The mode of the data is affected by the number of peaks in the distribution. Multiple peaks in the data result in multiple modes. While the number and location of the peaks have some impact on the mean and median, these estimators are also affected by whether or not the data is skewed.
Skew refers to whether or not the data is symmetrical. If the data is right-skewed (positive skew) it has a tail on the right side, and if the data is left-skewed (negative skew) it has a tail on the left side. If the data is not left- or right-skewed, it is symmetrical.
If the data is relatively symmetrical, the mean and median will produce similar estimates of the center. However, the graphs below show that when the distribution is skewed, the mean is pulled further in the direction of the tail than the median. For this reason, the median is often a more useful estimator of center when the distribution is heavily skewed or there are extreme outliers in the data.