fooled by average #1: “our company pays averagely the same as our competitor”

August 2, 2010

are you familiar with this ” our company pays averagely the same as our competitor?”

Regardless the use of “average” in above sentence is intentional or simply out of misunderstanding, it is wrong to use only it for representing a population or a group of data.

I’ve been amazed that until today, many people is still using just “average” or “mean” as the representation of a set of data. And more people just take it without question. With the widespread use of spreadsheet, we have to be better than that.

“Average” could be misleading, because it does not tell us the distribution of the data or population. I’ll show you why.

Imagine there are two company: Company A and Company B, each has 10 employees (to make it simple).  In below table, I list the salary of each  employee. NOW, you can see that the AVERAGE of employees salary for Company A equals to Company B. But, you know they are NOT the SAME, don’t you?

When the sample is only 10 data points, you can see with your eyes that there are difference. But if the data points are more than 200, you need other tool.

In this case, standard devation can help.

Standard deviation shows the variation of data in one group. For instance, in Company B, there is one employee (maybe the CEO) who has very high salary while there are several employees are paid less than 4; it means, the variance is high–> shown by the higher standard deviation vs. Company A.

Hence, while the average salary is the same, Company B does not pay employees similar to Company B.

In summary, whenever you hear someone tell you that average X equals to average Y, you need to ask “what is the standard deviation?”. You need to see whether the variance is also similar and whether there is an outlier that drag the average up or down.


basic principle: you should know the shape/distribution of data, standard deviation and the average/median for making a simple conclusion of data

simple, not simplistic

July 7, 2010

I am a fan of simple solution, but NOT simplistic one.

Simplistic means oversimplification of a problem (without deep understanding of more data/facts), taking the most “obvious” solution, taking shortcut and the fastest result. For instance, in simplistic solution, if you cannot sleep for more than 2 nites you are taking some sleeping pills to help you. That’s simplistic.

Simple means taking the uncomplicated solution based on understanding of the facts and data. Uncomplicated means you don’t need the 100% accuracy for solving all the issues but focus on critical few over the trivial many; hence, the 80/20 rule. In the case of you cannot sleep, you will try to collect info/facts on why you cannot sleep (may need an experience people/doctor).  Your root causes could be:

a. stress from office workload

b. taking too much nap during the day

c. your husband is snoring*

In simple solution, you need to take care whichever the real problem is. In either case, none of taking any sleeping pill will help. In fact, that solution can bring more harm because not only it hides you from the real root cause but also it will create another problem such as addiction.

This is quite common sense, but the issue is many big issue comes from this simple thing.

I’ll show you in my next post.


if the real cause is because your husband is snoring while sleeping, the solution is not kick him out of the house. That’s simplistic:). The simple solution can be just using a pair of earplug?