Hi, I have a set of data (messages of different sizes) in Excel, with over 5000 entries, like this:
MessageName      MessageSize
message1         0.5M
message2         10.2M
message3         2.1M
...
message n-1      40.52M
message n        12.12M
I have successfully generated a frequency table and a bar chart from it, i.e. how many messages fall into each size range:
0 - 5M       4000
5 - 10M      454
10 - 15M     234
...
45 - 50M     2
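For anyone who wants to reproduce the binning outside Excel, here is a minimal Python sketch. It uses only the five sizes listed above, not the full data set, and the name `frequency_table` is purely illustrative.

```python
from collections import Counter

# Only the five sizes shown in the question (in MB); the real data has over 5000 entries.
sizes = [0.5, 10.2, 2.1, 40.52, 12.12]
BIN_WIDTH = 5  # MB, matching the 0 - 5M, 5 - 10M, ... ranges


def frequency_table(values, width):
    """Count how many values fall into each [k*width, (k+1)*width) bin."""
    counts = Counter(int(v // width) for v in values)
    return {(k * width, (k + 1) * width): counts[k] for k in sorted(counts)}


table = frequency_table(sizes, BIN_WIDTH)
for (low, high), n in table.items():
    print(f"{low} - {high}M  {n}")
```

Empty bins are simply left out of the result, so a bar chart built from it would need the missing ranges filled in with zero.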
Below are some statistics for this set of data:
Statistic             Value
Mean                  2.085443743
Median                0.884527206
Mode                  9.762702942
Skewness              5.067491934
Kurtosis              37.95900794
Standard deviation    3.642138843
Min                   0.006651878
Max                   49.12574959
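The same kind of summary can be sketched in Python. This assumes the skewness and kurtosis figures came from Excel's SKEW and KURT, which use the bias-adjusted sample formulas implemented below; the function names are illustrative and the input is again only the five sizes listed earlier.

```python
import statistics

# Only the five sizes shown in the question (in MB), not the full data set.
sizes = [0.5, 10.2, 2.1, 40.52, 12.12]


def sample_skewness(xs):
    """Bias-adjusted sample skewness (the formula used by Excel's SKEW)."""
    n = len(xs)
    mean = statistics.mean(xs)
    s = statistics.stdev(xs)  # sample standard deviation (n - 1 denominator)
    return n / ((n - 1) * (n - 2)) * sum(((x - mean) / s) ** 3 for x in xs)


def sample_excess_kurtosis(xs):
    """Bias-adjusted sample excess kurtosis (the formula used by Excel's KURT)."""
    n = len(xs)
    mean = statistics.mean(xs)
    s = statistics.stdev(xs)
    fourth = sum(((x - mean) / s) ** 4 for x in xs)
    return (n * (n + 1) / ((n - 1) * (n - 2) * (n - 3)) * fourth
            - 3 * (n - 1) ** 2 / ((n - 2) * (n - 3)))


print("mean    ", statistics.mean(sizes))
print("median  ", statistics.median(sizes))
print("std dev ", statistics.stdev(sizes))
print("skewness", sample_skewness(sizes))
print("kurtosis", sample_excess_kurtosis(sizes))
```

For a normal distribution both skewness and excess kurtosis would be close to 0, which is what makes the values of about 5.07 and 37.96 above stand out.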
But when I tried to draw a normal distribution in Excel with NORMDIST(), the result turned out very ugly, probably because the data is not normally distributed: the mean, median and mode are all quite different.
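One way to see the mismatch numerically is to ask what a normal distribution with the mean and standard deviation above would imply. The sketch below uses Python's `statistics.NormalDist` in place of NORMDIST(); the variable names are illustrative.

```python
from statistics import NormalDist

# Mean and standard deviation taken from the statistics listed above.
MEAN = 2.085443743
STD_DEV = 3.642138843

fitted = NormalDist(mu=MEAN, sigma=STD_DEV)

# Share of messages the normal curve would place below 0 MB (impossible for a size).
p_below_zero = fitted.cdf(0)

# Share the normal curve would place in the first bin, 0 - 5M.
p_0_to_5 = fitted.cdf(5) - fitted.cdf(0)

print(f"P(size < 0)      = {p_below_zero:.3f}")
print(f"P(0 <= size < 5) = {p_0_to_5:.3f}")
```

With these parameters the normal curve puts more than a quarter of its mass on negative sizes, which cannot happen for message sizes whose minimum is above zero.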
How can I find out what kind of distribution the data follows, or fit a statistical model to it?
Thanks a lot.
I think the stats site here is a great community: lots of useful answers in a short while, just like Stack Overflow :)