Questions tagged [histogram]

A histogram is a graphical representation of the frequency distribution of a continuous variable. The range of the variable is divided into bins and a bar is drawn for each bin, sized in proportion to the number of observations that fall in it.

515 questions
116
votes
4 answers

Assessing approximate distribution of data based on a histogram

Suppose I want to see whether my data is exponential based on a histogram (i.e. skewed to the right). Depending on how I group or bin the data, I can get wildly different histograms. One set of histograms will make it seem that the data is…
guestoeijreor
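The sensitivity to binning described in this question can be illustrated with a minimal Python sketch. The sample values, the bin widths, and the `bin_counts` helper are hypothetical and chosen only for illustration; none of them come from the question.

```python
from collections import Counter
import math

def bin_counts(data, width, origin=0.0):
    """Count observations in equal-width bins [origin + k*width, origin + (k+1)*width).

    Returns a dict mapping each non-empty bin's left edge to its count;
    empty bins are omitted.
    """
    counts = Counter(math.floor((x - origin) / width) for x in data)
    return {origin + k * width: counts[k] for k in sorted(counts)}

# Hypothetical right-skewed sample (illustrative values only).
sample = [0.1, 0.3, 0.4, 0.9, 1.1, 1.2, 1.8, 2.6, 3.9, 5.5]

# The same data drawn with a narrow and a wide bin width.
for width in (0.5, 2.0):
    print(f"bin width {width}:")
    for left, n in bin_counts(sample, width).items():
        print(f"  [{left:4.1f}, {left + width:4.1f})  {'#' * n}")
```

With the wide bins the counts fall off steadily, while the narrow bins show gaps and small bumps, so the apparent shape depends on the bin width as well as on the data.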
111
votes
11 answers

Calculating optimal number of bins in a histogram

I'm interested in finding as optimal of a method as I can for determining how many bins I should use in a histogram. My data should range from 30 to 350 objects at most, and in particular I'm trying to apply thresholding (like Otsu's method) where…
Tony Stark
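Two widely used rules of thumb for choosing the number of bins are Sturges' rule and the Freedman-Diaconis rule. The sketch below is illustrative only: neither rule is named in the question excerpt, the helper names are hypothetical, and the sketch does not settle which rule (if either) suits thresholding.

```python
import math
import statistics

def sturges_bins(n):
    """Sturges' rule: k = ceil(log2(n)) + 1 bins for n observations."""
    return math.ceil(math.log2(n)) + 1

def freedman_diaconis_bins(data):
    """Freedman-Diaconis rule: bin width h = 2 * IQR / n^(1/3).

    The number of bins is the data range divided by h, rounded up.
    """
    n = len(data)
    q1, _, q3 = statistics.quantiles(data, n=4)  # quartile cut points
    width = 2 * (q3 - q1) / n ** (1 / 3)
    if width == 0:
        return 1  # degenerate case: IQR is zero
    return max(1, math.ceil((max(data) - min(data)) / width))

# Sample sizes at the ends of the range mentioned in the question.
for n in (30, 350):
    print(n, "observations ->", sturges_bins(n), "bins (Sturges)")
```

Sturges' rule depends only on the sample size, whereas the Freedman-Diaconis rule also uses the spread of the data through the interquartile range.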
52
votes
5 answers

R - QQPlot: how to see whether data are normally distributed

I have plotted this after I did a Shapiro-Wilk normality test. The test showed that it is likely that the population is normally distributed. However, how to see this "behaviour" on this plot? UPDATE A simple histogram of the data: UPDATE The…
Le Max
48
votes
5 answers

How to assess the similarity of two histograms?

Given two histograms, how do we assess whether they are similar or not? Is it sufficient to simply look at the two histograms? The simple one to one mapping has the problem that if a histogram is slightly different and slightly shifted then we'll…
Mew 3.4
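The shift problem raised in this question can be seen by comparing a bin-by-bin measure with a cumulative one. The sketch below uses the one-dimensional earth mover's distance as one possible shift-aware alternative; that choice, the helper names, and the toy counts are assumptions made for illustration and are not taken from the question or its answers.

```python
from itertools import accumulate

def normalize(counts):
    """Scale counts so they sum to 1."""
    total = sum(counts)
    return [c / total for c in counts]

def binwise_l1(a, b):
    """Bin-by-bin comparison: sum of absolute differences of normalized counts."""
    return sum(abs(x - y) for x, y in zip(normalize(a), normalize(b)))

def emd_1d(a, b):
    """1-D earth mover's distance, in units of bin widths.

    Assumes both histograms use the same equal-width bins.
    """
    cdf_a = accumulate(normalize(a))
    cdf_b = accumulate(normalize(b))
    return sum(abs(x - y) for x, y in zip(cdf_a, cdf_b))

# Hypothetical histograms: one spike, shifted by one bin and by three bins.
base = [0, 10, 0, 0, 0]
near = [0, 0, 10, 0, 0]
far = [0, 0, 0, 0, 10]

print(binwise_l1(base, near), binwise_l1(base, far))  # same value for both shifts
print(emd_1d(base, near), emd_1d(base, far))          # grows with the shift
```

The bin-by-bin measure treats a one-bin shift and a three-bin shift as equally different, while the cumulative measure grows with the size of the shift.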
35
votes
4 answers

How is Poisson distribution different to normal distribution?

I have generated a vector which has a Poisson distribution, as follows: x = rpois(1000,10) If I make a histogram using hist(x), the distribution looks like the familiar bell-shaped normal distribution. However, the Kolmogorov-Smirnov test…
luciano
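A Poisson distribution with mean 10 also has variance 10, and its probabilities lie close to a normal density with the same mean and variance, which is one reason the histogram looks bell-shaped. The Poisson still takes only non-negative integer values. The sketch below compares the two numerically; it is written in Python rather than the R used in the question, and the helper names are hypothetical.

```python
import math

def poisson_pmf(k, lam):
    """P(X = k) for a Poisson distribution with mean lam."""
    return math.exp(-lam) * lam ** k / math.factorial(k)

def normal_pdf(x, mu, sigma):
    """Density of a normal distribution with mean mu and standard deviation sigma."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2 * math.pi))

lam = 10  # the mean used in rpois(1000, 10)
print(" k  poisson  normal")
for k in range(0, 21, 2):
    print(f"{k:2d}  {poisson_pmf(k, lam):.4f}   {normal_pdf(k, lam, math.sqrt(lam)):.4f}")
```

The two columns are close near the mean but not identical, and the Poisson probabilities are defined only at integers.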
26
votes
5 answers

How to test if my distribution is multimodal?

When I plot a histogram of my data, it has two peaks: Does that mean a potential multi-modal distribution? I ran the dip.test in R (library(diptest)), and the output is: D = 0.0275, p-value = 0.7913 I can conclude that my data have a…
user1260391
24
votes
4 answers

Benefits of using QQ-plots over histograms

In this comment, Nick Cox wrote: Binning into classes is an ancient method. While histograms can be useful, modern statistical software makes it easy as well as advisable to fit distributions to the raw data. Binning just throws away detail that is…
MvG
21
votes
4 answers

Comparing two histograms using Chi-Square distance

I want to compare two images of faces. I calculated their LBP-histograms. So now I need to compare these two histograms and get something that will tell how much these histograms are equal (0 - 100%). There are many ways of solving this task, but…
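One common definition of the chi-square distance between two normalized histograms p and q is 0.5 * sum((p_i - q_i)^2 / (p_i + q_i)), which lies between 0 (identical) and 1 (no overlap). The sketch below is a minimal illustration under that definition; other variants differ in the constant or the denominator, and mapping the distance to a 0-100% similarity as 100 * (1 - d) is an assumption made here, not something stated in the question.

```python
def chi_square_distance(h1, h2):
    """Chi-square distance between two histograms with the same bins.

    Both histograms are normalized to sum to 1 first. Bins that are
    empty in both histograms contribute nothing and are skipped.
    """
    total1, total2 = sum(h1), sum(h2)
    dist = 0.0
    for c1, c2 in zip(h1, h2):
        p, q = c1 / total1, c2 / total2
        if p + q > 0:
            dist += (p - q) ** 2 / (p + q)
    return 0.5 * dist

def similarity_percent(h1, h2):
    """Hypothetical mapping of the distance to a 0-100% similarity score."""
    return 100.0 * (1.0 - chi_square_distance(h1, h2))

# Toy histograms (illustrative values, not real LBP histograms).
print(similarity_percent([4, 3, 2, 1], [4, 3, 2, 1]))
print(similarity_percent([1, 1, 0, 0], [0, 0, 1, 1]))
```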
20
votes
1 answer

Goodness of fit for 2D histograms

I have two sets of data representing star parameters: an observed one and a modeled one. With these sets I create what is called a two-color-diagram (TCD). A sample can be seen here: A being the observed data and B the data extracted from the…
Gabriel
20
votes
4 answers

Difference between histogram and pdf?

If we want to visually inspect the distribution of continuous data, which of the two, histogram or pdf, should be used? What are the differences, not formula-wise, between a histogram and a pdf?
user1102
18
votes
1 answer

How the Pearson's Chi Squared Test works

Following a recent down vote I have been trying to check my understanding of the Pearson Chi Squared test. I usually use the chi squared statistic (or reduced chi squared statistic) for fitting or checking the resulting fit. In this case the…
Bowler
17
votes
7 answers

Could any equation have predicted the results of this simulation?

Suppose there is a coin that has a 5% chance of landing on HEADS and a 95% chance of landing on TAILS. Based on a computer simulation, I want to find out the following : The average number of flips before observing HEADS, TAILS, HEADS (note: not…
stats_noob
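For a waiting time like this one, a first-step (Markov chain) argument gives a closed form. The sketch below assumes that HEADS, TAILS, HEADS must occur on three consecutive flips and that the count includes the flip completing the pattern; the excerpt is truncated, so both assumptions may differ from the question. With p = P(HEADS) and q = 1 - p, let E0, E1, E2 be the expected remaining flips with no progress, after H, and after HT. Then E0 = 1 + p*E1 + q*E0, E1 = 1 + p*E1 + q*E2, and E2 = 1 + q*E0, which solve to E0 = (1/p + 1/q + 1) / p.

```python
import random

def expected_flips_hth(p):
    """Expected flips until the first consecutive H, T, H (closed form).

    Solves the first-step equations for states: no progress, 'H', 'HT'.
    """
    q = 1 - p
    return (1 / p + 1 / q + 1) / p

def simulate_hth(p, trials, seed=0):
    """Monte Carlo estimate of the same quantity, for comparison."""
    rng = random.Random(seed)
    total = 0
    for _ in range(trials):
        state = 0  # 0: no progress, 1: seen H, 2: seen HT, 3: done
        flips = 0
        while state < 3:
            flips += 1
            heads = rng.random() < p
            if state == 0:
                state = 1 if heads else 0
            elif state == 1:
                state = 1 if heads else 2
            else:
                state = 3 if heads else 0
        total += flips
    return total / trials

print(expected_flips_hth(0.05))           # about 441.05 for the 5% coin
print(simulate_hth(0.05, trials=2000))    # should land near the value above
```

For a fair coin the same formula gives 10 flips, which matches the standard result for this pattern.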
16
votes
2 answers

QQ plot does not match histogram

I have a histogram, kernel density and a fitted normal distribution of financial log returns, which are transformed into losses (signs are changed), and a normal QQ plot of these data: The QQ plot shows clearly that the tails are not fitted…
15
votes
3 answers

My distribution is normal; Kolmogorov-Smirnov test doesn't agree

I have a problem with the normality of some data I have: I ran a Kolmogorov test, which says the data are not normal, with p = .0000. I don't understand: the skewness of my distribution is -.497 and the kurtosis is -0.024. Here is the plot of my distribution…
14
votes
3 answers

Need help identifying a distribution by its histogram

I have the sample population of a certain signal's registered amplitude maxima. Population is about 15 million samples. I produced a histogram of the population, but cannot guess the distribution with such a histogram. EDIT1: File with raw sample…
mbaitoff