1

I have data of number of customers entering a business and number of customers leaving a business every 15 minutes. I need to predict the number of people that will be entering and leaving the business on the next 15 and 30 minutes. I can also save old data for 2-3 weeks.

I want a simple model with a high success rate. I read about linear regression and also read that this might not be the best model cause of negative values and Poisson regression might suit better.

I would be happy to read recommendations regarding my situation.

tmnt13
  • 11
  • 2

1 Answers1

1

You probably have seasonality, or more precisely , since you may have intra-daily patterns, and these may differ by the day of week.

In addition, your data are . Unless you have large numbers (rule of thumb: more than 30 in each 15 minute bucket), you should address this and not use a normal distribution approximation.

I would recommend a Poisson regression with predictors accounting for the seasonalities, e.g., using harmonics or bump functions. Two weeks of historical data is very little to capture seasonalities, though, especially differing patterns by day of week. Try to get more data.

Be sure to use a good accuracy measure for evaluating your forecasts, e.g., the . If you use the , your forecasts may be biased, or you may "find" that a flat zero forecast is "best":

Alternatively, and even better, use proper to assess your forecasts. I have written a little paper (Kolassa, 2016, IJF) on this that you may find interesting.

Stephan Kolassa
  • 95,027
  • 13
  • 197
  • 357