It's easy to think of factors that airlines take into account, and how the price will vary over time. There are no shortage of articles or medium posts about that. But how can I frame this as a regression problem?
I think this reduces to something like estimating the rate of demand, given time till takeoff, the price of the ticket, and supply of similar seats (any pricing model ought to contain these factors, bare minimum). Okay, smells a bit like Poisson regression, and at time $t$ the airline prices its flights in order to maximize expected profit over some time interval, perhaps subject to some risk constraint. But I think there are some problems with that:
- The Poisson's implied variance in demand is probably much too low. As far as I know, noise due to hidden variables will be hard to incorporate because of their stochastic nature (otherwise, negative binomial would do the trick).
- I don't see a straightforward way to update to "stochastic state changes". Suppose tickets are selling like hotcakes for a particular destination at a certain time - obviously demand for this flight is higher than our model thought, so we should raise the ticket price to reflect that. For the same reason as 1, I don't think there's a simple update rule to employ, like in a Kalman filter.
- This model completely neglects carrying capacity. Not many people need to fly from Wyoming to Alaska, but if the flight is almost fully booked early on, then the pool of potential customers could be exhausted. That means prices should go down, not up.
While I understand that airlines in perfect competition don't want to tell each other how they do this, I wonder if there is a simple answer I'm not thinking of, or if there's another industry with similar problems that can afford to release literature.