I am working on developing an insurance risk predictive model. These models are of "rare events" like airline no-show prediction, hardware fault detection, etc. As I prepared my data set, I tried to apply classification, but I couldn't obtain useful classifiers because of the high proportion of negative cases.
I don't have a lot of experience in statistics and modeling data beyond a high school statistics course so I'm kinda confused.
As first thought, I have been thinking of using an inhomogeneous Poisson process model. I classified it based on event data (date, lat, lon) to get a good estimate of the chance of a risk at a particular time on a particular day in particular place.
I'd like to know, what are the methodologies/algorithms to predict rare events?
What do you recommend as an approach to tackle this problem?