I have the following distribution of payment receipts with respect to their due date. Anything less than zero is prepaid and after zero is late. How should I formulate my basic approach to modelling the distribution of lateness?
I have considered a log normal model, but this would not work with the prepayments with a negative due difference. It is too skewed to use a normal distribution (M3 = 1.78) and the excess kurtosis is 3.54 (M4 = 6.54). I've also considered a Gaussian smoothing algorithm, but this is far too slow (10,000's of payments would need to be iterated over 100,000s of times for my purposes). Therefore, I'm looking to model this with a continuous function that is not an infinite sequence.
The goal is to compute a discrete probability for each number of days late (or early) for a future payment due.