I'm modeling the ratio of total medical cost in one 12 month period and the subsequent 12 month period (total_medical_cost_ratio = post_total_medical_cost/pre_total_medical_cost).
The data is skewed, heavy-tailed with 1.1% of the values = $0 in the pre period.
What is the best way to represent the ratio of two values when the range of values in the denominator includes zero?
I'd prefer not to exclude these records, which is throwing away valuable data. I've also tried substituting 0.01 for 0 in the denominator, but this increases the skewness and kurtosis of the data more than I'm comfortable with (mean pre/post ratio inflates to 18,000 from 4.4!).