My GLM is as follows:
logit.final <- glm(Claim_Occurrence ~ Sum.Insured100kto200k + Sum.Insured200kto300k +
Sum.Insured30kto50k + Sum.Insured50kto100k +
Sum.Insured300Kplus,
family = binomial(link = "logit"), offset = Exposure.Years.Earned)
I am trying to predict whether a claim will be reported in a vehicle or not, based on sum insured. The base level of the Sum.Insured
categorical variable is Sum.Insured0to30K
. Exposure years is the offset term, which is between 0 and 1. For example, a 0.5 would mean 6 months and 1 would mean a year.
If the fitted intercept is -2.64997, does this mean the odds of a claim occurring in a vehicle with sum insured 0 to 30K is 7.07% (i.e., $\exp -2.64997)$)? Would the offset term have any influence on this odds / interpretation?
EDIT:
I read somewhere that the coefficient of an offset is 1. So to incorporate the offset in my interpretation, would the odds be $\exp(-2.64997 + 1) = 19\%$?
EDIT 2:
Okay, as per advise in the answer, I have removed Exposure Years Earned from offset term, and included it as a predictor.
My revised glm model is now as follows:
logit.final <- glm(Claim_Occurrence ~ Sum.Insured100kto200k + Sum.Insured200kto300k + Sum.Insured30kto50k + Sum.Insured50kto100k + Sum.Insured300Kplus + Exposure.Years.Earned, family = binomial(link = "logit"))
My intercept is now -3.6464, and coeff estimate of Exposure years earned is 2.0046.
So if I want to find probability of claim occurrence of a vehicle with sum insured 0 to 30K, and exposure years earned worth of 1.083, would it be Exp(-3.6464) x Exp(2.0046) x 1.083 = 20.98% ?