0

I have constructed a zero-inflated beta regression model in gamlss. I find the output of this model somewhat confusing to interpret and was hoping that someone may be willing to correct my interpretation, fill in the blanks or correct me where I am wrong?

My model describes the impact of several environmental and climatic variables on variation in the maximum abundance count for rabbits. A value of 0 for prop.abun.max represents the maximum abundance count for a particular three-day monitoring period and a value of 1 (re-coded to 0.9999) for prop.abun.max represents an abundance count of 0. Values of prop.abun.max >0 and <1 represent an abundance count that was less than the maximum count for that monitoring period and greater than 0. For ever three day monitoring period I have one maximum abundance count, represented as 0, and two abundance counts that are less than the maximum, represented as their variation away from the maximum, i.e >0 but <=1.

As I have so many 0's in my data and the values of prop.abun.max vary from 0-1, I have implemented a zero-inflated beta regression. My model is below.

Family:  c("BEZI", "Zero Inflated Beta") 

Call:  gamlss(formula = prop.abun.max ~ season + time + temp +      last.rain.bom + rain + wind + cloud + re(random = ~1 |  
    site), sigma.formula = ~1, nu.formula = ~season +      time + temp + last.rain.bom + rain + wind + cloud +  
    re(random = ~1 | site), family = BEZI, data = na.omit(subset3)) 

Fitting method: RS() 

------------------------------------------------------------------
Mu link function:  logit
Mu Coefficients:
                Estimate Std. Error t value Pr(>|t|)    
(Intercept)   -0.6400525  0.3356874  -1.907 0.057181 .  
seasonAutumn   0.0836078  0.1546704   0.541 0.589075    
seasonWinter  -0.2218074  0.1958630  -1.132 0.258029    
seasonSpring  -0.0319850  0.1473012  -0.217 0.828195    
time           0.0581566  0.0621388   0.936 0.349807    
temp          -0.0129982  0.0104319  -1.246 0.213392    
last.rain.bom  0.0044248  0.0119330   0.371 0.710954    
rain          -0.0168664  0.0248624  -0.678 0.497864    
windlight      0.0490963  0.1109939   0.442 0.658455    
windmedium     0.1446764  0.1686862   0.858 0.391522    
windstrong     0.6715118  0.1935037   3.470 0.000569 ***
cloud         -0.0008299  0.0013417  -0.619 0.536512    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

------------------------------------------------------------------
Sigma link function:  log
Sigma Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)  1.55686    0.07311    21.3   <2e-16 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

------------------------------------------------------------------
Nu link function:  logit 
Nu Coefficients:
               Estimate Std. Error t value Pr(>|t|)  
(Intercept)    0.215386   0.635802   0.339   0.7349  
seasonAutumn  -0.195175   0.312707  -0.624   0.5328  
seasonWinter  -0.233812   0.384758  -0.608   0.5437  
seasonSpring  -0.092157   0.291694  -0.316   0.7522  
time          -0.260762   0.117130  -2.226   0.0265 *
temp          -0.012965   0.020335  -0.638   0.5241  
last.rain.bom  0.011202   0.023347   0.480   0.6316  
rain           0.050998   0.035835   1.423   0.1554  
windlight     -0.082610   0.213268  -0.387   0.6987  
windmedium    -0.035651   0.323898  -0.110   0.9124  
windstrong     0.020968   0.409759   0.051   0.9592  
cloud          0.001351   0.002611   0.518   0.6050  
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

------------------------------------------------------------------
NOTE: Additive smoothing terms exist in the formulas: 
 i) Std. Error for smoothers are for the linear effect only. 
ii) Std. Error for the linear terms maybe are not accurate. 
------------------------------------------------------------------
No. of observations in the fit:  503 
Degrees of Freedom for the fit:  41.62353
      Residual Deg. of Freedom:  461.3765 
                      at cycle:  7 
 
Global Deviance:     367.5272 
            AIC:     450.7743 
            SBC:     626.4501 

From this post I understand that the zero-inflated beta model essentially has three components, one (Mu) that describes the mean of the beta distribution for the interval (0,1), a second (Sigma) that represents the precision (or shape) of the beta distribution, and a third (Nu) that represents the Bernoulli distribution part of the model and the probability of observing a zero value.

Now how to interpret the model output? Both the Mu and Nu components of the model have a logit link, so if we exponentiate these coefficients we get odds ratios. If we exponentiate the significant coefficient in the Nu component of the model, time = -0.012965, we get an odds ratio of 0.77. This suggests that for ever unit increase in time there is a 30% lower likelihood of a zero (maximum abundance in this case).

I now get lost when interpreting the Mu component of the model. For example, in the Mu component of the model, the coefficient of time is 0.0581566, which exponentiates to an odds ratio of 1.06. What is the interpretation of this given that the response is not discrete 0/1, but rather is continuous between the bounds of 0 and 1? Is the interpretation still in reference to the odds/likelihood of a 0/1? Does this mean for every unit increase in time there is a 6% higher likelihood of a 1? How would the significant coefficient for windstrong = 0.6715118 be interpreted, I assume it is interpreted relative to the reference category for wind which is no wind?

Pat Taggart
  • 677
  • 2
  • 9

0 Answers0