I have to test which factors influence game damage in fields. I mapped areas with damage and those without. It was not always possible to map 100% of a field, so there are also areas where it is unsure. Since the "unit" damage is not objective because it is not possible to determine where one ends and the next starts, I put a grid over the area and calculated for every cell independently the distance to different structures (forest, roads etc.). The resulting data look like this:
| damage | id | dist_forest | dist_maiz | dist_roads |...
|0 | 51| 30| 20| 70|...
|0 | 51| 20| 10| 60|...
|0 | 52| 60| 10| 80|...
|0 | 52| 40| 70| 10|...
|0 | 52| 20| 60| 50|...
|1 | 53| 10| 10| 50|...
|1 | 53| 05| 20| 30|...
|1 | 54| 20| 30| 20|...
|1 | 54| 30| 20| 90|...
|1 | 54| 40| 10| 10|...
(I have about 100 individual polygons which lead to 100000 lines when resolved in square meters)
I wanted to use a binary logistic regression with random effects. To solve the problem with the non-independent data I was adding the id of the damage-polygons as a random factor. The resulting model was like this:
glm <- glm(damage ~ dist_forest + dist_maiz + dist_roads + (1|cat), family=binomial(logit),data=data)
The problem ist now, that all my parameters are all highly significant. I already asked that here and it was suggested to use a special model for spatial data.
if data are (substantially) spatially dependent then standard significance tests don't apply any way!
Does anybody have any suggestions how to proceed further?