I'm struggling to figure out the right model specification for my data.
Here's my design:
a between-subject longitudinal study, with two Type
of participants (Test and Control), with 8 Subject
in each Type, each of which is tested multiple times (different numbers of times across participants, unfortunately). At each test, subjects produce a lot of speech, from which I've derived a measure of their language development (MLU
, continuous variable ranging from ~1 to ~3). The DV is, for each sentence they say, whether it contains a certain Word
(binary, coded as 1 or 0).
I'm interested in whether the two Types
of people change differently on the DV, as their MLU increases -- ie. the interaction between Type and MLU. I also want to account for random effects of Subject (though I'm not sure how possible this is with only 8 subjects per type).
I'm struggling to figure out the right 'maximal' model. I've come up with something like this, but I'm not sure if it's conceptually right:
Word ~ MLU*Type + (MLU|Type/Name)
One conceptual question I'm unclear about is that it seems there are, in principle, two different ways to test for an effect of Type in this design, and I'm not sure how different they are, or which one makes more sense. One approach is just to treat Type as a standard fixed effect. The other is to compare two models that differ on whether Type is included as a Random effect. E.g. something like:
model1<- MLU + (MLU|Type/Name)
model2<- MLU + (MLU|Name)
anova(model1, model2)