I'm reading a paper here that does an analysis that I find odd, and I'm wondering whether it is reasonable to do what they are doing. The paper isn't available online but I think I can describe the important part:
The authors had a bunch of university students taking a science course, and they measured their "efficacy" at two points in time (pre-test before a course, and then post-test after the course).
They hypothesized that the students with the lowest pre-test efficacy would gain the most from pre to post (and that the students with the highest pre-test efficacy would improve the least). So, they divided students into quartiles based on pre-efficacy score. Then they did a two-way within ANOVA. The IVs were "trial" (pre or post) and "quartile" (1, 2, 3, or 4). The DV was efficacy. They are looking for a trial:quartile interaction (hoping that post-hoc tests will show that the lowest quartile will gain tons and the highest quartile will gain little).
I guess what concerns me is:
The "quartile" is not really a within-subjects IV, because for each subject it is constant. I imagine that in a long format, their data would have had two rows for each subject (for the efficacy scores on pre and post), and quartile would have been the same in both.
The students in the top quartile have the least to gain. The lowest quartile could very well gain more than the top quartile could actually gain (ceiling effect on the top quartile).
Also worries me that the quartile and pre-efficacy scores are highly correlated. I don't know whether this is OK, but to me the quartile is calculable from the pre-efficacy, so including both seems suspicious.
Why quartiles? Why not halves or thirds, or somehow just use the pre-efficacy directly instead of breaking it into quartiles? For example, something like
(post - pre) ~ pre
and then a negative sig effect of pre would show that as pre increases the difference score decreases?
I'm hoping to do an analysis like this, but couldn't get myself to move forward in my current state of unease over the way this analysis was done. I am not a statistician at all so I could be completely wrong here.
Comments are appreciated as to whether the analysis is sound or not.