Consider the following scenario: A group of students (N=85) is given a test to assess their knowledge, the test contains a single question with four possible responses: 0,1,2,3. Students have to make a test twice: before and after a class, to check whether the class had any effect on their knowledge.
Question: which statistical test is the most appropriate in this setting? (paired t-test, Wilcoxon, permutation test, chi-square with several categories, etc)
What bothers me is the fact, that the integer numbers are not real (R) numbers, but rather 'categorical', so application of the t-test etc doesn't seem to me the best approach (although the sample size is relatively large, which in principle, does not contradict the t-test assumptions)