interrater-reliability with likert-scale and different raters

Question

I am collecting survey data on a 5-point Likert scale. Each rater answers 18 questions (probably not the same questions for every rater), and each question will have answers from 10 different raters. How can I calculate the inter-rater reliability so that I can eliminate a rater who was giving random answers?

A rough example of the data, where x is the rater answer.

One possible approach, at the question level, is to calculate the difference between one rater's answer and the average of the other 9 raters' answers to the same question, and to eliminate raters whose difference exceeds some threshold. Alternatively, I could use the intraclass correlation, ICC(1,k), but then how would I know which raters to eliminate? Are there any other possible ways?
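The first idea above (compare each answer with the average of the other raters on the same question) can be sketched roughly as follows. This is only an illustration: the nested-dict layout, the function name `leave_one_out_deviation`, and the toy data are assumptions, not part of the actual survey.

```python
from collections import defaultdict


def leave_one_out_deviation(ratings):
    """Mean absolute deviation of each rater from the other raters.

    ratings: {question_id: {rater_id: score}} (hypothetical layout).
    Raters may answer different subsets of questions.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for answers in ratings.values():
        n = len(answers)
        if n < 2:
            continue  # no other rater to compare against
        total = sum(answers.values())
        for rater, score in answers.items():
            # Average of the other raters' scores on this question.
            others_mean = (total - score) / (n - 1)
            totals[rater] += abs(score - others_mean)
            counts[rater] += 1
    return {rater: totals[rater] / counts[rater] for rater in totals}


# Hypothetical toy data: rater "C" disagrees with "A" and "B".
ratings = {
    "q1": {"A": 4, "B": 4, "C": 1},
    "q2": {"A": 2, "B": 2, "C": 5},
}
deviations = leave_one_out_deviation(ratings)
```

A rater with a clearly larger average deviation than the rest would be a candidate for removal; where to put the threshold remains a judgment call.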

Any help would be great, thanks.

score 2 · Accepted Answer · answered Mar 06 '19 at 10:29


I eventually used Krippendorff's alpha, following the intuition in this question: Comparing inter-rater agreement between classes of raters
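As a rough illustration of how Krippendorff's alpha could be used to spot a rater, here is a minimal sketch that computes alpha with missing data allowed and then recomputes it with each rater left out. It is not necessarily the procedure actually used here. It assumes the interval distance (squared difference), which treats the Likert points as equally spaced; an ordinal distance is a common alternative. The data layout, function names, and toy data are hypothetical.

```python
from itertools import permutations


def krippendorff_alpha_interval(ratings):
    """Krippendorff's alpha with the interval metric (a - b) ** 2.

    ratings: {question_id: {rater_id: score}} (hypothetical layout).
    Questions with fewer than two answers are not pairable and are skipped.
    """
    units = [list(a.values()) for a in ratings.values() if len(a) >= 2]
    values = [v for unit in units for v in unit]
    n = len(values)
    if n < 2:
        return float("nan")
    # Observed disagreement: ordered pairs within each question.
    d_o = sum(
        sum((a - b) ** 2 for a, b in permutations(unit, 2)) / (len(unit) - 1)
        for unit in units
    ) / n
    # Expected disagreement: ordered pairs across all pairable values.
    d_e = sum((a - b) ** 2 for a, b in permutations(values, 2)) / (n * (n - 1))
    if d_e == 0:
        return float("nan")  # every score identical, alpha is undefined
    return 1.0 - d_o / d_e


def alpha_without_each_rater(ratings):
    """Recompute alpha with each rater removed in turn."""
    raters = {r for answers in ratings.values() for r in answers}
    return {
        dropped: krippendorff_alpha_interval(
            {q: {r: s for r, s in a.items() if r != dropped}
             for q, a in ratings.items()}
        )
        for dropped in raters
    }


# Hypothetical toy data: rater "C" disagrees with "A" and "B".
ratings = {
    "q1": {"A": 4, "B": 4, "C": 1},
    "q2": {"A": 2, "B": 2, "C": 5},
    "q3": {"A": 5, "B": 4, "C": 2},
}
alpha_all = krippendorff_alpha_interval(ratings)
alpha_without = alpha_without_each_rater(ratings)
suspect = max(alpha_without, key=alpha_without.get)
```

The rater whose removal raises alpha the most (`suspect` in this sketch) is the one who agrees least with the others. With real data, an existing implementation would be a safer choice than this hand-rolled version.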


Lossan


Krippendorff's alpha is a good measure to use to help assess the idea of "agreement." – LSC Mar 06 '19 at 11:11
