I have two data sets to compare. Each is a list of billed amounts by diagnosis codes. The data differs in that the diagnosis codes may be different for some of the billed amounts. There are approximately 32 different diagnosis codes that were used. What type of statistical analysis is most appropriate in comparing these two datasets and why?
I was told to complete a two way anova. Is that correct?