How to check psychometric type of scales?

Question

It is well known, that there are diffrent types of scales (see Wikipedia Level of measurement).

In psycholinguistics there is Semantic Differential technique. It uses a scale (usually 5 or 7 grades) which type as ordinal or interval is being debated. As Wikipedia says, the majority think it is ordinal. But at the same time processing researchers use arithmetic mean and Factor Analysis: and it uses + and - math operations. Is it critical error? May be, nevertheless, in fact the intervals between closest grades are equal - how to check it? What is the cause of scale's type?

Of possible interest: [How to transform ordinal data from questionnaire into proper interval data?](http://stats.stackexchange.com/q/27927/930) or [Measuring the Mind](http://en.wikipedia.org/wiki/Measuring_the_Mind), by Denny Borsboom. — chl, Oct 06 '13 at 08:50

score 3 · Answer 1 · edited Oct 06 '13 at 10:12

3

You don't really "check" the type of scales; you have to use logic and reasonableness to figure out what is sensible to do. Stevens' set of scales is not set in stone and has problems.

Likert scales are, in fact, in between ordinal and interval.

Technically, an ordinal scale is one in which any transformation can be applied that retains order and the meaning will stay the same. So, you could code a Likert scale 1, 2, 3, 4, 5. Or 0, 1, 2, 3, 4. Or 0, 0.00001, 2, 17, 19101821.2. But the last one isn't reasonable.

For factor analysis, much has been written. Search on "ordinal factor analysis" and you'll find some things. There are articles by e.g. Joreskog and Sorbom. Different people have different opinions.

Can you add Likert items? Technically, no. But people do so all the time and it gives reasonable results.

On the other hand, some additions make little sense.

edited Oct 06 '13 at 10:12

Nick Cox

48,377
8
110
156

answered Oct 05 '13 at 22:59

Peter Flom

94,055
35
143
276

3

I believe the scaling device you choose reflect practical matters and an underlying measurement model. [Likert scales](http://en.wikipedia.org/wiki/Likert_scale) were proposed as a way to scale persons (not stimuli, in contrast with Thurstone's approach, or both, as in Guttman's approach), and adding individual response scores altogether would make sense in this context, see, e.g., [Unidimensional Scaling](http://www.sagepub.com/books/Book445) by McIver and Carmines. – chl Oct 06 '13 at 10:10
As usual @chl raises good points – Peter Flom Oct 06 '13 at 10:15

ttnphns · Answer 2 · 2013-10-06T14:29:44.287

@Peter has given a good answer. I just want to add one point: it is important how the scale is presented or formatted.

For most people, the less the scale's notches are subscribed the more the scale is interval rather than ordinal. Compare

(disagree)| --- | --- | --- |(agree)
(disagree)1 --- 2 --- 3 --- 4(agree)
totally disagree --- rather disagree --- rather agree --- totally agree

where the 1st scale is just a grating to measure while the 3rd one is clearly categorical, ordinal. Labels involves verbal semantics what isolates the points away from being landmarks and towards being islands.

Osgood's rating scale (used in semantic differential) is like the 1st or 2nd above; in addition, it is bipolar - that is, two equally fair epithets (or objects) symmetrize the scale, which measures proximity to either of them. Such bipolar proximity measuring device seems to be further closer to interval away from ordinal, in comparison to a unipolar intensity measuring device (such as shown above), because symmetric opposition de-granulates the "landscape" between the opposites.

So, the way scale is typically presented in a semantic differential makes one think it is fairly interval.

With ordinal scale it is of course incorrect to do arithmetics (such as computing mean or summing to a total score) or check whether the data distribution is normal. The distinction between interval and ordinal implies the notion of underlying feature which is measured to produce an observed value. If the relation between the underlying and the observed is assumed to be linear, we speak of interval (equiinterval) scale. If the relation is assumed monotonic and is somehow known (e.g. postulated), then the scale is non-equiinterval; such a scale can be easily transformed into equiinterval.

If the relation between the underlying and the observed is assumed monotonic and unknown, there comes ordinal scale. Ordinal scale can be transformed into interval if the transformation rule is worked out. We may draw such rules from our pragmatic desire of maximizing some quantity in the analysis we conceive. For example, one might want linear correlations between items to be as strong as possible. Then the transformation which maximizes the correlations can be solved for. This process of quantifying categorical data is often referred to as optimal scaling.

I would add that some researchers consider that the way we treat data (e.g., 'ordinal'--that is, a set of ordered response categories--, or 'interval'--leaving alone the controversy that data are truly continuous in very rare cases--) depends on the underlying *measurement model*. Beside Borsboom, De Boeck and coworkers have a nice paper on [distinguishing between categories and dimensions](https://ppw.kuleuven.be/okp/_pdf/DeBoeck2005ACAPF.pdf). A 'practical' aspect of that distinction is the way we conceptualize several mental constructs, e.g. [mental disorders](http://bit.ly/17c4J1N). — chl, Oct 06 '13 at 10:00

score 0 · Answer 3 · answered Oct 08 '13 at 20:49

The question about the `true' nature of a scale is indeed a tricky one. But a quick pragmatic answer is that, depending on the analysis that you are interested in running, it might be fairly straight forward to deal with the scale as ordinal. I would suggest doing so because that would be the more conservative position, as the interval nature of the scale would be a stronger assumption.

Now, regarding how to deal with the data... in general you could review the literature in item response models and the myriad of models offered there. However, if you are in a literature that relies mostly in factor analysis, you can easily deal with ordinal data by using polychoric correlations (See the polycor in R for instance) instead of the traditional Pearson product moment, and then running the factor analysis on that correlation matrix.

Even easier is to use a software like Mplus, which will allow you to declare your variables as ordinal, and it will run the a generalized version of traditional factor analysis automatically.

Again, this is a very interesting and contentious issue, but if your interest is mainly practical, you could simply treat the data as ordinal, and if you are in the mood of exploring, you could also run it under traditional FA and see if/how the results vary.

Just a side note `it might be fairly straight forward to deal with the scale as ordinal`, also `simply treat the data as ordinal`. These are burdening advices. Analysis of ordinal data is more difficult and is less developed than that of scale or nominal data. No "simplicity" there. — ttnphns, Oct 14 '13 at 00:26
Hi, I appreciate that you seem to think that the field of item response theory is underdeveloped, and that indeed it might be more complicated than assuming continuity of the observed variables. However, I do think that considerably improvements have been achieved in software like R, Stata, Mplus and LatentGold that make such analysis considerable more accessible and I believe it is an option worth considering. — David, Oct 29 '13 at 19:31

How to check psychometric type of scales?

3 Answers3