I am aiming to create an index and I have 4 individual variables that I want to put in one component. I would use this index as an explanatory variable in multiple regression analysis later on.
- However, as the correlation between those 4 individual variables is low and they are mostly categorical and binary (but I have normalized them), it doesn't make sense to do a principal component analysis. Is this correct?
- As an alternative I thought I could do a confirmatory factor analysis and then create factor scores. Can I use those factor scores then in the regression as explanatory variable or do I introduce then any kind of bias?