For a multiple linear regression model, I have done two things to preprocess my data:
- I have scaled continuous variables with StandardScaler
- I have encoded categorical variables with OneHotEncoder
My dependent variable is the rating (a float number) that varies from 1.0 to 10.0. Do I need to perform any encoding on that variable? How can it influence my model?
I use scikit-learn
for everything listed above.