I would like to fit a linear regression model in R for predicting motorbike prices. My dataset has 13 variables, including number of kilometers driven, colour, month of the first registration, etc. The cells of some variables are empty.
Which variables should I use for the linear regression model? Should I use all 13 variables, or should I detect the relevant variables through doing Anova step-by-step and to delete in every step the variable which is not significant, or should I use the Akaike information criterion? What is the best way for the right model?