I'm playing around with data with quite few observations (~50) and several observed features. I assessed each of these features for significant correlation with the target variable and then used the significantly correlating features to build an OLS model. However, the adj. R² of the OLS is now lower than some of the correlations between the target variable and the individual predictors. I checked the residuals and they look fine.
Assuming the following situation:
- Adj. R² of the OLS is around 0.5
- R of some of the predictors up to 0.1 above that
- All p-vals of the correlations are smaller than 0.001
My gut feeling would tell me to discard the OLS and use the single feature that correlates best with the target variable. Is there any way, in which the OLS could be superior to that?
Edit, 2020-07-06: Changed R² to R for the individual predictors, as JohnnieThick pointed out.