I'm trying to do feature selection along a dataset which has:
Group Date Metric1, Metric 2, Metric 3
Group 1, 2016-03-01 1.0 1.3 2.0
Group 1, 2016-03-01 1.5 1.5 2.2
Group 2, 2016-03-03 2.0 1.8 2.4
Group 2, 2016-03-04 2.5 1.0 1.0
Group 3, 2016-03-05 1.0 2.0 1.5
Group 3, 2016-03-05 1.1 2.3 1.0
Previously, I analyzed the model using the model for feature selection with:
model <- train(ATV~., data=data, trControl=train_control, method="lasso")
importance <- varImp(model, scale=FALSE)
The problem is that when combined, the group variables are showing a different relation than when separate. The ultimate question is which are the most powerful generalized explanatory variables within each group? (i.e which feature has a powerful explanation of the DV within all groups).
I think this is a similar trait to Simpson's paradox: http://vudlab.com/simpsons/
Sorry if this seems like a basic question, but what I'm trying to understand: How would you recommend doing a generalized feature selection across these groups? I've considered a few other methods, but would like to hear your opinion before moving forward.