I'm playing with the Titanic data set and trying to figure out what to do about the results I got from an lm that predicts the age of the passenger.
How should I handle the Cabin values? The coefficients for some Cabin levels are significant, while others aren't -- but they all come from the same 'Cabin' column.
So if I want to predict the age of a passenger, should I omit the cabin levels that aren't significant? What's best practice here?
Thanks!