I have a surgical database in which there are approximately 100,000 observations and 200 features. Each observation corresponds to a separate patient while the features correspond to either preoperative, perioperative, or postoperative variables (e.g. preoperative labs, length of operation, and days until death). As such some of the features are factors, while others are continuous data types. The database is not "clean" in that many of the features contain NAs because that data was not collected. Each patient has some feature with NAs in it, so when I try to select only observations without NAs that results in 0 observations.
That being all said my end goal is to see whether any of these preoperative variables might be predictive of mortality. Initially I wanted to use L1 regularization (lasso) via glmnet to perform feature selection, however because of the NAs I can't run it. Are there any alternatives or techniques to bypass this problem? I assume that it's more common than not to have a database with NAs, so I wanted to see if you guys could fill me in. Thanks!