I am working with a dataset containing ~300 predictors and ~3000 observations and building a predictive model using elastic net (and hoping to generalize to an external validation set). While the majority of observations are complete cases, there are some observations with missing values in either some of the predictors, the outcome, or both. My current approach is to remove all observations with missing outcome data from the analysis, and use mice (in R) to perform multiple imputation for missing values of the predictors. To me, this was the approach that made sense intuitively, as I was concerned about reporting performance metrics on observations that did not have observed values of the outcome.
However, I have seen that it may be valid to include observations with missing outcome data in the dataset, and let the outcome values also be handled through multiple imputation. I was curious about the conditions in which one method would be preferred over the other, if any. My suspicion is that it may be better to re-do these analyses while also imputing missing outcome data and missing predictor data, rather than using this "complete outcome" approach. Any insight is appreciated, and I'm happy to provide more information if needed!