The best answer to your question is: avoid stepwise selection.
Even in standard linear regression, stepwise selection of predictors is not a good strategy, as explained for example on this page. This is even more important in survival analysis (which seems to be your situation), as omitting any predictor that is associated with survival from a model will lead to bias in the coefficients for the remaining predictors even if they are not correlated with the omitted predictor. See this page for some introduction to this problem
With survival analysis you are typically best off by including as many predictors reasonably related to survival as you can without overfitting: typically limiting to one predictor per 10-20 survival "events" (deaths, recurrences) in standard Cox modeling, or using a penalized method like ridge regression. (Stata might not support ridge regression for survival models, however.)
If your "types" of cancer are clinically defined Stages, remember that each higher Stage is related to overall worse outcome based on studies of hundreds to thousands of patients. It's quite possible that a small study wouldn't have the power to distinguish survival of, say, Stage II from Stage III. If you have small numbers in each of 4 cancer Stages it might make sense--before you look at the survival results--to combine nearby Stages into groups (e.g., I+II versus III+IV) so that you compress the Staging information into only 2 categories.
If you really are looking at different types of cancer (e.g., lung versus breast versus prostate) you should think carefully about what you are trying to accomplish. Different types of cancers have different natural histories, standard therapies, and prognoses (the Stage numbers have different implications for survival depending on the type of cancer), so you might be better served by analyzing each type separately.