I would like to fit a single tree. In the h2o R package, I can use h2o.randomForest()
with the following options:
h2o.randomForest(y = y, x = x, training_frame = data,
ntrees = 1,
mtries = number_of_predictors_here,
sample_rate = 1)
I have a question about sample_rate
meaning. I assume in case I specify sample_rate = 1
, it will use all the data. Is this correct? Or it will still do sampling with replacement?
Will this approach provide a correct way to fit a single tree?