currently I am using the mboost R-package to estimate some additive models. When using the function gamboost(), you can control the hyper-parameters for boosting by using the option boost_control(). One argument of this option is related to the empirical risk, and there are three alternatives to compute it, "inbag", "oob" and "none". The following is the code I have used:
model <- gamboost(income,
data = datafit,
control = boost_control(mstop = 10000, nu = 0.1,
trace = TRUE),
weights = datafit$factor, # from survey design
family = QuantReg(tau = 0.05))
My questions are: 1.- I would like to know the differences between using "inbag", "oob" and/or "none" and when it is suggested to use each one. 2.- In the case of "oob", you have the option of introducing an extra vector in oobweights for the out-of-bag weights, how should this vector be? I have seen applications with only 0's and 1's but I have not seen a document about the proportion of 0's and 1's that the vector should have nor if I should also introduce here the weights from the sampling design (A priori, I have introduced this information in the option weights)