Questions tagged [party]

Questions related to regression/classification/model trees created with the R packages party and partykit for recursive partitioning.

56 questions
7
votes
2 answers

Variable importance in party vs randomForest

I am getting completely different results from cforest and randomForest with regards to variable importance (mean decrease in accuracy): require(randomForest) require(party) require(RCurl) x <-…
mguzmann
  • 575
  • 1
  • 6
  • 14
5
votes
1 answer

Evaluate glmtree model

I am using the glmtree function from the partykit package in R. I would like to know how I can evaluate the models and how I can improve them. I am growing a big tree (alpha = 0.9) and pruning with AIC as the criterion. I am using the AUC (pROC…
Dani
  • 51
  • 2
5
votes
1 answer

What is the test statistics used for a conditional inference regression tree?

In Hothorn et al, the test statistic is specified as $$ T_j(L_n, w) = vec(\sum w_i g_j(X_{ji}) h(Y_i, (Y_1,...,Y_n)^T))$$ What is the exact form of this test statistic with a continuous response and categories and numerical predictors?
goldisfine
  • 626
  • 7
  • 16
4
votes
1 answer

Why is random forest performing worse than decision tree

I have a data set with 1962 observations and 46 columns. Column 46 is the target with 3 classes 1, 2, 3. 6 of the other columns are nominal variables and the rest are ordinal variables. I have preprocessed them using as follows: for (i in…
Lyndt
  • 61
  • 3
4
votes
2 answers

Predictors in random Forest

I am building a random forest to predict a binary variable y. I have several predictors named x1..n. One predictor, lets say x1, is a very strong predictor of y but only in some cases (see below) while the others x2..n are fair predictors in all…
MassCorr
  • 173
  • 4
4
votes
1 answer

Pruning Conditional Inference Trees

I am trying to build a prediction model using classification trees. While I tried the "rpart" package, the results were not entirely satisfactory. Hence, I thought of exploring conditional inference trees as well ("party" package in R) Now, under…
Dataminer
  • 365
  • 3
  • 12
3
votes
2 answers

R result interpretation conditional inference tree result for nominal response

I have nominal responses, "yes/no/don't know", that I am using in a conditional inference tree in R. I am having trouble with how to interpret the model's output concerning one of the independent variables: gender. There are other independent…
shea
  • 141
  • 4
3
votes
2 answers

R: Cluster based on similar linear relationships

I'm looking for an unsupervised clustering technique available in R that will allow me to combine repeated measures I have taken at many independent sites, to form subgroups that have similar linear relationships (in terms of slope not intercept)…
Nick_89
  • 35
  • 6
3
votes
2 answers

Decision tree split vs importance

I recently created a decision tree model in R using the Party package (Conditional Inference Tree, ctree model). I generated a visual representation of the decision tree, to see the splits and levels. I also computed the variables importance using…
Mustard Tiger
  • 215
  • 2
  • 10
3
votes
1 answer

R Decision Tree based on imbalanced data which was up-sampled

This is a rather theoretical question, so I'm sorry if that's not appropriate to the platform. I have trained a decision tree (partykit) on an imbalanced data set, and to force the model to learn both positive and negative examples I have up-sampled…
NRG
  • 33
  • 4
3
votes
0 answers

cforest prediction taking too long

I am using the R-package party to build a random forest. The cforest function takes about 5 min to build a random forest model: cf.model <- cforest(as.factor(y) ~ ., data = train.data, controls = cforest_control(ntree = 25, maxdepth = 6)) However,…
Ashes
  • 31
  • 1
3
votes
0 answers

Missing value handling in cforest in R

I'm trying to build a random forest with 100k records and 2K variables. I have an imputation process to handle missing values while using randomForest but I want to understand what exactly happens to missing values in predictors in cforest. I looked…
2
votes
1 answer

Can model-based recursive partitioning accommodate survey weights?

I am using the model-based recursive partitioning algorithm described in Zeileis, Hothorn and Hornik (2008), available here: https://www.zeileis.org/papers/Zeileis+Hothorn+Hornik-2008.pdf I am using survey data, which requires that I use…
Kate
  • 23
  • 4
2
votes
1 answer

Influence function used in partykit for binary classification

What is the influence function used for binary classification in the R package partkit, specifically for the conditional tree (ctree). I could not find any details in the R package documentation. In the vigente I found this paragraph about…
Kozolovska
  • 1,027
  • 6
  • 11
2
votes
1 answer

Stability test of MOB algorithm (supLM)

I am interested in better understanding the M-fluctuation test of the MOB algorithm (Zeileis, Hothorn & Hornik, 2008). I have a question regarding the definition of the empirical fluctuation process, $W_{j}(t)$, and then the supLM test that is…
TTT
  • 229
  • 1
  • 8
1
2 3 4