A family of algorithms combining weakly predictive models into a strongly predictive model. The most common approach is called gradient boosting, and the most commonly used weak models are classification/regression trees.
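As a concrete illustration of that description, here is a minimal sketch of gradient boosting for squared-error regression, in which each shallow regression tree is fit to the residuals (the negative gradient) of the current ensemble. The weak learner, learning rate, and depth below are illustrative assumptions, not part of the tag definition.

import numpy as np
from sklearn.tree import DecisionTreeRegressor

def fit_gradient_boosting(X, y, n_trees=100, learning_rate=0.1, max_depth=3):
    # Start from the constant prediction, then repeatedly fit a shallow tree
    # to the current residuals and add a shrunken copy of it to the ensemble.
    baseline = y.mean()
    pred = np.full(len(y), baseline)
    trees = []
    for _ in range(n_trees):
        residuals = y - pred                      # negative gradient of 0.5 * (y - pred)^2
        tree = DecisionTreeRegressor(max_depth=max_depth).fit(X, residuals)
        pred += learning_rate * tree.predict(X)
        trees.append(tree)
    return baseline, trees

def predict_gradient_boosting(baseline, trees, X, learning_rate=0.1):
    # Sum the baseline and every tree's shrunken contribution.
    pred = np.full(X.shape[0], baseline)
    for tree in trees:
        pred += learning_rate * tree.predict(X)
    return pred

Keeping max_depth small is what makes each tree "weak"; the predictive strength comes from adding many such trees with a small learning rate.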
Questions tagged [boosting]
1296 questions
287 votes · 8 answers
Bagging, boosting and stacking in machine learning
What are the similarities and differences between these 3 methods:
Bagging,
Boosting,
Stacking?
Which is the best one, and why?
Can you give me an example of each?

Bucsa Lucian (2,979)
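For readers who want runnable starting points alongside the definitions asked for above, here is a minimal sketch of the three ensembles in scikit-learn. The library, base estimators, and settings are assumptions chosen for illustration, not part of the question.

from sklearn.datasets import make_classification
from sklearn.ensemble import (BaggingClassifier, GradientBoostingClassifier,
                              RandomForestClassifier, StackingClassifier)
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=2000, n_features=20, random_state=0)

models = {
    # Bagging: train the same learner on bootstrap resamples, then average the votes.
    "bagging": BaggingClassifier(DecisionTreeClassifier(), n_estimators=100, random_state=0),
    # Boosting: fit learners sequentially, each one correcting the current errors.
    "boosting": GradientBoostingClassifier(n_estimators=100, random_state=0),
    # Stacking: fit diverse learners, then a meta-model on their predictions.
    "stacking": StackingClassifier(
        estimators=[("rf", RandomForestClassifier(random_state=0)),
                    ("gb", GradientBoostingClassifier(random_state=0))],
        final_estimator=LogisticRegression(max_iter=1000)),
}

for name, model in models.items():
    print(name, cross_val_score(model, X, y, cv=5).mean())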
162 votes · 3 answers
Gradient Boosting Tree vs Random Forest
Gradient tree boosting as proposed by Friedman uses decision trees as base learners. I'm wondering whether we should make the base decision trees as complex as possible (fully grown) or simpler. Is there any explanation for this choice?
Random Forest is…

FihopZz (1,923)
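A quick way to see the contrast this question is about: boosting conventionally uses shallow (high-bias, low-variance) base trees, while a random forest grows deep trees and averages them. The scikit-learn setup below is an illustrative sketch with assumed parameter values, not taken from the question or its answers.

from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=5000, n_features=20, random_state=0)

# Gradient boosting: weak (shallow) base trees; strength comes from many stages.
gbm = GradientBoostingClassifier(n_estimators=300, max_depth=2, learning_rate=0.1)

# Random forest: strong (deep, low-bias) trees; variance is reduced by averaging.
rf = RandomForestClassifier(n_estimators=300, max_depth=None)

for name, model in [("gbm, shallow trees", gbm), ("rf, fully grown trees", rf)]:
    print(name, cross_val_score(model, X, y, cv=5).mean())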
71 votes · 4 answers
How to tune hyperparameters of xgboost trees?
I have class-imbalanced data & I want to tune the hyperparameters of the boosted trees using xgboost.
Questions
Is there an equivalent of gridsearchcv or randomsearchcv for
xgboost?
If not, what is the recommended approach to tune the
parameters…

GeorgeOfTheRF (5,063)
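For context: xgboost ships a scikit-learn-compatible wrapper (XGBClassifier), so scikit-learn's GridSearchCV and RandomizedSearchCV work with it directly. The grid values below are illustrative assumptions, not tuning recommendations.

from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, StratifiedKFold
from xgboost import XGBClassifier

# Synthetic imbalanced data, roughly 10% positives.
X, y = make_classification(n_samples=3000, n_features=20, weights=[0.9, 0.1],
                           random_state=0)

param_grid = {
    "max_depth": [3, 5, 7],
    "learning_rate": [0.05, 0.1],
    "n_estimators": [200, 400],
    "subsample": [0.8, 1.0],
}

search = GridSearchCV(
    XGBClassifier(),
    param_grid,
    scoring="roc_auc",      # a ranking metric is more informative than accuracy with imbalance
    cv=StratifiedKFold(n_splits=5, shuffle=True, random_state=0),
    n_jobs=-1,
)
search.fit(X, y)
print(search.best_params_, search.best_score_)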
62 votes · 7 answers
Why doesn't Random Forest handle missing values in predictors?
What are the theoretical reasons for not handling missing values? Gradient boosting machines and regression trees handle missing values. Why doesn't Random Forest do the same?

Fedorenko Kristina (723)
59 votes · 6 answers
Is random forest a boosting algorithm?
Short definition of boosting:
Can a set of weak learners create a single strong learner? A weak learner is defined to be a classifier which is only slightly correlated with the true classification (it can label examples better than random…

Atilla Ozgur (1,251)
56 votes · 4 answers
What is the proper usage of scale_pos_weight in xgboost for imbalanced datasets?
I have a very imbalanced dataset. I'm trying to follow the tuning advice and use scale_pos_weight, but I'm not sure how I should tune it.
I can see that RegLossObj.GetGradient does:
if (info.labels[i] == 1.0f) w *= param_.scale_pos_weight
so a gradient…

ihadanny (2,596)
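A common heuristic, also given in the xgboost documentation, is to start from scale_pos_weight ≈ (number of negative examples) / (number of positive examples) and then tune around that value. The sketch below assumes a synthetic dataset and the scikit-learn wrapper; it is an illustration, not the tuning advice the question refers to.

import numpy as np
from sklearn.datasets import make_classification
from xgboost import XGBClassifier

# Synthetic data with roughly 5% positives.
X, y = make_classification(n_samples=10000, weights=[0.95, 0.05], random_state=0)

n_neg, n_pos = np.bincount(y)
ratio = n_neg / n_pos                 # starting point for scale_pos_weight
print("scale_pos_weight heuristic:", ratio)

# scale_pos_weight multiplies the loss gradient (and hessian) of every positive
# example, which is the multiplication shown in the RegLossObj excerpt above.
clf = XGBClassifier(scale_pos_weight=ratio)
clf.fit(X, y)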
55 votes · 2 answers
Intuitive explanations of differences between Gradient Boosting Trees (GBM) & Adaboost
I'm trying to understand the differences between GBM & Adaboost.
This is what I've understood so far:
Both are boosting algorithms, which learn from the previous model's errors and finally make a weighted sum of the models.
GBM and Adaboost…

Hee Kyung Yoon (687)
47 votes · 1 answer
Explanation of min_child_weight in xgboost algorithm
The definition of the min_child_weight parameter in xgboost is given as the:
minimum sum of instance weight (hessian) needed in a child. If the tree partition step results in a leaf node with the sum of instance weight less than…

User123456789 (613)
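A practical way to read the definition quoted above: the hessian of the squared-error loss is 1 for every instance, so with reg:squarederror min_child_weight acts as a minimum number of samples per leaf; for binary:logistic the hessian is p(1-p), so leaves full of confident predictions "weigh" less. The code below is an illustrative sketch with assumed parameter values.

from sklearn.datasets import make_regression
from xgboost import XGBRegressor

X, y = make_regression(n_samples=2000, n_features=10, random_state=0)

# With squared-error loss each instance contributes a hessian of 1, so
# min_child_weight=50 means no leaf may contain fewer than ~50 samples here.
model = XGBRegressor(objective="reg:squarederror", min_child_weight=50, max_depth=6)
model.fit(X, y)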
44 votes · 3 answers
Gradient Boosting for Linear Regression - why does it not work?
While learning about Gradient Boosting, I haven't heard about any constraints regarding the properties of a "weak classifier" that the method uses to build an ensemble model. However, I could not imagine an application of a GB that uses linear…

Matek (749)
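One way to see the point behind this question numerically: a sum of linear models is still a linear model, so a boosting stage that fits a second linear regression to the residuals of the first adds essentially nothing. The sketch below assumes squared-error loss, a learning rate of 1, and synthetic data.

import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 5))
y = X @ rng.normal(size=5) + rng.normal(scale=0.1, size=500)

stage1 = LinearRegression().fit(X, y)
residuals = y - stage1.predict(X)
stage2 = LinearRegression().fit(X, residuals)     # boosting step on the residuals

boosted = stage1.predict(X) + stage2.predict(X)
single = LinearRegression().fit(X, y).predict(X)

# OLS residuals are orthogonal to the features, so the second stage learns
# (numerically) zero coefficients and the ensemble equals the single fit.
print(np.allclose(boosted, single))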
42 votes · 1 answer
Relative variable importance for Boosting
I'm looking for an explanation of how relative variable importance is computed in Gradient Boosted Trees that is not overly general/simplistic like:
The measures are based on the number of times a variable is selected for splitting, weighted by the…

Antoine (5,740)
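For orientation (not a replacement for the mathematical explanation the question asks for): scikit-learn exposes this impurity-based measure as feature_importances_, where each split's squared-error improvement is credited to the splitting variable, then averaged over trees and normalized to sum to 1. The data below is a synthetic assumption.

from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor

X, y = make_regression(n_samples=2000, n_features=8, n_informative=3, random_state=0)

model = GradientBoostingRegressor(n_estimators=200, random_state=0).fit(X, y)
for i, imp in enumerate(model.feature_importances_):
    print(f"feature {i}: {imp:.3f}")      # relative importances, summing to 1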
36 votes · 1 answer
Mathematical differences between GBM, XGBoost, LightGBM, CatBoost?
There exist several implementations of the GBDT family of models, such as:
GBM
XGBoost
LightGBM
CatBoost.
What are the mathematical differences between these different implementations?
CatBoost seems to outperform the other implementations even by…

Metariat (2,376)
36 votes · 2 answers
Is this the state-of-the-art regression methodology?
I've been following Kaggle competitions for a long time and I have come to realize that many winning strategies involve using at least one of the "big three": bagging, boosting and stacking.
For regressions, rather than focusing on building one best…

Maxareo (535)
35 votes · 3 answers
What algorithms need feature scaling, besides SVM?
I am working with many algorithms: RandomForest, DecisionTrees, NaiveBayes, SVM (kernel=linear and rbf), KNN, LDA and XGBoost. All of them were pretty fast except for SVM. That is when I learned that it needs feature scaling to work faster. Then…

Aizzaac (989)
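The usual rule of thumb behind this question: models based on distances or margins (SVM, KNN, and anything fit by regularized gradient descent) are sensitive to feature scale, whereas tree ensembles such as random forests and XGBoost split on thresholds and are invariant to monotone rescaling. The pipeline below is an assumed illustration, not taken from the question.

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X[:, 0] *= 1000                                   # put one feature on a wildly different scale

models = {
    "svm, unscaled": SVC(kernel="rbf"),
    "svm, scaled": make_pipeline(StandardScaler(), SVC(kernel="rbf")),
    "random forest": RandomForestClassifier(random_state=0),  # splits on thresholds, no scaling needed
}
for name, model in models.items():
    print(name, cross_val_score(model, X, y, cv=5).mean())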
35 votes · 1 answer
XGBoost Loss function Approximation With Taylor Expansion
As an example, take the objective function of the XGBoost model on the $t$'th iteration:
$$\mathcal{L}^{(t)}=\sum_{i=1}^n\ell(y_i,\hat{y}_i^{(t-1)}+f_t(\mathbf{x}_i))+\Omega(f_t)$$
where $\ell$ is the loss function, $f_t$ is the $t$'th tree output…

Alex R. (13,097)
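For readers skimming the listing, the step this question turns on is the second-order Taylor expansion of $\ell$ around the previous prediction $\hat{y}_i^{(t-1)}$, as used in the XGBoost paper:

$$\mathcal{L}^{(t)}\approx\sum_{i=1}^n\Big[\ell(y_i,\hat{y}_i^{(t-1)})+g_i\,f_t(\mathbf{x}_i)+\tfrac{1}{2}h_i\,f_t^2(\mathbf{x}_i)\Big]+\Omega(f_t),\qquad g_i=\partial_{\hat{y}_i^{(t-1)}}\ell(y_i,\hat{y}_i^{(t-1)}),\quad h_i=\partial^2_{\hat{y}_i^{(t-1)}}\ell(y_i,\hat{y}_i^{(t-1)}).$$

Dropping the constant $\ell(y_i,\hat{y}_i^{(t-1)})$ terms leaves a quadratic objective in $f_t$ that can be minimized leaf by leaf.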
33 votes · 1 answer
What are some useful guidelines for GBM parameters?
What are some useful guidelines for testing parameters (e.g. interaction depth, minchild, sample rate, etc.) using GBM?
Let's say I have 70-100 features, a population of 200,000 and I intend to test interaction depths of 3 and 4. Clearly I need to do…

Ram Ahluwalia (3,003)