2

Is it important to scale all the features into a common range (normalized) when using random forests (bagging) in classification. Or can random forests handle features in different ranges without problems (bias to the larger values). Some features may have a value in the 1000-range and others in the 0-1 range.

andreSmol
  • 487
  • 1
  • 6
  • 14
  • 3
    There are similar questions on StackOverflow (http://stackoverflow.com/q/8961586/1414455) and Quora (http://qr.ae/80zE4). The short answer is that you don't. – tchakravarty Nov 03 '12 at 17:21
  • See https://stats.stackexchange.com/questions/255765/does-random-forest-need-input-variables-to-be-scaled-or-centered/255769#255769 – Firebug Aug 14 '18 at 22:17

1 Answers1

0

Partially answered in comments:

There are similar questions on StackOverflow (https://stackoverflow.com/questions/8961586/do-i-need-to-normalize-or-scale-data-for-randomforest-r-package) and Quora (https://www.quora.com/Machine-Learning/Should-inputs-to-random-forests-be-normalized?srid=3EJy&st=ns). The short answer is that you don't. – tchakravarty

kjetil b halvorsen
  • 63,378
  • 26
  • 142
  • 467