
I want to understand the logic behind defining ReLU as $\max(0,x)$ rather than $\min(0,x)$.

Why do we prefer positive inputs over negative ones?

jsdbt

1 Answer


The weights learned in a neural network can be both positive and negative, so in effect either form would work. Since $\min(0,x) = -\max(0,-x)$, negating a unit's incoming weights (and bias) and its outgoing weights turns the $\min$ form into exactly the same function as the $\max$ form. The $\max$ form is used purely by convention.
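A quick numerical check of this equivalence, sketched in NumPy (the layer sizes, weight names, and random values here are only illustrative):

```python
import numpy as np

# ReLU in its usual max form and the mirrored min form.
def relu_max(x):
    return np.maximum(0.0, x)

def relu_min(x):
    return np.minimum(0.0, x)

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(4, 3)), rng.normal(size=4)   # hidden layer
W2, b2 = rng.normal(size=(1, 4)), rng.normal(size=1)   # output layer
x = rng.normal(size=3)

# Standard network: y = W2 max(0, W1 x + b1) + b2
y_max = W2 @ relu_max(W1 @ x + b1) + b2

# Same function with min: negate the hidden layer's incoming weights/bias
# and its outgoing weights, using min(0, -z) = -max(0, z).
y_min = (-W2) @ relu_min(-(W1 @ x + b1)) + b2

print(np.allclose(y_max, y_min))  # True
```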

AaronDefazio
  • Can I keep it as $x$ only? Sparsity can anyway be induced by dropout. (PS Ignoring the non-linearity that $max$ or $min$ form would introduce in the system) – jsdbt Apr 01 '17 at 07:23
  • 2
    Without non-linearity, your network will compute just some linear function. No need to make it deep or anything. – Yuval Filmus Apr 01 '17 at 12:54
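To make the point in the last comment concrete: with the identity activation, stacking layers collapses into a single affine map, so depth buys nothing. A minimal NumPy sketch (shapes and names are illustrative):

```python
import numpy as np

rng = np.random.default_rng(1)
# Two stacked layers with identity "activation", i.e. no non-linearity.
W1, b1 = rng.normal(size=(5, 3)), rng.normal(size=5)
W2, b2 = rng.normal(size=(2, 5)), rng.normal(size=2)
x = rng.normal(size=3)

# The "deep" network W2 (W1 x + b1) + b2 ...
deep = W2 @ (W1 @ x + b1) + b2

# ... equals a single affine layer with W = W2 W1 and b = W2 b1 + b2.
W, b = W2 @ W1, W2 @ b1 + b2
shallow = W @ x + b

print(np.allclose(deep, shallow))  # True
```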