Desirable property of gradients and activations during deep network initialisation

Asked Aug 23 '21 at 13:24

When initialising a deep network before training, what statistical property of the gradients and of the activations is desirable?

user332214

0 Answers