NN: Should we apply weight decay to the bias?

Asked Aug 24 '14 at 06:58

Active Aug 24 '14 at 06:58

Viewed 1,262 times

In CS294A lecture notes, Andrew Ng writes (about autoencoders): "Usually weight decay is not applied to the bias terms... Applying weight decay to the bias units usually makes only a small different to the final network, however".

Is there any particular reason for which we shouldn't apply weight decay to the bias terms? Does it reduce the performance of the network?

asked Aug 24 '14 at 06:58

user54593

4

Possible duplicate of [No regularisation term for bias unit in neural network](https://stats.stackexchange.com/questions/153605/no-regularisation-term-for-bias-unit-in-neural-network) – Sycorax Aug 13 '18 at 01:23

NN: Should we apply weight decay to the bias?

0 Answers0