1

I have the same question as in this post: Dropout: scaling the activation versus inverting the dropout but for alpha dropouts: I would like to know if I need to apply the scale factor of $p$ when applying a prediction (not the training)?

Jan Kukacka
  • 10,121
  • 1
  • 36
  • 62

0 Answers0