I've read this regarding the difference between epoch and mini-batch.
To clarify:
- With an epoch value of 1000 and a batch size of 50, does that mean that the model will use each data point exactly 1000 times, in such a (random) order that at each iteration only 50 data points are used for optimization? (meaning a total of 50*1000 calculations?)
- Is every data point used exactly 1000 times?
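
To make the scenario concrete, here is a minimal sketch that counts how often each data point is visited. It assumes the common convention that one epoch is one full shuffled pass over the dataset, split into consecutive mini-batches; actual frameworks may differ (for example, by sampling with replacement or dropping a last partial batch). The dataset size of 200 and the helper name `count_usage` are hypothetical, since the question does not give a dataset size.

```python
import random
from collections import Counter


def count_usage(n_points, batch_size, epochs, seed=0):
    """Count visits per data point and total iterations (hypothetical helper).

    Assumes one epoch = one full shuffled pass over all n_points.
    """
    rng = random.Random(seed)
    usage = Counter()
    iterations = 0
    indices = list(range(n_points))
    for _ in range(epochs):
        rng.shuffle(indices)  # new random order every epoch
        for start in range(0, n_points, batch_size):
            batch = indices[start:start + batch_size]  # one mini-batch
            usage.update(batch)  # each index in the batch is used once
            iterations += 1  # one optimization step per mini-batch
    return usage, iterations


# Assumed dataset size: 200 points (not stated in the question).
usage, iterations = count_usage(n_points=200, batch_size=50, epochs=1000)
print(iterations)           # 200 / 50 = 4 batches per epoch, times 1000 epochs
print(set(usage.values()))  # distinct visit counts across all data points
```

Under these assumptions, the number of iterations depends on the dataset size (batches per epoch times epochs), not on `50*1000` alone, and every data point is visited once per epoch.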