I am trying to implement an exponential learning rate decay with the Adam optimizer for an LSTM. I do not want the staircase=True version. To me, decay_steps feels like the number of steps for which the learning rate stays constant, but I am not sure about this, and the TensorFlow documentation does not state it. Any help is much appreciated.
Just so you know, Adam already handles learning rate optimization. – ARAT Jan 10 '19 at 21:17
1 Answer
As mentioned in the source of the function, decay_steps relates to the decayed learning rate as follows:
decayed_learning_rate = learning_rate * decay_rate ^ (global_step / decay_steps)
Hence, decay_steps is not the number of steps the learning rate stays constant: with staircase=False the rate shrinks smoothly at every single step, and decay_steps is simply the number of steps after which the rate has been multiplied by decay_rate exactly once. You should therefore set decay_steps in proportion to how far global_step will run, i.e. to the total number of training steps.
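
For concreteness, here is a minimal sketch of wiring such a schedule into AdamOptimizer with the TF 1.x API (which matches the era of this question); the 0.001 / 1000 / 0.96 values and the toy loss are just placeholder assumptions for illustration:

```python
import tensorflow as tf  # TF 1.x API

# Toy model just to have a loss tensor; the schedule itself is the point.
w = tf.Variable(5.0)
loss = tf.square(w)

global_step = tf.Variable(0, trainable=False, name="global_step")

# With staircase=False the decay is applied at every step, so the learning
# rate is never held constant; decay_steps=1000 means the rate has been
# multiplied by decay_rate once after 1000 steps (by 0.96^2 after 2000, etc.).
learning_rate = tf.train.exponential_decay(
    learning_rate=0.001,
    global_step=global_step,
    decay_steps=1000,
    decay_rate=0.96,
    staircase=False)

optimizer = tf.train.AdamOptimizer(learning_rate=learning_rate)
# Passing global_step makes minimize() increment it on every update,
# which is what drives the schedule forward.
train_op = optimizer.minimize(loss, global_step=global_step)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for _ in range(3):
        _, lr = sess.run([train_op, learning_rate])
        print(lr)  # shrinks slightly on every step
```

With these placeholder values the printed rate follows 0.001 * 0.96^(step/1000), so it is about 0.00096 after 1000 updates and roughly 0.000922 after 2000. In TF 2.x the rough equivalent is a tf.keras.optimizers.schedules.ExponentialDecay schedule passed as the learning_rate of tf.keras.optimizers.Adam.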

OmG