Activation functions of an Elman recurrent neural network

Question

According to the book "Fundamentals of Neural Networks", the input layer of a feedforward neural network has a linear activation function. An Elman recurrent NN is the same as a feedforward network except that it has a context layer. What should the activation function of each layer in an Elman recurrent NN be?

I have searched many sources for an answer, but most journal papers do not mention the activation functions. Some information online says that the hidden layers should use the tansig activation function.

So, which activation function should each layer of an Elman recurrent NN use?

score 1 · Answer 1 · edited Apr 13 '17 at 12:44

An Elman network leaves the choice of the activation function to the user, since it only specifies these equations for the recurrence:

\begin{align} h_t &= \sigma_h(W_{h} x_t + U_{h} h_{t-1} + b_h) \\ y_t &= \sigma_y(W_{y} h_t + b_y) \end{align}

Variables and functions:

$x_t$: input vector
$h_t$: hidden layer vector
$y_t$: output vector
$W$, $U$ and $b$: parameter matrices and bias vectors
$\sigma_h$ and $\sigma_y$: activation functions
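
To make the point concrete, here is a minimal pure-Python sketch of the recurrence above in which $\sigma_h$ and $\sigma_y$ are simply arguments passed in by the caller. The helper names (`matvec`, `elman_step`, `run`) and the toy parameter values are my own, chosen for illustration only; they do not come from Elman's paper or from any library.

```python
import math

def matvec(M, v):
    """Multiply a matrix M (list of rows) by a vector v."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in M]

def elman_step(x_t, h_prev, W_h, U_h, b_h, W_y, b_y, sigma_h, sigma_y):
    """One step of the recurrence; sigma_h and sigma_y are chosen by the caller."""
    # h_t = sigma_h(W_h x_t + U_h h_{t-1} + b_h)
    pre_h = [a + b + c
             for a, b, c in zip(matvec(W_h, x_t), matvec(U_h, h_prev), b_h)]
    h_t = [sigma_h(z) for z in pre_h]
    # y_t = sigma_y(W_y h_t + b_y)
    pre_y = [a + b for a, b in zip(matvec(W_y, h_t), b_y)]
    y_t = [sigma_y(z) for z in pre_y]
    return h_t, y_t

def identity(z):
    return z

def logistic(z):
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical toy parameters: 2 inputs, 2 hidden units, 1 output.
W_h = [[0.5, -0.3], [0.1, 0.8]]
U_h = [[0.2, 0.0], [0.0, 0.2]]
b_h = [0.0, 0.0]
W_y = [[1.0, -1.0]]
b_y = [0.0]

def run(xs, sigma_h, sigma_y):
    """Feed a sequence through the network, starting from a zero hidden state."""
    h = [0.0] * len(b_h)
    ys = []
    for x_t in xs:
        h, y = elman_step(x_t, h, W_h, U_h, b_h, W_y, b_y, sigma_h, sigma_y)
        ys.append(y)
    return ys

xs = [[1.0, 0.0], [0.0, 1.0]]
ys_tanh = run(xs, math.tanh, identity)      # tanh hidden layer, linear output
ys_logistic = run(xs, logistic, logistic)   # logistic hidden and output layers
```

Both calls use the same equations; only the two functions passed in differ. The `tanh` choice corresponds to what MATLAB calls `tansig`, which is probably where the advice you found online comes from.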

{1}, which is one of the most cited references defining an Elman network, doesn't seem to indicate that an Elman network should use some specific activation functions either.

FYI: see "Comprehensive list of activation functions in neural networks with pros/cons".

References:

{1} Elman, Jeffrey L. (1990). "Finding Structure in Time". Cognitive Science. 14 (2): 179–211. doi:10.1016/0364-0213(90)90002-E. https://www.cs.swarthmore.edu/~meeden/cs63/f07/elman.srn.pdf ; http://dx.doi.org/10.1207/s15516709cog1402_1

Yes, I agree, and thanks for replying. I saw these equations in most journal papers, but I needed some proof or a reliable source to confirm your answer :/ — user7085565, Dec 21 '16 at 19:30
