I am training an autoencoder which takes sampled time series sensor data in range [-1024,1024] (0 values is possible). I use mean_squared loss and Adam optimizer. During the training MAE decreases and the network converges after 150 epochs. MAE starts from 9 and converges to 3 on validation set.
However, looking at MAPE the values does not make sense at all. The MAPE is in order [2.9,4.1]x10^6.
I assume MAPE is not a good metric for this dataset but what would be the explanation the very very high MAPE values?