
I'm training a multi-label image classifier on a dataset of 4096 images, using a DenseNet-based network and an SGD optimizer. The pink curve uses a learning rate of 0.01 and the blue curve uses a learning rate of 0.001; both use a momentum of 0.9. What is this a symptom of? Do I need more data?

[Figure: eval vs. train]
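For concreteness, here is a minimal sketch of how the two runs differ. It is not my actual training code: the loss is a toy quadratic standing in for the DenseNet, the function name `final_loss` is made up for illustration, and I'm assuming the common momentum formulation v = momentum * v + grad, w = w - lr * v. The only thing that changes between the pink and blue runs is `lr`.

```python
# Toy illustration only: SGD with momentum on the quadratic loss 0.5 * w**2.
# Assumption: momentum update is v = momentum * v + grad; w = w - lr * v.
# The real model (DenseNet, multi-label loss, 4096 images) is NOT reproduced here.

def final_loss(lr, momentum=0.9, steps=100, w0=1.0):
    """Run `steps` momentum-SGD updates and return the final loss."""
    w = w0
    v = 0.0
    for _ in range(steps):
        grad = w                  # d/dw of 0.5 * w**2
        v = momentum * v + grad   # accumulate velocity
        w = w - lr * v            # parameter step scaled by the learning rate
    return 0.5 * w * w


if __name__ == "__main__":
    # The two settings from the question: pink (0.01) and blue (0.001).
    for name, lr in [("pink", 0.01), ("blue", 0.001)]:
        print(name, lr, final_loss(lr))
```

On this toy problem the larger learning rate simply converges faster; it says nothing about the train/eval gap itself, which depends on the real data and model.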
