You have been hired by a gem mining company to develop a classifier for gems as part of its automated sorting system.
You decided to use a neural network with one hidden layer. How would you go about determining the best number of hidden units to use in this layer?
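
One common way to approach this is to treat the number of hidden units as a hyperparameter and compare a few candidate sizes by k-fold cross-validation, keeping the smallest size that reaches the best validation accuracy. The sketch below illustrates that idea in plain Python. Everything in it is an assumption made for illustration: the synthetic XOR-like data stands in for real gem measurements, and the names make_data, train, cv_accuracy and select_hidden_units, the candidate sizes, the learning rate and the epoch count are all hypothetical.

```python
import math
import random


def make_data(n, rng):
    """Synthetic two-class data (hypothetical stand-in for gem features)."""
    data = []
    for _ in range(n):
        x = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        y = 1 if x[0] * x[1] > 0 else 0  # XOR-like, not linearly separable
        data.append((x, y))
    return data


def forward(model, x):
    """Return hidden activations and the output logit for one sample."""
    w1, w2 = model
    # The last entry of each weight row is the bias term.
    h = [math.tanh(sum(w * v for w, v in zip(row, x)) + row[-1]) for row in w1]
    z = sum(w * v for w, v in zip(w2, h)) + w2[-1]
    return h, z


def train(data, n_hidden, rng, epochs=100, lr=0.1):
    """Train a one-hidden-layer network with plain SGD on cross-entropy loss."""
    n_in = len(data[0][0])
    w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)] for _ in range(n_hidden)]
    w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]
    for _ in range(epochs):
        for x, y in data:
            h, z = forward((w1, w2), x)
            z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
            p = 1.0 / (1.0 + math.exp(-z))
            d_out = p - y  # gradient of the loss with respect to the logit
            for j in range(n_hidden):
                # Backpropagate through tanh before updating w2[j].
                d_h = d_out * w2[j] * (1.0 - h[j] ** 2)
                w2[j] -= lr * d_out * h[j]
                for i in range(n_in):
                    w1[j][i] -= lr * d_h * x[i]
                w1[j][-1] -= lr * d_h
            w2[-1] -= lr * d_out
    return w1, w2


def cv_accuracy(data, n_hidden, k=3, seed=0):
    """Mean validation accuracy over k folds for one hidden-layer size."""
    scores = []
    for fold in range(k):
        val = data[fold::k]
        trn = [d for i, d in enumerate(data) if i % k != fold]
        model = train(trn, n_hidden, random.Random(seed + fold))
        correct = sum((forward(model, x)[1] > 0) == (y == 1) for x, y in val)
        scores.append(correct / len(val))
    return sum(scores) / k


def select_hidden_units(data, candidates, k=3):
    """Pick the candidate with the best CV accuracy; ties go to fewer units."""
    results = {h: cv_accuracy(data, h, k) for h in candidates}
    best = max(results, key=lambda h: (results[h], -h))
    return best, results


data = make_data(120, random.Random(42))
best, results = select_hidden_units(data, [1, 2, 4, 8])
for h, acc in results.items():
    print(f"hidden units = {h}: mean validation accuracy = {acc:.3f}")
print("selected hidden units:", best)
```

In practice the chosen size would then be retrained on all of the training data and evaluated once on a test set that played no part in the selection, so that the reported accuracy is not biased by the search itself.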