I'm learning NN and trying to understand the implication that architecture selection has in the model. Am I inferring correctly that:
- Wider neural networks can approximate more interactions between input variables
- Deeper neural networks can model more complex nonlinearities