Is there a bag of tricks for fully-connected networks training on tabular data?

Question

There are lots of discussions and research about tips and tricks that are helpful for convolutional networks training. (Like in this paper). The new architectures/optimizers/layers emerge very often, especially in the fields of computer vision and NLP. I wonder is there something similar for the fully connected networks applied to the tabular/structured data classification?

The most widely used approach I see is to use embeddings layers, concatenate them with numerical input, and pass it through the stack of fully-connected layers (probably with dropouts and batch normalization). Are there any other recommendations or state-of-the-art methods applied for the tabular data, any custom architectures suitable for this task? Or the "simple" approach outlined above is the best available solution?

I understand that people mostly use boosted trees or linear models in this case but I would like to investigate possible solutions available in the Deep Learning world.

While the focus is on debugging networks, there are a number of tricks suggested here: https://stats.stackexchange.com/questions/352036/what-should-i-do-when-my-neural-network-doesnt-learn/352037#352037 — Sycorax, Jan 27 '20 at 16:33
Very thorough and elaborated post! Thank you. I'll definitely go through it. — devforfu, Jan 29 '20 at 15:13

Dikran Marsupial · Accepted Answer · 2020-01-30T07:47:18.873

3

Yes, there are lots of them. I haven't read the second edition (published 2012), but the first is very good.

Somewhat old now, but the Comp.ai.neural-nets FAQ list also has plenty of good advice that is still valid today.

edited Jan 30 '20 at 07:47

answered Jan 27 '20 at 15:26

Dikran Marsupial

46,962
5
121
178

Great, thank you! Actually, some time ago I've already encountered the reference to this book somewhere but didn't pay proper attention to it :) – devforfu Jan 29 '20 at 15:10

Is there a bag of tricks for fully-connected networks training on tabular data?

1 Answers1