
I have a dataset with highly imbalanced classes (some classes have 4k examples, while others have only one). What is the best approach to handle this problem?

The traditional approaches are oversampling, undersampling, and weighting the loss function, but it seems to me that all of these approaches will still completely miss the minority classes. They work when there are at least several hundred examples from the minority class.
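To make the scale of the problem concrete, here is a minimal sketch of the loss-weighting idea, assuming inverse-frequency ("balanced") weights of the form `n_samples / (n_classes * class_count)`. The helper `balanced_class_weights` and the label names `"common"` and `"rare"` are hypothetical, chosen only for illustration.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Return inverse-frequency weights: n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Hypothetical two-class toy data mirroring the 4k-vs-1 imbalance described above.
labels = ["common"] * 4000 + ["rare"] * 1
weights = balanced_class_weights(labels)
print(weights)
```

With these counts the single rare example gets a weight 4000 times larger than a common one, so the loss contribution of that class depends entirely on one data point.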

Also, using a GAN to learn a generative model and generate new examples is not feasible when a class has only a single example in the training set.

tMan27
Unbalanced classes are almost certainly not a problem, and oversampling, undersampling or weighting will not solve this non-problem: [Are unbalanced datasets problematic, and (how) does oversampling (purport to) help?](https://stats.stackexchange.com/q/357466/1352) – Stephan Kolassa Nov 27 '21 at 06:51
