(Because of the sensitive nature of the actual project, I am using an analogy here. I hope it's clear, if not, please let me know!)
My goal is to classify images as cats or dogs (binary classification). I have a large data dataset with images of cats and dogs. We know cats and dogs both come in the same 10 different colors. Our dataset contains many, many examples of dogs in all colors, but relatively few (let's say 0.1%) examples of cats and in only 3 colors.
Our model, in real life, will encounter images of cats of all colors. Additionally, we expect about 5% of images to be a cat.
How can I prepare my train, development, and test sets to make sure a model can learn to generalize to recognize cats of all colors?