I have observations X along with their labels Y. I then create a histogram of Y. I then remove observations such that the histogram still retains the same distinct shape.
Does anyone know what data reduction technique this is called?
I have observations X along with their labels Y. I then create a histogram of Y. I then remove observations such that the histogram still retains the same distinct shape.
Does anyone know what data reduction technique this is called?
What you're describing would qualify as a form of Stratified Random Sampling. (Though typically you'd stratify according to things like "Sex" and "Nationality" and not according to the bins of a histogram...)