I have the following challenge: The dataset has one dependent and one independent variable which are connected in a non-linear fashion. I am trying to give a more qualitative picture here because I am unsure how to handle the dataset.
Basically there are five regions from "one end to the other":
- "Unstable" relationship with "too few" data points
- Negative relationship with the dependent variable in negative territory
- Negative relationship with the dependent variable around "0" (seems also "unstable")
- Negative relationship with the dependent variable in positive territory
- "Unstable" relationship with "too few" data points
I differentiated regions 2-4 because it makes a difference concerning the actions to be taken for the different regions.
My question
I need an algorithm (best an implementation/tool) with which I can make the above mentioned regions exact (it is ok to stay with 5 regions). The width of the regions is not equal so the cutoff-points have to be optimized on basis of "distinctiveness". I thought of piecewise regression or optimal width histogram or even some more elusive clustering algorithms but I haven't gotten very far with these approaches (might be my fault). Perhaps you can share some ideas where to start.