I have a large scatterplot, with about 100,000 (x,y) points. The x coordinate is the set of numbers from (1 to ~100,000) - in other words, no 2 points have the same x-coordinate. The y is mostly constant (around 50-70 in value), but there are key "regions" where the y value spikes to ~120 or drops to ~20. How would I statistically differentiate these regions?
Clustering? Any other ideas?
As a bonus, if you know R, if you could reference a certain method that would be helpful as well.