Based on the book "An Introduction to Statistical Learning" I learned that the CV estimate for test MSE has an typical U shape: an increasing degree in the polynomial fits the data better initially but after some point it starts to overfit the training data, which increases the estimated test MSE and gives the U shape.
In my case, the 10-fold CV test MSE estimate is monotonically decreasing:
I am modeling transit speed across a border and I have data from several months. A scatter plot of speed vs time of the day is like this:
where minutesDay is the number of minutes since midnight. One can see that there is a jam between 7am and 10am.
Why does the test MSE estimate decreases monotonically?
Is it because the dataset contains an important fraction of almost-duplicated observations (everyday the observations follow the same pattern), so there are high chances of getting a test partition that is very similar to the training partitions?