I read somewhere that if the number of dimensions in your feature set is very high, then a non-linear kernel such as RBF (or any other) may not help in increasing accuracy compared to a linear kernel.
What is an intuitive reason for this?
The same post (I think it was one of the answers on CrossValidated) mentioned that this is a typical case when working with textual data as the number of features are usually very high.