A very simple model:
Given N labeled training examples from two classes (say red and green), we use the distance from means approach to learning a classifier - that classifies new points as either red or green. The idea is to predict the class that has a closer mean.
Mathematically, it can be shown that this model learns a hyperplane decision boundary.
One of my university's course's lecture content remarks that - "This simple approach, if using Euclidean distances, can only learn linear decision boundaries. A reason: The basic approach implicitly assumes that classes are roughly spherical and equi-sized."
What is the reason and intuition behind the classes being roughly spherical and equi-sized? Could someone please explain? Thanks a lot!