I am working on a problem involving outliers detection and I found that it was possible to perform this using one-class SVM. I have been googling it and reading some blogs and papers, but I have a doubt it seems not to be solved elsewhere.
As far as I have read, this paper originated the idea of one class SVM, and in here it is said that the idea is to map the data into the feature space and to separate it from the origin with maximum margin. However, in other papers and blogs I read that the intuitive idea is to build an hyperplane as small as possible containing all the data, so the outliers fall on the other side of the hyperplane.
In my opinion, these descriptions (maximizing the distance to the origin on one side, but making the hyperplane as small as possible on the other side) are contradictory. What am I misunderstanding?