I'd like to classify short documents, from a predefined set of words.
What algorithm would you suggest, LDA or pLSA ?
My use case
I have a list of users, and for each user a list of the pages she likes.
My goal is to classify users (documents) into classes (topics).
The documents are short, since a user is not likely to to like more than dozens of pages, and there are approximately 100k pages I care about