I have a dataframe as follows:
New_Text | New_Score
review1 | Positive
review2 | Negative
review4 | Positive
... and so on.
I want to create a model that tells whether a review is Positive
or Negative
I have been asked to use only 30% of the data as training data and the rest as test data.
Now, I can't use a simple Naive Bayes Classifier
or Support Vector Machine
because the training data is very little and the test data is very high? How to do text classification in such a case?