I am using fastai to create a text classifier that labels texts as either 0 or 1.
My training data is balanced (equal numbers of 1s and 0s), and I got an accuracy of 85%.
To test, I used a new, unseen corpus that is not balanced, to try to mimic a real-world scenario, and produced a confusion matrix.
According to my confusion matrix, my precision and recall are really low, at around 20 and 14 percent respectively.
What are possible reasons for that? What can I do to improve these metrics?
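
For reference, here is a minimal sketch of how precision and recall for the positive class (label 1) follow from a 2x2 confusion matrix. It assumes the common layout where rows are true labels and columns are predicted labels. The counts and the helper name `precision_recall` are hypothetical placeholders for illustration, not the actual results from this experiment.

```python
# Hypothetical counts for illustration only (NOT the real results).
# Assumed layout: rows are true labels, columns are predicted labels.
confusion = [
    [900, 80],  # true 0: [true negatives, false positives]
    [17, 3],    # true 1: [false negatives, true positives]
]


def precision_recall(cm):
    """Return (precision, recall) for the positive class (label 1)."""
    tn, fp = cm[0]
    fn, tp = cm[1]
    # Precision: of everything predicted as 1, how much was really 1.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: of everything that was really 1, how much was predicted as 1.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


precision, recall = precision_recall(confusion)
print(f"precision={precision:.3f} recall={recall:.3f}")
```

Note that with an imbalanced test set, even a modest false-positive rate on the majority class can produce more false positives than true positives, which pulls precision down regardless of the accuracy seen on balanced data.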