1

I'm trying to develop a model in R that will compare a customer review with a summary of that review that is completed by an employee. The purpose is to ensure that the employee is accurately tagging and summarizing the customer review.

In more detail:

  1. A customer leaves a review
  2. Employee reads the review and creates a summary of the review
  3. Employee creates tags such - technical problem, billing issue, etc.

Which NLP method would be best to compare the review with each summary? Is this something that's even possible?

I don't think Bag-of-Words would be useful. I understand that doc2vec can be used compare texts and find the texts that are the most similar to one another. However, how do I measure how accurately a summary reflects its original review?

Say I have 1000 reviews: Is the only way to do this if I:

  1. Manually read 500 reviews and summaries, and tag each summary as 1=good summary or 2 = bad summary.
  2. Train the model using some NLP tool.
  3. Test the model on the remaining 500 reviews and summaries?

I do not have sample data at this time. I am only doing literature review to see if this is possible.

Thanks!

pr478
  • 11
  • 1

0 Answers0