NOT A DUPLICATE. These questions are related, but mine is asking about a specific application of classifiers - flagging Stack Exchange posts. I want to know which of the 2 methods is most effective for THIS TYPE OF JOB. This question is different also because the true/false classifiers are run in a specific order (most priority to least priority). UNMARK AS DUPLICATE!
I want to write a program that can automatically review the Stack Overflow Triage queue. It will classify text as "Looks OK," "Requires Editing," and "Unsalvageable." If it's unsalvageable, it will then classify the text into a specific close reason.
I've come up with 2 methods to do this:
Use multilabel classification to classify the text into one of the closure categories. (No category has priority over another)
Go through the close reasons individually, in a hardcoded order, and decide if it belongs in the category or not. If it belongs, it flags the question. If it doesn't, it moves to the next close reason and repeats. If it hasn't been identified by the end, the question is determined to be good-quality. (The order in which the program checks the classes is important)
I don't know which way is more effective. I thought of the 2nd way because a question that is both off-topic and offensive should be flagged just as offensive. Therefore, "offensive" has a higher priority than "off-topic."
Priority order:
- Offensive?
- Spam?
- Very Low Quality?
- Blatantly Off-Topic?
- Unclear?
- Recommendation?
- Super User/Server Fault
- Opinionated?
- Too Broad?
- Vague Debug?
- No Repro/Typo?
- Requires Editing/Looks OK
Which of the 2 algorithms gets the job done better?