What is the best way to combine outputs from a binary classifier, which outputs probabilities, and is applied to a sequence of non-iid inputs?
Here's a scenario: Say I have a classifier which does an OK, but not great, job of classifying whether or not a cat is in an image. I feed the classifier frames from a video, and get as output a sequence of probabilities, near one if a cat is present, near zero if not.
Each of the inputs is clearly not independent. If a cat is present in one frame, it's most likely it will be present in the next frame as well. Say I have the following sequence of predictions from the classifier (obviously there are more than six frames in one hour of video)
- 12pm to 1pm: $[0.1, 0.3, 0.6, 0.4, 0.2, 0.1]$
- 1pm to 2pm: $[0.1, 0.2, 0.45, 0.45, 0.48, 0.2]$
- 2pm and 3pm: $[0.1, 0.1, 0.2, 0.1, 0.2, 0.1]$
The classifier answers the question, "What is the probability a cat is present in this video frame". But can I use these outputs to answer the following questions?
- What is the probability there was a cat in the video between 12 and 1pm? Between 1 and 2pm? Between 2pm and 3pm?
- Given say, a day of video, what is the probability that we have seen a cat at least once? Probability we have seen a cat exactly twice?
My first attempts at this problem are to simply threshold the classifier at say, 0.5. In which case, for question 1, we would decide there was a cat between 12 and 1pm, but not between 1 to 3pm, despite the fact that between 1 and 2pm the sum of the probabilities is much higher than between 2 and 3pm.
I could also imagine this as a sequence of Bernoulli trials, where one sample is drawn for each probability output from the classifier. Given a sequence, one could simulate this to answer these questions. Maybe this is unsatisfactory though, because it treats each frame as iid? I think a sequence of high probabilities should provide more evidence for the presence of a cat than the same high probabilities in a random order.