Given two empirical distribution A and B, how to setup a hypothesis test to test if a new data point is from distribution A?

Asked Apr 02 '17 at 06:39

Active Aug 01 '17 at 13:53

Viewed 79 times

I have two empirical distribution A and B (i.e. two continuous datasets in $\mathcal{R}$) and I am trying to set up a hypothesis test to test if a new data point is from distribution A. Naturally this calls for a likelihood ratio test. I was planning on setting the test as $$ H_0: \text{the new data point is from distribution A} $$ $$ H_1: \text{the new data point is from distribution B} $$ The likelihood ratio can be computed as $$ \frac{P(H_0|x)}{P(H_1|x)} = \frac{P(H_0)}{P(H_1)}\frac{P(x|H_0)}{P(x|H_1)} $$ However, I am not sure how I would compute the likelihood $P(x|H_0)$ and $P(x|H_1)$ here, given that A and B are empirical distribution in continuous space. I could compute the empirical CDF but I am not sure how to convert that to an empirical PDF. Should I consider fitting some kind of kernel density and use them as the likelihood?

edited Aug 01 '17 at 13:53

Firebug

15,262
5
60
127

asked Apr 02 '17 at 06:39

Jim He

Given two empirical distribution A and B, how to setup a hypothesis test to test if a new data point is from distribution A?

0 Answers0