I teach Introduction to Statistics and would like to show my students how to analyze the Pfizer Phase 3 Vaccine results. Testing a claim about 2 proportions is straightforward with the Pfizer data:
Given:
- Group (1) is the placebo group;
- Group (2) is the vaccine group
Data:
The number of people who got the 2nd dose of vaccine is 41,135 [A]. Assuming that people dropped out equally from the placebo and vaccine group, those groups were created at a 1:1 ratio [B]. So we have approximately 20,567 people in each group. Using the number of people who got covid from each group published in [A] we have:
Placebo Group: $x_1 = 162$, $n_1 = 20,567$
Vaccine Group (BNT162b2): $x_2 = 8$, $n_2 = 20,567$
Claim: $p_1 > p_2 \rightarrow p_1 - p_2 > 0$
$H_0: p_1 - p_2 \le 0$
$H_1: p1 - p2 > 0$
Test Statistic: $z = 11.8 \rightarrow \text{P-Value} < 0.0001$
Thus we have evidence for the claim that $p_1 > p_2$ meaning that we have evidence for the claim that the vaccine lowered the covid rate versus the control group.
So far, so good. But here's where things break down. The manufacturer's claim that the vaccine is shown to be 95% effective. If I use the 162 as the expected number of cases in general, then if the vaccine is 95% effective, the vaccine group should have less than 5% of the expected number of cases. 5% of 162 is 8.1 cases expected in 20567.
Control Group: $x_1 = 8.1, n_1 = 20,567$
Experimental Group: $x_2 = 8, n_2 = 20,567$
Claim: $p_1 > p_2 \rightarrow p_1 - p_2 > 0$
$H0: p_1 - p_2 \le 0$
$H1: p_1 - p_2 > 0$
Test Statistic: $0.0249 \rightarrow \text{P-Value} = 0.4901.$
This is clearly a fail meaning that with a P-Value of 0.4901, I have NOT shown that the vaccine group has less than 5% of the expected number of cases. Why is this analysis flawed? This seems related to (What does 94.5% effective mean?), but even reading this citation, its unclear to me why my hypothesis test is wrong. Again, an explanation appropriate for an INTRODUCTION to statistics class, please.
UPDATE: If I redo the analysis with an expectation of 16.2 cases instead of 8.1, then I get a p-value of 0.0477. This suggests that I can claim with a 5% significance level that the vaccine is 90% effective. Is the issue just that they can do a more detailed/sophisticated analysis to get to 95% effective or is my analysis just flawed and I've gotten lucky?